Raw data downloads extracted from Wiktionary

This page contains download links for the raw data extracted from Wiktionary using Wiktextract. This data is updated regularly (usually at least once a week). The current version was extracted from the enwiktionary dump dated 2021-11-20. It contains data for hundreds of languages, and has glosses and other metadata in English. The data formats are documented at https://github.com/tatuylonen/wiktextract.

For post-processed data, please look at the download links at the end of the main page for each language (or the page for all languages combined) under https://kaikki.org/dictionary/.


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2021-11-26 from the enwiktionary dump dated 2021-11-20 using wiktextract.