Go to languages sorted by count
Extraction errors and warnings
To download the full raw data that was extracted from Wiktionary using wiktextract, please see the raw data download page.
DEPRECATED: To download the post-processed data used on this site for the kaikki.org machine-readable dictionary, please look for the download links near the end of the main page for each language (or the all languages combined page) and various subpages.
This data is extracted from Wiktionary and is updated regularly. The full original Wiktionary data can be downloaded from Wikimedia dumps. This data is made available under the same licenses as Wiktionary - both CC-BY-SA and GFDL. See Wiktionary copyright page for more information.
This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-10-22 from the thwiktionary dump dated 2025-10-20 using wiktextract (da1f971 and f26afeb). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.