Wiktionary data extraction errors and warnings

Even inflections

Word     Part of speech  Count  Form      Tags          Other examples (may be other parts of speech)

мудан    noun            351    mudan     romanization

ӈэлдэй   verb            336    ӈэ̄лдэ̄й    canonical
         verb            336    ŋə̄ldə̄j    romanization

х        character       27     h         romanization
         character       27     Х         uppercase

ш        character       5      ш (s)     canonical
         character       5      Ш         uppercase

ʒ        character       5      Ʒ         uppercase

B        character       2      ʙ         lowercase

А        character       1      А (A)     canonical
         character       1      а         lowercase

ӈ        character       1      ŋ         romanization
         character       1      Ӈ         uppercase
         character       1      ң         alternative

оран     noun            1      oran      romanization
         noun            1      орар      plural

нэдэй    verb            1      nədəj     romanization
         verb            1      нэ̄дэ̄й     alternative


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-11-18 from the enwiktionary dump dated 2025-11-01 using wiktextract (22806f4 and a050b89). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
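
For readers working from the raw wiktextract data rather than this page, the following is a minimal sketch of how rows like those in the table above could be produced. It assumes the input is JSON Lines with one entry per line, and that each entry carries `word`, `pos`, and a `forms` list whose items have a `form` string and a `tags` list. The function name `iter_form_rows` and the sample entry are hypothetical, for illustration only, and the counting and grouping shown on this page are not reproduced.

```python
import json
from typing import Iterable, Iterator, Tuple

Row = Tuple[str, str, str, str]


def iter_form_rows(jsonl_lines: Iterable[str]) -> Iterator[Row]:
    """Yield (word, part of speech, form, tags) for every listed form.

    Hypothetical helper: assumes each non-empty line is one JSON entry
    with "word", "pos" and an optional "forms" list.
    """
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        entry = json.loads(line)
        for form in entry.get("forms", []):
            yield (
                entry.get("word", ""),
                entry.get("pos", ""),
                form.get("form", ""),
                " ".join(form.get("tags", [])),
            )


# Illustrative entry built by hand from the table above, not read from a dump.
sample = [
    json.dumps(
        {
            "word": "оран",
            "pos": "noun",
            "forms": [
                {"form": "oran", "tags": ["romanization"]},
                {"form": "орар", "tags": ["plural"]},
            ],
        },
        ensure_ascii=False,
    )
]

rows = list(iter_form_rows(sample))
for row in rows:
    print("\t".join(row))
```

In practice the lines would come from an opened dump file instead of the in-memory `sample` list; entries without a `forms` key simply yield no rows.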

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.