Wiktionary data extraction errors and warnings

Mandarin inflections

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Qiánwéi romanization 62731 ㄑㄧㄢˊ ㄨㄟˊ bopomofo

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 2118 yuān Pinyin error-unknown-tag
character 2118 yüan¹ Pinyin Wade-Giles

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
mai3 romanization 1657 mai³ canonical
romanization 1657 ㄇㄞˇ bopomofo

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Zhāng romanization 1284 zhang¹ alternative
romanization 1284 ㄓㄤ bopomofo

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 535 chàn Pinyin error-unknown-tag
character 535 ch'an⁴ Pinyin Wade-Giles
character 535 tien¹ Pinyin
character 535 t'an⁴ Pinyin

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 8 Pinyin error-unknown-tag

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
lv3 romanization 8 lv³ canonical

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 6 miè Pinyin

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 5 uppercase

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 4 ʂ lowercase

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 1 蒒 Teochew si5 canonical
character 1 shī Pinyin error-unknown-tag
character 1 shih¹ Pinyin Wade-Giles

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 1 jiè Pinyin
character 1 zǔn Pinyin
character 1 chieh³ Pinyin Wade-Giles
character 1 tsun³ Pinyin


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2024-05-05 from the enwiktionary dump dated 2024-05-02 using wiktextract (f4fd8c9 and c9440ce). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
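
For readers working from the raw wiktextract data rather than this page, the following is a minimal sketch of how forms carrying a particular tag (for example error-unknown-tag) might be located, and how entries might be grouped by their form-tag pattern. It assumes the raw data is JSON Lines, one entry per line, with "word", "pos" and "forms" fields, where each form has a "form" string and a "tags" list. The function names and the sample lines are hypothetical. The word EXAMPLE is a placeholder, because the word column is empty for that row in the table above, and splitting "Pinyin error-unknown-tag" into two separate tags is likewise an assumption. This is not necessarily how the counts on this page were produced.

```python
import json
from collections import Counter
from typing import Iterable, Iterator, Tuple


def iter_entries(lines: Iterable[str]) -> Iterator[dict]:
    """Parse JSON Lines input, skipping blank lines."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def forms_with_tag(entries: Iterable[dict], tag: str) -> Iterator[Tuple[str, str, str]]:
    """Yield (word, pos, form) for every form that carries the given tag."""
    for entry in entries:
        for form in entry.get("forms", []):
            if tag in form.get("tags", []):
                yield entry.get("word", ""), entry.get("pos", ""), form.get("form", "")


def tag_pattern_counts(entries: Iterable[dict]) -> Counter:
    """Count entries by (pos, tuple of space-joined tag strings, one per form)."""
    counts: Counter = Counter()
    for entry in entries:
        pattern = tuple(" ".join(form.get("tags", [])) for form in entry.get("forms", []))
        counts[(entry.get("pos", ""), pattern)] += 1
    return counts


# Hypothetical sample lines modelled on rows shown on this page.
# "EXAMPLE" is a placeholder word, not taken from the data.
SAMPLE_LINES = [
    '{"word": "mai3", "pos": "romanization", "forms": ['
    '{"form": "mai³", "tags": ["canonical"]}, '
    '{"form": "ㄇㄞˇ", "tags": ["bopomofo"]}]}',
    '{"word": "EXAMPLE", "pos": "character", "forms": ['
    '{"form": "yuān", "tags": ["Pinyin", "error-unknown-tag"]}, '
    '{"form": "yüan¹", "tags": ["Pinyin", "Wade-Giles"]}]}',
]

if __name__ == "__main__":
    entries = list(iter_entries(SAMPLE_LINES))
    for word, pos, form in forms_with_tag(entries, "error-unknown-tag"):
        print(word, pos, form)
    for (pos, pattern), count in tag_pattern_counts(entries).items():
        print(count, pos, pattern)
```

To run this against a real download, the sample list could be replaced with an open file handle, since iter_entries accepts any iterable of lines.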

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.