Wiktionary data extraction errors and warnings

Mandarin inflections

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
romanization 64812 ㄖㄜˊ bopomofo

㶿 character 2574 Pinyin error-unknown-tag
character 2574 po² Wade-Giles

ti2 romanization 1658 ti² canonical
romanization 1658 ㄊㄧˊ bopomofo

cuàn romanization 1287 cuan⁴ alternative
romanization 1287 ㄘㄨㄢˋ bopomofo

character 8 Pinyin error-unknown-tag

lv2 romanization 8 lv² canonical

character 6 Pinyin

character 5 uppercase

character 4 lowercase

西德尼 name 2 Xīdéní romanization

character 1 jiè Pinyin
character 1 zǔn Pinyin
character 1 chieh³ Wade-Giles
character 1 tsun³ Wade-Giles
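
The rows above count how often a given combination of part of speech and form tags occurs. As a rough illustration only, a tally of that kind might be computed from JSON Lines data as sketched below. The sketch assumes each record carries "word", "pos", and a "forms" list whose items have "form" and "tags" keys; the function name tally_form_tags and the two sample records are hypothetical, built from rows on this page, and this is not necessarily how the counts shown here were produced.

```python
import json
from collections import Counter


def tally_form_tags(lines):
    """Count (part of speech, sorted tags) combinations over all forms.

    Assumption: each non-empty line is one JSON object with optional
    "pos" and "forms" keys, and each form has an optional "tags" list.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        entry = json.loads(line)
        pos = entry.get("pos", "")
        for form in entry.get("forms", []):
            # Sort tags so the same set always maps to the same key.
            tags = tuple(sorted(form.get("tags", [])))
            counts[(pos, tags)] += 1
    return counts


# Hypothetical sample records, modelled on rows from this page.
sample = [
    '{"word": "ti2", "pos": "romanization", "forms": ['
    '{"form": "ti²", "tags": ["canonical"]}, '
    '{"form": "ㄊㄧˊ", "tags": ["bopomofo"]}]}',
    '{"word": "cuàn", "pos": "romanization", "forms": ['
    '{"form": "cuan⁴", "tags": ["alternative"]}, '
    '{"form": "ㄘㄨㄢˋ", "tags": ["bopomofo"]}]}',
]

counts = tally_form_tags(sample)
for (pos, tags), n in counts.most_common():
    print(pos, " ".join(tags), n)
```

To run the same tally over a downloaded file, an open file object can be passed in place of the sample list, since the function only iterates over lines.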


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-01-25 from the enwiktionary dump dated 2025-01-20 using wiktextract (c15a5ce and 5c11237). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.