Wiktionary data extraction errors and warnings

Mandarin inflections

Download data in csv format

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
fǎnyīngxióng romanization 65945 ㄈㄢˇ ㄧㄥ ㄒㄩㄥˊ Bopomofo

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 2416 Pinyin error-unknown-tag
character 2416 p'u³ Wade-Giles

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
shu4 romanization 1667 shu⁴ canonical
romanization 1667 ㄕㄨˋ Bopomofo

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
shā romanization 545 sha¹ alternative
romanization 545 ㄕㄚ Bopomofo

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
lǎn romanization 397 lan³ alternative
romanization 397 ㄌㄢˇ Bopomofo
romanization 397 lan nonstandard alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
chǎng romanization 347 chang³ alternative
romanization 347 ㄔㄤˇ Bopomofo
romanization 347 chang nonstandard alternative
romanization 347 ĉǎŋ alternative
romanization 347 ĉǎng alternative
romanization 347 chǎŋ alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
romanization 179 ㄏㄚˋ Bopomofo
romanization 179 ha nonstandard alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
lv4 romanization 8 lv⁴ canonical

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
𫂙 character 8 Pinyin error-unknown-tag

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 6 dài Pinyin

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ɪ character 5 I uppercase

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ng romanization 5 ˙ㆭ Bopomofo
romanization 5 ŋ alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 4 ʂ lowercase

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ǹg romanization 3 ㆭˋ Bopomofo
romanization 3 ng nonstandard alternative
romanization 3 ŋ̀ alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
huang romanization 2 huaŋ alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
西德尼 name 2 Xīdéní romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
pu verb 1 no-table-tags table-tags
verb 1 l-self inflection-template
verb 1 xee apu first-person singular
verb 1 ndee erepu second-person singular
verb 1 ha'e opu third-person singular
verb 1 nhande japu inclusive first-person plural
verb 1 ore oropu exclusive first-person plural
verb 1 peẽ pepu second-person plural
verb 1 ha'e opu third-person plural

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 1 jiè Pinyin
character 1 zǔn Pinyin
character 1 chieh³ Wade-Giles
character 1 tsun³ Wade-Giles

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
duāng romanization 1 duang¹ alternative
romanization 1 ㄉㄨㄤ Bopomofo
romanization 1 duāŋ alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
a/languages M to Z romanization 1 a canonical
romanization 1 Bopomofo
romanization 1 a canonical


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-01-03 from the enwiktionary dump dated 2026-01-01 using wiktextract (96027d6 and 9905b1f). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.