Wiktionary data extraction errors and warnings

Chinois inflections

Download data in csv format

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
欧洲人 noun 4256 歐洲人 Traditional Chinese

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
理髮 verb 4072 理发 Simplified Chinese

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 137 Simplified Chinese
character 137 巌/岩 Traditional Chinese

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
六月 noun 36 6月 error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
为甚么 adv 11 為甚麼 Traditional Chinese
adv 11 为什么 error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
noun 11 Simplified Chinese
noun 11 Traditional Chinese
noun 11 Simplified Chinese

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
同義詞 noun 9 同义词 Simplified Chinese
noun 9 同意詞 error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
verb 4 dialectal

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 2 Traditional Chinese
character 2 Simplified Chinese
character 2 Traditional Chinese

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
台灣 name 2 台湾 Simplified Chinese
name 2 臺灣 Traditional Chinese
name 2 臺灣 error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
蜘蛛 noun 1 鼅鼄 archaic


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-07-19 from the frwiktionary dump dated 2025-07-01 using wiktextract (45c4a21 and f1c2b61). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.