
Cantonese inflections


Word   Part of speech  Count  Form   Tags       Other examples (may be other parts of speech)
cong1  romanization    2117   cong¹  canonical

Word   Part of speech  Count  Form   Tags       Other examples (may be other parts of speech)
       character       1851   zyun4  Jyutping
       character       1851   jyun4  Yale

Word   Part of speech  Count  Form   Tags       Other examples (may be other parts of speech)
       character       35     yu6    Yale

Word   Part of speech  Count  Form   Tags       Other examples (may be other parts of speech)
       character       1      ciu1   Jyutping
       character       1      diu1   Jyutping


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-01-03 from the enwiktionary dump dated 2026-01-01 using wiktextract (96027d6 and 9905b1f). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
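
As a rough illustration of how counts by part of speech and form tags could be tallied from raw wiktextract output, here is a minimal Python sketch. It assumes the JSON Lines layout that wiktextract emits, with one entry per line and the fields "word", "pos", and "forms" (each form carrying "form" and "tags"). The sample records, the placeholder words, and the function name tally_form_tags are hypothetical, and the grouping is not necessarily the one used to produce the counts on this page.

```python
import json
from collections import Counter

# Hypothetical sample records in wiktextract's JSON Lines shape.
# The words "example-1" and "example-2" are placeholders, not real entries.
SAMPLE_JSONL = """\
{"word": "cong1", "pos": "romanization", "forms": [{"form": "cong¹", "tags": ["canonical"]}]}
{"word": "example-1", "pos": "character", "forms": [{"form": "zyun4", "tags": ["Jyutping"]}, {"form": "jyun4", "tags": ["Yale"]}]}
{"word": "example-2", "pos": "character", "forms": [{"form": "yu6", "tags": ["Yale"]}]}
"""


def tally_form_tags(lines):
    """Count forms per (part of speech, sorted tag tuple) pair."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        entry = json.loads(line)
        pos = entry.get("pos", "")
        for form in entry.get("forms", []):
            tags = tuple(sorted(form.get("tags", [])))
            counts[(pos, tags)] += 1
    return counts


counts = tally_form_tags(SAMPLE_JSONL.splitlines())
for (pos, tags), n in counts.most_common():
    print(pos, n, " ".join(tags))
```

To run this against a real download, the same function could be fed an open file object instead of the sample string, since it only needs an iterable of lines.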

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.