Wiktionary data extraction errors and warnings

Khmer inflections

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ពន្លាក noun 586 pɔɔpŭənliək romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ញ៉ាំ verb 94 ñam romanization
verb 94 ការញ៉ាំ abstract-noun

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
អិស្លាម noun 36 អ៊ីស្លាម alternative
noun 36 ឥស្លាម alternative
noun 36 ʼihslaam romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ខ្ញុំ pron 7 khñom romanization
pron 7 khñuṃ alternative
pron 7 kñuṃ alternative
pron 7 ក្ញុំ alternative
pron 7 kñuṃm alternative
pron 7 ក្ញុំម៑ alternative
pron 7 kñum alternative
pron 7 ក្ញុម៑ alternative
pron 7 kñumm alternative
pron 7 ក្ញុម្ម alternative
ខ្ញុំ pron 7 kñaṃ alternative
pron 7 ក្ញំ alternative
pron 7 kñaum alternative
pron 7 ក្ញៅម៑ alternative
pron 7 kyuṃ alternative
pron 7 ក្យុំ alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 4 ឞ, ្ឞ canonical
character 4 ssô romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ហ៊ាន verb 2 hiən romanization
verb 2 ការហ៊ាន abstract-noun
verb 2 ហាន alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ផើង noun 2 phaəng romanization
noun 2 ផើង error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ខ្ញី adj 2 ខ្ញែ alternative
adj 2 khñəy romanization
adj 2 ភាពខ្ញី abstract-noun


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-03-24 from the zhwiktionary dump dated 2026-03-04 using wiktextract (05c257f and 9d9a410). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
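For readers who want to reproduce tallies like the ones in the tables above, the following is a minimal sketch in Python. It assumes the data is JSON Lines with one entry per line, and that each entry has "word", "pos", and a "forms" list whose items have "form" and "tags" fields. The sample lines are hypothetical and built from rows shown on this page. Grouping entries by part of speech and the set of form tags is an assumption about how the counts could be produced, not a description of how kaikki.org computes them.

```python
import json
from collections import Counter

# Hypothetical sample lines in the style of wiktextract JSONL output.
# The field names ("word", "pos", "forms", "form", "tags") are assumed.
SAMPLE_JSONL = """\
{"word": "ញ៉ាំ", "pos": "verb", "forms": [{"form": "ñam", "tags": ["romanization"]}, {"form": "ការញ៉ាំ", "tags": ["abstract-noun"]}]}
{"word": "ហ៊ាន", "pos": "verb", "forms": [{"form": "hiən", "tags": ["romanization"]}, {"form": "ការហ៊ាន", "tags": ["abstract-noun"]}, {"form": "ហាន", "tags": ["alternative"]}]}
"""


def tag_pattern(entry):
    """Return (pos, sorted tuple of per-form tag strings) for one entry."""
    tags = sorted(" ".join(f.get("tags", [])) for f in entry.get("forms", []))
    return entry.get("pos", ""), tuple(tags)


def count_patterns(lines):
    """Count how many entries share each (pos, tag pattern) combination."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        counts[tag_pattern(json.loads(line))] += 1
    return counts


patterns = count_patterns(SAMPLE_JSONL.splitlines())
for (pos, tags), n in patterns.most_common():
    print(pos, n, ", ".join(tags))
```

To run this against a real download, replace SAMPLE_JSONL.splitlines() with an open file handle; the field names should be checked against the actual data first.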

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.