Khmer inflections

Word | Part of speech | Count | Form | Tags | Other examples (may be other parts of speech)

ពូក | noun | 6102 | puuk | romanization |

កា | verb | 2692 | kaa | romanization |
កា | verb | 2692 | កា រកា | abstract-noun |

កូនង៉ែត | noun | 1674 | koun ngaet | error-NO-TAGS-REPORT-THIS |

ខ្មែរ | name | 357 | khmae | romanization |
ខ្មែរ | name | 357 | ខ្មេរ | alternative |

សរសៃឈាមខ្មៅ | noun | 145 | សរសៃ ឈាមខ្មៅ | canonical |
សរសៃឈាមខ្មៅ | noun | 145 | sɑɑ say chiəm khmaw | romanization |

សម្ដែង | verb | 96 | sɑɑsɑmdaeng | romanization |
សម្ដែង | verb | 96 | ការសម្ដែង | abstract-noun |
សម្ដែង | verb | 96 | សំដែង | alternative |

សញ្ញាចរាចរ | noun | 27 | saññaa cɑɑraacɑɑ | error-NO-TAGS-REPORT-THIS |
សញ្ញាចរាចរ | noun | 27 | សញ្ញាចរាចរណ៍ | alternative |

ទៅជា | verb | 11 | ទៅ ជា | canonical |
ទៅជា | verb | 11 | tɨw ciə | romanization |
ទៅជា | verb | 11 | ការទៅជា | abstract-noun |

ស្លាំង | adj | 7 | ភាពស្លាំង | abstract-noun |

ហេតុអ្វី | adv | 3 | ហេតុ អ្វី | canonical |
ហេតុអ្វី | adv | 3 | haet ʼaʼvəy | romanization |
ហេតុអ្វី | adv | 3 | ហេតុអី | alternative |

ពោត | noun | 2 | poot | romanization |
ពោត | noun | 2 | ពោត | classifier |

បាស។ | noun | 1 | បាស ។ | canonical |

១២ | num | 1 | dop pī | formal |
១២ | num | 1 | pī don dop | colloquial |


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-06-28 from the enwiktionary dump dated 2026-06-01 using wiktextract (c6afe2d and 7f4db16). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
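
For readers working from the raw wiktextract data, the following is a minimal sketch of how inflection statistics like those on this page could be tallied. It assumes the input is JSON Lines with one entry per line and the fields "word", "pos", and "forms" (each form carrying "form" and "tags"), which is how wiktextract output is commonly structured. The function name form_tag_patterns and the grouping rule (part of speech plus the set of tag combinations across an entry's forms) are illustrative assumptions, not a description of how kaikki.org actually builds this page.

```python
import json
from collections import Counter


def form_tag_patterns(jsonl_lines):
    """Count entries per (part of speech, form-tag pattern).

    Hypothetical helper: the grouping rule is an assumption made for
    illustration, not the rule used to generate this page.
    """
    counts = Counter()
    examples = {}  # first word seen for each pattern
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue
        entry = json.loads(line)
        forms = entry.get("forms") or []
        if not forms:
            continue  # entries without listed forms are skipped
        # One sorted tuple of tags per form; a form with no tags yields ().
        pattern = tuple(sorted({tuple(sorted(f.get("tags", []))) for f in forms}))
        key = (entry.get("pos"), pattern)
        counts[key] += 1
        examples.setdefault(key, entry.get("word"))
    return counts, examples


# Usage with a few made-up records (not taken from the real data set).
sample = [
    json.dumps({"word": "a", "pos": "noun",
                "forms": [{"form": "x", "tags": ["romanization"]}]}),
    json.dumps({"word": "b", "pos": "noun",
                "forms": [{"form": "y", "tags": ["romanization"]}]}),
    json.dumps({"word": "c", "pos": "verb",
                "forms": [{"form": "z", "tags": ["romanization"]},
                          {"form": "w", "tags": ["abstract-noun"]}]}),
    json.dumps({"word": "d", "pos": "adv"}),
]
counts, examples = form_tag_patterns(sample)
for (pos, pattern), n in counts.most_common():
    print(pos, n, examples[(pos, pattern)], pattern)
```

In this sketch a form with an empty tag list appears as an empty tuple in the pattern; on this page such forms seem to be the ones flagged error-NO-TAGS-REPORT-THIS.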

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.