Wiktionary data extraction errors and warnings

Khmer inflections

Download data in csv format

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
យានដឹកជញ្ជូនសាធារណៈ noun 7597 yiən dək cɔɔcŭəñcuun saathiərĕəʼnaʼ romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ចេរ verb 2654 cee romanization
verb 2654 ការចេរ abstract-noun

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
កំសាន្ត noun 377 kɑmsaan romanization
noun 377 កម្សាន្ត alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ពីព្រោះ conj 143 ពី ព្រោះ canonical
conj 143 pii prŭəh romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
សម្ងាត់ verb 97 sɑmngat romanization
verb 97 ការសម្ងាត់ abstract-noun
verb 97 សំងាត់ alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
តប់ប្រមល់ adj 11 តប់ ប្រមល់ canonical
adj 11 tɑp prɑɑmŭəl romanization
adj 11 ភាពតប់ប្រមល់ abstract-noun

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ស្លេកស្លាំង adj 7 ភាពស្លេកស្លាំង abstract-noun

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
មីកុឡា noun 3 មី កុឡា canonical
noun 3 mii kolaa romanization
noun 3 មីកូឡា alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ពោត noun 2 poot romanization
noun 2 ពោត classifier

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
១២ num 1 dop pī formal
num 1 pī don dop colloquial

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
បាស។ noun 1 បាស ។ canonical

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ពូថៅ noun 1 ប៉ូវថៅ alternative


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-10-26 from the enwiktionary dump dated 2025-10-21 using wiktextract (bd88cf0 and 0a198a9). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.