Wiktionary data extraction errors and warnings

Vietnamese inflections

Download data in csv format

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 6294 nhéo Hán-Nôm
character 6294 nạo Hán-Nôm
character 6294 não Hán-Nôm
character 6294 nàu Hán-Nôm
character 6294 ngoéo Hán-Nôm
character 6294 nhàu Hán-Nôm
character 6294 ngàu Hán-Nôm
character 6294 nao Hán-Nôm
character 6294 nảo Hán-Nôm
character 6294 nảu Hán-Nôm
character 6294 nãu Hán-Nôm
character 6294 nạu Hán-Nôm
character 6294 nhao Hán-Nôm
character 6294 nháo Hán-Nôm
character 6294 nháu Hán-Nôm
character 6294 nhảu Hán-Nôm
character 6294 nhảo Hán-Nôm

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
chủ nghĩa công lợi noun 3628 主義功利 CJK

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
pháp lí adj 2022 pháp lý alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
lăng quăng noun 1732 con classifier

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 1287 跌: Hán Việt readings: điệt canonical
character 1287 trật 跌: Nôm readings: chợt canonical
character 1287 trượt canonical
character 1287 trớt canonical
character 1287 trặc canonical
character 1287 xớt canonical
character 1287 xợt canonical
character 1287 đột canonical

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
bánh tây noun 751 chiếc classifier
noun 751 cái classifier
noun 751 餅西 CJK

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
thèm verb 423 CJK
verb 423 CJK
verb 423 CJK
verb 423 𡅩 CJK
verb 423 𩝎 CJK
verb 423 sèm alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
𦖑 verb 248 nghe romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
xa pô chê noun 182 cây classifier
noun 182 quả classifier
noun 182 trái classifier
noun 182 sa-pô-chê alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
chỉ noun 117 sợi classifier
noun 117 CJK
noun 117 CJK
noun 117 CJK
noun 117 CJK
noun 117 𥿗 CJK
noun 117 chỉn alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 78 lowercase

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 67 uppercase

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
dần adv 47 CJK
adv 47 CJK
adv 47 CJK
adv 47 dần dần diminutive reduplication

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 33 碍: Hán Việt readings: ngại canonical
character 33 ngài 碍: Nôm readings: ngại canonical
character 33 ngáy canonical
character 33 ngái canonical
character 33 礙 (ngại) alternative
character 33 㝵 (ngài) alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
sụt sịt verb 19 sụt sà sụt sịt reduplication

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
僞君子 noun 15 僞君子chữ Hán form of nguỵ quân tử . canonical
noun 15 “hypocrite” romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
riu adv 15 riu riu diminutive reduplication

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
thảy adv 14 𪨐 CJK
adv 14 𫵧 CJK
adv 14 thảy thảy reduplication

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
cu noun 9 con classifier
noun 9 CJK
noun 9 cu cu reduplication

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
lòng noun 8 tấm classifier error-unknown-tag
noun 8 𢚸 CJK
noun 8 𢙱 CJK
noun 8 𪫵 CJK

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
địt intj 8 Địt canonical
intj 8 𱼒 CJK

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
phấn noun 6 cục classifier error-unknown-tag
noun 6 viên classifier error-unknown-tag
noun 6 hòn classifier error-unknown-tag

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
rớm verb 5 rơm rớm reduplication
verb 5 rướm alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
vua noun 3 vị classifier
noun 3 nhà classifier
noun 3 ông classifier
noun 3 𤤰 CJK
noun 3 𢃊 CJK
noun 3 𪻟 CJK
noun 3 con classifier
noun 3 bua alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
vừa adj 3 CJK
adj 3 𣃣 CJK
adj 3 𣃤 CJK
adj 3 vừa vừa diminutive reduplication
adj 3 bưa alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
𣛤 character 3 trái Hán-Nôm
character 3 lái Hán-Nôm
character 3 𧀞 (trái) alternative
character 3 𢁑 (trái) alternative
character 3 𣡙 (trái) alternative
character 3 𣡚 (trái) alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
choé adj 3 choe choé diminutive reduplication
adj 3 chóe alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
nhện noun 2 con classifier
noun 2 CJK
noun 2 𦟶 CJK
noun 2 nhền nhện reduplication
noun 2 dện alternative
noun 2 nhệnh alternative
noun 2 dệng alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
𡦂 noun 2 chữ romanization
noun 2 alternative
noun 2 𡨸 alternative
noun 2 𫳘 alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 2 đồng Hán-Nôm
character 2 Chữ Quốc Ngữ" means copper canonical
character 2 coin canonical
character 2 currency instead. canonical

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
cáng noun 1 CJK
noun 1 cái classifier
noun 1 CJK
noun 1 CJK
noun 1 𫆥 CJK

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
醫藥 noun 1 醫藥chữ Hán form of y dược . canonical
noun 1 in institution names often

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
giăng verb 1 CJK
verb 1 CJK
verb 1 CJK
verb 1 𢬥 CJK
verb 1 giăng giăng reduplication
verb 1 chăng alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Ch character 1 CH uppercase
character 1 ch lowercase

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ch character 1 CH uppercase
character 1 Ch mixedcase

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
CH character 1 ch lowercase
character 1 Ch mixedcase

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
côca noun 1 cây classifier error-unknown-tag
noun 1 cô ca alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
內小邦 adj 1 內小邦chữ Hán form of nội tiểu bang . canonical
adj 1 Overseas Vietnamese error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
chó noun 1 con classifier
noun 1 CJK
noun 1 𤝹 CJK
noun 1 𤠚 CJK
noun 1 𦢞 CJK
noun 1 CJK
noun 1 thằng classifier
noun 1 con classifier

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
bắp noun 1 cây classifier error-unknown-tag
noun 1 𥟼 CJK
noun 1 CJK
noun 1 báp alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
muỗm noun 1 con classifier
noun 1 muồm muỗm reduplication

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
照看 noun 1 照看chữ Hán form of chiếu khán . canonical
noun 1 used by Overseas Vietnamese error-NO-TAGS-REPORT-THIS
noun 1 in Vietnam dated

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
暴主 noun 1 暴主chữ Nôm form of bạo chúa . canonical
noun 1 tên classifier
noun 1 kẻ classifier
noun 1 “tyrant” romanization


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-03-09 from the enwiktionary dump dated 2025-03-02 using wiktextract (32c88e6 and 633533e). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.