Wiktionary data extraction errors and warnings

越南語 inflections

Download data in csv format

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
gián noun 588 𧍴, 𫋨 romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
sinh nhật noun 11 sanh nhật alternative
noun 11 sanh nhựt alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
cá chép noun 10 个/𩵜 + 鮿/𩺗 romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
điên adj 6 điên điên error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Bính âm Hán ngữ name 5 bính âm Hán ngữ canonical
name 5 拼音漢語 romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Á rập adj 4 Ả rập canonical

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ớn verb 3 𠹵, 按, 𠻈, 𢞴, 𫣃 romanization
verb 3 ơn ớn error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ngh character 2 Ngh capitalized

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Cà Mau name 2 哥毛, 歌毛, 歌牟 Chu-Nom

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Bồ Đào Nha adj 2 Bố Đào Nhà humorous alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Nguyễn name 2 Ng̃ alternative
name 2 Ng. alternative
name 2 romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
tan verb 1 散, 珊, 潵, 㪚 romanization
verb 1 dan alternative
verb 1 o dan alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
địt intj 1 Địt canonical
intj 1 𱼒 romanization
intj 1 canonical


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-08-28 from the zhwiktionary dump dated 2025-08-22 using wiktextract (ffdbfc3 and b9346a0). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.