Wiktionary data extraction errors and warnings

跨語言 inflections

Download data in csv format

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 394 ga romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
𐐑 character 299 𐐹 error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
cm² symbol 207 cm2 alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
𐐴 character 122 𐐌 capitalized

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Ρ num 30 Ρ΄ canonical

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Ћ character 5 ћ error-NO-TAGS-REPORT-THIS
character 5 Ć character Latin alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 4 capitalized
character 4 b alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ѭ character 4 romanization
character 4 Ѭ error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
punct 3 ¶¶ plural

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 3 error-NO-TAGS-REPORT-THIS
character 3 capitalized

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
andiniensis adj 3 andiniense neuter

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
XII num 2 VVII alternative
num 2 xii alternative
num 2 xij alternative
num 2 vvii alternative
num 2 vvij alternative
num 2 XII. ordinal alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 2 error-NO-TAGS-REPORT-THIS
character 2 B alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 2 glagoli romanization
character 2 capitalized

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
XVIII num 2 XVIIJ obsolete alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
𑀅 character 2 a romanization
character 2 𑀬 alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
alaskanus adj 1 alaskana feminine
adj 1 alaskanum neuter

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
character 1 Traditional-Chinese alternative
character 1 alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
cm³ symbol 1 cm3 nonstandard alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
II num 1 ii alternative
num 1 II. alternative
num 1 ii error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
iii num 1 III alternative
num 1 IIJ alternative
num 1 iij alternative
num 1 IIV alternative
num 1 iiv nonstandard alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
hle symbol 1 hleč dialectal alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
yta symbol 1 ytet alternative
symbol 1 ytte alternative
symbol 1 ytt past participle alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
symbol 1 /!\\ alternative
symbol 1 ⚠︎ canonical

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
Ń character 1 ń error-NO-TAGS-REPORT-THIS
character 1 obsolete alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ij character 1 ij alternative
character 1 Ij error-NO-TAGS-REPORT-THIS
character 1 IJ capitalized

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
November noun 1 november alternative
noun 1 NOVEMBER alternative
noun 1 Novembre obsolete alternative


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-03-15 from the zhwiktionary dump dated 2026-03-04 using wiktextract (bdd14c0 and 9d9a410). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.