Wiktionary data extraction errors and warnings

意第绪語 inflections

Download data in csv format

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
לבֿנה noun 17 levone romanization
noun 17 לבֿנות plural

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
שוואַרץ adj 7 shvarts romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
טאָכטער noun 5 tokhter romanization
noun 5 טעכטער plural
noun 5 טעכטערל diminutive

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
וואַקוּום noun 4 וואַקווּם alternative
noun 4 וואַקוּוּם alternative
noun 4 וואַקואום alternative
noun 4 vakuum romanization
noun 4 וואַקוּומס plural

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
קעמפֿער noun 2 kemfer romanization
noun 2 קעמפֿערס plural
noun 2 קעמפֿערין feminine

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
דזשאַז noun 2 דזשעז alternative
noun 2 dzhaz romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
בײַ prep 1 באַ alternative
prep 1 bay romanization
prep 1 בײַם error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
צו prep 1 tsu romanization
prep 1 צום error-NO-TAGS-REPORT-THIS

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
קעניג noun 1 קיניג alternative
noun 1 kenig romanization
noun 1 קעניגן plural
noun 1 קעניגין feminine
noun 1 קעניגל diminutive


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-05-30 from the zhwiktionary dump dated 2026-05-01 using wiktextract (702fa29 and 7f4db16). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.