Wiktionary data extraction errors and warnings

Inflection check

This page lists the different kinds of inflection tables found for each language. When wiktextract parses word heads and inflection tables, it assigns tags describing grammatical or contextual information to the forms it encounters. The tags and forms found in each head section or table are first kept separate from those of other head sections and tables. Later they are merged with other heads and tables into table types, so that all tables of one type contain the same number of word forms with the same tags for those forms.
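The grouping into table types can be illustrated with a minimal sketch. It is not wiktextract's actual implementation: the helper names `table_signature` and `group_into_table_types` are hypothetical, the sample data is made up, and it assumes each form is a dictionary with a `form` string and a `tags` list.

```python
from collections import defaultdict


def table_signature(forms):
    """Return a hashable signature: the sorted tag sets of a table's forms.

    Two tables get the same signature only if they have the same number
    of forms and the same tags for those forms (order is ignored).
    """
    return tuple(sorted(tuple(sorted(f["tags"])) for f in forms))


def group_into_table_types(tables):
    """Map each signature to the list of words whose table has it."""
    types = defaultdict(list)
    for word, forms in tables:
        types[table_signature(forms)].append(word)
    return dict(types)


# Made-up sample data, for illustration only.
tables = [
    ("Haus", [{"form": "Haus", "tags": ["nominative", "singular"]},
              {"form": "Häuser", "tags": ["nominative", "plural"]}]),
    ("Maus", [{"form": "Maus", "tags": ["singular", "nominative"]},
              {"form": "Mäuse", "tags": ["plural", "nominative"]}]),
    ("gehen", [{"form": "gehe",
                "tags": ["first-person", "singular", "present"]}]),
]

table_types = group_into_table_types(tables)
for signature, words in table_types.items():
    print(len(signature), "forms:", words)
```

In this sketch the two noun tables fall into one table type even though their tags are listed in a different order, while the verb table forms a type of its own.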

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typos and badly formatted Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.
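As a rough sketch of that heuristic, the following filters out table types that occur for only a few distinct words. The function name `rare_table_types`, the `max_instances` threshold, the type labels and the sample words are all assumptions made for illustration; none of them come from wiktextract or from this page's data.

```python
def rare_table_types(type_to_words, max_instances=2):
    """Return table types with at most max_instances distinct words.

    The result is ordered with the rarest types first, since those are
    the most likely to stem from an error in the source entry.
    """
    rare = {}
    for table_type, words in type_to_words.items():
        unique = sorted(set(words))
        if len(unique) <= max_instances:
            rare[table_type] = unique
    return dict(sorted(rare.items(),
                       key=lambda item: (len(item[1]), item[0])))


# Made-up sample data: the second label contains a misspelled tag,
# so only one word ends up with that table type.
type_to_words = {
    "noun: nominative singular + nominative plural":
        ["Haus", "Maus", "Laus", "Haus"],
    "noun: nominative singular + nominativ plural": ["Strauß"],
    "verb: present first-person singular": ["gehen", "sehen"],
}

suspects = rare_table_types(type_to_words, max_instances=1)
for table_type, words in suspects.items():
    print(table_type, "->", words)
```

A suitable threshold depends on the language: a type with one instance in a language with thousands of tables is more suspicious than one in a language with a single table.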

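Each row below shows the same data in two sort orders: the left three columns are sorted by language name, the right three by number of table forms in descending order.

Language (A-Z) | Table forms | Errors (% affected words) | Language (by table forms) | Table forms | Errors (% affected words)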
Abasinisch 2 0 (0.00%) Deutsch 32040 0 (0.00%)
Abchasisch 3 0 (0.00%) Latein 248 558 (2.77%)
Acehnesisch 2 0 (0.00%) Polnisch 211 8 (0.03%)
Adygeisch 3 0 (0.00%) Altgriechisch 158 746 (6.66%)
Afrikaans 16 78 (10.13%) Schwedisch 149 0 (0.00%)
Akkadisch 14 162 (3.23%) Englisch 120 0 (0.00%)
Albanisch 40 258 (3.63%) Französisch 80 0 (0.00%)
Altaisch 3 0 (0.00%) Italienisch 78 50 (0.02%)
Altenglisch 18 12 (28.83%) Tschechisch 78 0 (0.00%)
Altfranzösisch 3 0 (0.00%) Niederländisch 77 68 (7.60%)
Altgriechisch 158 746 (6.66%) Ukrainisch 74 0 (0.00%)
Althochdeutsch 19 0 (0.00%) Russisch 61 0 (0.00%)
Altirisch 5 0 (0.00%) Niedersorbisch 56 0 (0.00%)
Altkirchenslawisch 16 66 (21.21%) Neugriechisch 54 122 (13.04%)
Altnordisch 7 0 (0.00%) Arabisch 50 154 (8.09%)
Alttschechisch 1 0 (0.00%) Armenisch 49 134 (2.54%)
Amharisch 1 0 (0.00%) Isländisch 44 50 (9.36%)
Arabisch 50 154 (8.09%) Obersorbisch 42 0 (0.00%)
Aragonesisch 1 0 (0.00%) Serbisch 42 14 (0.38%)
Armenisch 49 134 (2.54%) Weißrussisch 41 0 (0.00%)
Aserbaidschanisch 13 8 (2.86%) Albanisch 40 258 (3.63%)
Assamesisch 1 0 (0.00%) Kroatisch 40 0 (0.00%)
Asturisch 1 0 (0.00%) Spanisch 38 0 (0.00%)
Awarisch 2 0 (0.00%) Bosnisch 34 0 (0.00%)
Bairisch 1 0 (0.00%) Slowakisch 33 0 (0.00%)
Baktrisch 1 0 (0.00%) Slowenisch 33 0 (0.00%)
Balinesisch 1 0 (0.00%) Gotisch 32 24 (20.97%)
Baschkirisch 3 0 (0.00%) Portugiesisch 31 32 (5.17%)
Baskisch 15 22 (4.97%) Prußisch 31 288 (13.53%)
Belutschi 4 4 (0.00%) Niederdeutsch 30 100 (7.04%)
Bengalisch 2 0 (0.00%) Finnisch 30 82 (4.56%)
Birmanisch 3 0 (0.00%) Mazedonisch 29 0 (0.00%)
Bosnisch 34 0 (0.00%) Ido 28 32 (40.40%)
Brahui 1 0 (0.00%) Irisch 27 2 (3.12%)
Bretonisch 5 0 (0.00%) Bulgarisch 27 132 (3.62%)
Bulgarisch 27 132 (3.62%) Hebräisch 27 16 (1.89%)
Burjatisch 3 0 (0.00%) Dänisch 26 2 (10.85%)
Catawba 1 0 (0.00%) Esperanto 25 0 (0.00%)
Chakassisch 1 0 (0.00%) Norwegisch 25 0 (0.00%)
Chantisch 2 0 (0.00%) Rumänisch 25 122 (2.54%)
Chinesisch 23 0 (0.00%) Türkisch 25 80 (1.34%)
Deutsch 32040 0 (0.00%) Persisch 25 16 (0.76%)
Dunganisch 2 0 (0.00%) Ungarisch 24 42 (4.04%)
Durango-Nahuatl 1 0 (0.00%) Chinesisch 23 0 (0.00%)
Dänisch 26 2 (10.85%) Färöisch 21 0 (0.00%)
Englisch 120 0 (0.00%) Jiddisch 20 0 (0.00%)
Esperanto 25 0 (0.00%) Althochdeutsch 19 0 (0.00%)
Estnisch 6 30 (16.83%) Litauisch 19 0 (0.00%)
Faliskisch 6 0 (0.00%) Lettisch 18 30 (9.98%)
Finnisch 30 82 (4.56%) Altenglisch 18 12 (28.83%)
Französisch 80 0 (0.00%) Katalanisch 17 38 (16.67%)
Friaulisch 5 0 (0.00%) Usbekisch 17 0 (0.00%)
Frühneuhochdeutsch 11 0 (0.00%) Klassisches Nahuatl 17 96 (40.61%)
Fulfulde 1 0 (0.00%) Urdu 17 124 (2.18%)
Färöisch 21 0 (0.00%) Okzitanisch 16 14 (0.18%)
Galicisch 4 0 (0.00%) Afrikaans 16 78 (10.13%)
Georgisch 11 0 (0.00%) Altkirchenslawisch 16 66 (21.21%)
Gotisch 32 24 (20.97%) Baskisch 15 22 (4.97%)
Guerrero-Nahuatl 1 2 (0.00%) Kurdisch 14 30 (0.00%)
Gujarati 1 0 (0.00%) Akkadisch 14 162 (3.23%)
Gurage 1 0 (0.00%) Aserbaidschanisch 13 8 (2.86%)
Haitianisch 1 0 (0.00%) Koreanisch 12 0 (0.00%)
Hausa 5 12 (16.85%) Levantinisches Arabisch 12 2 (0.00%)
Hawaiianisch 3 0 (0.00%) Walisisch 12 0 (0.00%)
Hebräisch 27 16 (1.89%) Maltesisch 11 0 (0.00%)
Hethitisch 8 0 (0.00%) Westfriesisch 11 34 (32.00%)
Hindi 6 0 (0.00%) Georgisch 11 0 (0.00%)
Huastekisches Ost-Nahuatl 1 0 (0.00%) Zentral-Nahuatl 11 50 (10.00%)
Huastekisches West-Nahuatl 1 2 (0.00%) Frühneuhochdeutsch 11 0 (0.00%)
Huastekisches Zentral-Nahuatl 8 34 (38.89%) Sumerisch 11 80 (25.00%)
Hurritisch 1 0 (0.00%) Vietnamesisch 10 0 (0.00%)
Ido 28 32 (40.40%) Mittelhochdeutsch 10 0 (0.00%)
Indonesisch 6 0 (0.00%) International 9 0 (0.00%)
Interlingua 6 0 (0.00%) Thai 9 0 (0.00%)
International 9 0 (0.00%) Huastekisches Zentral-Nahuatl 8 34 (38.89%)
Inuktitut 4 0 (0.00%) Südpikenisch 8 0 (0.00%)
Inupiaq 1 0 (0.00%) Hethitisch 8 0 (0.00%)
Irisch 27 2 (3.12%) Kaschubisch 7 0 (0.00%)
Isländisch 44 50 (9.36%) Krimtatarisch 7 0 (0.00%)
Italienisch 78 50 (0.02%) Altnordisch 7 0 (0.00%)
Jakutisch 2 0 (0.00%) Interlingua 6 0 (0.00%)
Jamaika-Kreolisch 3 0 (0.00%) Japanisch 6 0 (0.00%)
Japanisch 6 0 (0.00%) Estnisch 6 30 (16.83%)
Jiddisch 20 0 (0.00%) Indonesisch 6 0 (0.00%)
Kabardinisch 3 0 (0.00%) Paschtu 6 60 (0.78%)
Kannada 1 0 (0.00%) Hindi 6 0 (0.00%)
Kantonesisch 1 0 (0.00%) Serbokroatisch 6 0 (0.00%)
Karatschai-Balkarisch 3 0 (0.00%) Sindhi 6 6 (0.00%)
Kasachisch 4 0 (0.00%) Faliskisch 6 0 (0.00%)
Kaschubisch 7 0 (0.00%) Koptisch 6 0 (0.00%)
Katalanisch 17 38 (16.67%) Luxemburgisch 5 16 (0.00%)
Khmer 1 0 (0.00%) Friaulisch 5 0 (0.00%)
Khowar 1 0 (0.00%) Shona 5 0 (0.00%)
Kikuyu 2 0 (0.00%) Hausa 5 12 (16.85%)
Kirchenslawisch 1 0 (0.00%) Bretonisch 5 0 (0.00%)
Kirgisisch 4 0 (0.00%) Altirisch 5 0 (0.00%)
Klamath 1 0 (0.00%) Tagalog 5 6 (62.50%)
Klassisches Nahuatl 17 96 (40.61%) Mongolisch 5 2 (0.00%)
Komi 3 0 (0.00%) Tadschikisch 5 0 (0.00%)
Konkani 1 0 (0.00%) Venezianisch 5 0 (0.00%)
Koptisch 6 0 (0.00%) West-Pandschabi 5 30 (3.35%)
Koreanisch 12 0 (0.00%) Umbrisch 5 0 (0.00%)
Kornisch 1 0 (0.00%) Galicisch 4 0 (0.00%)
Korsisch 1 0 (0.00%) Sardisch 4 0 (0.00%)
Kotava 1 2 (100.00%) Scots 4 14 (48.84%)
Krimtatarisch 7 0 (0.00%) Suaheli 4 34 (77.27%)
Kroatisch 40 0 (0.00%) Tatarisch 4 0 (0.00%)
Kumükisch 1 0 (0.00%) Kasachisch 4 0 (0.00%)
Kurdisch 14 30 (0.00%) Kirgisisch 4 0 (0.00%)
Ladinisch 2 0 (0.00%) Tetelcingo-Nahuatl 4 4 (25.00%)
Laotisch 1 0 (0.00%) Inuktitut 4 0 (0.00%)
Latein 248 558 (2.77%) Oskisch 4 0 (0.00%)
Lettgallisch 3 0 (0.00%) Nepalesisch 4 0 (0.00%)
Lettisch 18 30 (9.98%) Belutschi 4 4 (0.00%)
Levantinisches Arabisch 12 2 (0.00%) Lettgallisch 3 0 (0.00%)
Litauisch 19 0 (0.00%) Hawaiianisch 3 0 (0.00%)
Luxemburgisch 5 16 (0.00%) Nauruisch 3 0 (0.00%)
Láadan 1 0 (0.00%) Maori 3 0 (0.00%)
Malaiisch 2 0 (0.00%) Volapük 3 0 (0.00%)
Malayalam 1 0 (0.00%) Baschkirisch 3 0 (0.00%)
Maledivisch 1 0 (0.00%) Altfranzösisch 3 0 (0.00%)
Maltesisch 11 0 (0.00%) Abchasisch 3 0 (0.00%)
Mandschurisch 1 0 (0.00%) Adygeisch 3 0 (0.00%)
Manx 1 0 (0.00%) Altaisch 3 0 (0.00%)
Maori 3 0 (0.00%) Burjatisch 3 0 (0.00%)
Marathi 3 0 (0.00%) Kabardinisch 3 0 (0.00%)
Mari 2 0 (0.00%) Karatschai-Balkarisch 3 0 (0.00%)
Marsisch 2 0 (0.00%) Komi 3 0 (0.00%)
Mazedonisch 29 0 (0.00%) Ossetisch 3 0 (0.00%)
Mittelenglisch 1 0 (0.00%) Tschetschenisch 3 0 (0.00%)
Mittelgriechisch 1 0 (0.00%) Tschuktschisch 3 0 (0.00%)
Mittelhochdeutsch 10 0 (0.00%) Temascaltepec-Nahuatl 3 10 (25.00%)
Mokscha 2 0 (0.00%) Rätoromanisch 3 0 (0.00%)
Mongolisch 5 2 (0.00%) Marathi 3 0 (0.00%)
Morisien 1 0 (0.00%) Sanskrit 3 0 (0.00%)
Nahuatl 1 0 (0.00%) Jamaika-Kreolisch 3 0 (0.00%)
Nauruisch 3 0 (0.00%) Birmanisch 3 0 (0.00%)
Nepalesisch 4 0 (0.00%) Malaiisch 2 0 (0.00%)
Neugriechisch 54 122 (13.04%) Kikuyu 2 0 (0.00%)
Niederdeutsch 30 100 (7.04%) Tetum 2 0 (0.00%)
Niederländisch 77 68 (7.60%) Schottisch-Gälisch 2 0 (0.00%)
Niedersorbisch 56 0 (0.00%) Westflämisch 2 0 (0.00%)
Niueanisch 1 0 (0.00%) Abasinisch 2 0 (0.00%)
Nordfriesisch 1 0 (0.00%) Awarisch 2 0 (0.00%)
Norwegisch 25 0 (0.00%) Chantisch 2 0 (0.00%)
Novial 1 0 (0.00%) Dunganisch 2 0 (0.00%)
Obersorbisch 42 0 (0.00%) Jakutisch 2 0 (0.00%)
Okzitanisch 16 14 (0.18%) Mari 2 0 (0.00%)
Orizaba-Nahuatl 2 0 (0.00%) Mokscha 2 0 (0.00%)
Oromo 1 0 (0.00%) Tschuwaschisch 2 0 (0.00%)
Oskisch 4 0 (0.00%) Tuwinisch 2 0 (0.00%)
Osmanisches Türkisch 1 2 (0.00%) Udmurtisch 2 0 (0.00%)
Ossetisch 3 0 (0.00%) Urum 2 0 (0.00%)
Pali 1 0 (0.00%) Sizilianisch 2 0 (0.00%)
Pandschabi 1 0 (0.00%) Marsisch 2 0 (0.00%)
Papiamentu 1 0 (0.00%) Acehnesisch 2 0 (0.00%)
Paschtu 6 60 (0.78%) Telugu 2 0 (0.00%)
Pennsylvaniadeutsch 1 0 (0.00%) Uigurisch 2 0 (0.00%)
Persisch 25 16 (0.76%) Sogdisch 2 0 (0.00%)
Piemontesisch 1 0 (0.00%) Bengalisch 2 0 (0.00%)
Polabisch 2 0 (0.00%) Orizaba-Nahuatl 2 0 (0.00%)
Polnisch 211 8 (0.03%) Polabisch 2 0 (0.00%)
Portugiesisch 31 32 (5.17%) Ladinisch 2 0 (0.00%)
Prußisch 31 288 (13.53%) Asturisch 1 0 (0.00%)
Rumänisch 25 122 (2.54%) Haitianisch 1 0 (0.00%)
Russisch 61 0 (0.00%) Pennsylvaniadeutsch 1 0 (0.00%)
Rätoromanisch 3 0 (0.00%) Nordfriesisch 1 0 (0.00%)
Sami 1 0 (0.00%) Balinesisch 1 0 (0.00%)
Samoanisch 1 0 (0.00%) Huastekisches Ost-Nahuatl 1 0 (0.00%)
Sanskrit 3 0 (0.00%) Tok Pisin 1 0 (0.00%)
Sardisch 4 0 (0.00%) Niueanisch 1 0 (0.00%)
Schottisch-Gälisch 2 0 (0.00%) Papiamentu 1 0 (0.00%)
Schwedisch 149 0 (0.00%) Malayalam 1 0 (0.00%)
Scots 4 14 (48.84%) Kannada 1 0 (0.00%)
Serbisch 42 14 (0.38%) Urartäisch 1 0 (0.00%)
Serbokroatisch 6 0 (0.00%) Tuvaluisch 1 0 (0.00%)
Sesotho 1 0 (0.00%) Chakassisch 1 0 (0.00%)
Shona 5 0 (0.00%) Turkmenisch 1 0 (0.00%)
Sindarin 1 0 (0.00%) Kumükisch 1 0 (0.00%)
Sindhi 6 6 (0.00%) Nahuatl 1 0 (0.00%)
Singhalesisch 1 0 (0.00%) Huastekisches West-Nahuatl 1 2 (0.00%)
Sizilianisch 2 0 (0.00%) Aragonesisch 1 0 (0.00%)
Slowakisch 33 0 (0.00%) Zentrales Puebla-Nahuatl 1 0 (0.00%)
Slowenisch 33 0 (0.00%) Korsisch 1 0 (0.00%)
Sogdisch 2 0 (0.00%) Tamil 1 0 (0.00%)
Somalisch 1 0 (0.00%) Sesotho 1 0 (0.00%)
Spanisch 38 0 (0.00%) Manx 1 0 (0.00%)
Suaheli 4 34 (77.27%) Kornisch 1 0 (0.00%)
Sumerisch 11 80 (25.00%) Samoanisch 1 0 (0.00%)
Swanisch 1 0 (0.00%) Somalisch 1 0 (0.00%)
Südpikenisch 8 0 (0.00%) isiZulu 1 0 (0.00%)
Tadschikisch 5 0 (0.00%) Sindarin 1 0 (0.00%)
Tagalog 5 6 (62.50%) Hurritisch 1 0 (0.00%)
Tahitianisch 1 0 (0.00%) Fulfulde 1 0 (0.00%)
Tamil 1 0 (0.00%) Bairisch 1 0 (0.00%)
Tatarisch 4 0 (0.00%) Pali 1 0 (0.00%)
Telugu 2 0 (0.00%) Sami 1 0 (0.00%)
Temascaltepec-Nahuatl 3 10 (25.00%) Twi 1 0 (0.00%)
Tetelcingo-Nahuatl 4 4 (25.00%) Novial 1 0 (0.00%)
Tetum 2 0 (0.00%) Zentral-Alaska-Yupik 1 0 (0.00%)
Thai 9 0 (0.00%) Oromo 1 0 (0.00%)
Tibetisch 1 0 (0.00%) Swanisch 1 0 (0.00%)
Tigrinya 1 0 (0.00%) Gurage 1 0 (0.00%)
Tok Pisin 1 0 (0.00%) Inupiaq 1 0 (0.00%)
Torwali 1 0 (0.00%) Assamesisch 1 0 (0.00%)
Tschechisch 78 0 (0.00%) Gujarati 1 0 (0.00%)
Tschetschenisch 3 0 (0.00%) Pandschabi 1 0 (0.00%)
Tschuktschisch 3 0 (0.00%) Khowar 1 0 (0.00%)
Tschuwaschisch 2 0 (0.00%) Laotisch 1 0 (0.00%)
Turkmenisch 1 0 (0.00%) Catawba 1 0 (0.00%)
Tuvaluisch 1 0 (0.00%) Mittelenglisch 1 0 (0.00%)
Tuwinisch 2 0 (0.00%) Mittelgriechisch 1 0 (0.00%)
Twi 1 0 (0.00%) Guerrero-Nahuatl 1 2 (0.00%)
Türkisch 25 80 (1.34%) Klamath 1 0 (0.00%)
Udmurtisch 2 0 (0.00%) Torwali 1 0 (0.00%)
Ugaritisch 1 0 (0.00%) Amharisch 1 0 (0.00%)
Uigurisch 2 0 (0.00%) Durango-Nahuatl 1 0 (0.00%)
Ukrainisch 74 0 (0.00%) Piemontesisch 1 0 (0.00%)
Umbrisch 5 0 (0.00%) Alttschechisch 1 0 (0.00%)
Ungarisch 24 42 (4.04%) Kotava 1 2 (100.00%)
Urartäisch 1 0 (0.00%) Láadan 1 0 (0.00%)
Urdu 17 124 (2.18%) Brahui 1 0 (0.00%)
Urum 2 0 (0.00%) Morisien 1 0 (0.00%)
Usbekisch 17 0 (0.00%) Tahitianisch 1 0 (0.00%)
Venezianisch 5 0 (0.00%) Kirchenslawisch 1 0 (0.00%)
Vietnamesisch 10 0 (0.00%) Osmanisches Türkisch 1 2 (0.00%)
Volapük 3 0 (0.00%) Kantonesisch 1 0 (0.00%)
Walisisch 12 0 (0.00%) Ugaritisch 1 0 (0.00%)
Weißrussisch 41 0 (0.00%) Mandschurisch 1 0 (0.00%)
West-Pandschabi 5 30 (3.35%) Tibetisch 1 0 (0.00%)
Westflämisch 2 0 (0.00%) Konkani 1 0 (0.00%)
Westfriesisch 11 34 (32.00%) Maledivisch 1 0 (0.00%)
Zentral-Alaska-Yupik 1 0 (0.00%) Baktrisch 1 0 (0.00%)
Zentral-Nahuatl 11 50 (10.00%) Tigrinya 1 0 (0.00%)
Zentrales Puebla-Nahuatl 1 0 (0.00%) Khmer 1 0 (0.00%)
isiZulu 1 0 (0.00%) Singhalesisch 1 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-12-13 from the dewiktionary dump dated 2025-12-02 using wiktextract (e2469cc and 9905b1f). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.