Wiktionary data extraction errors and warnings

Inflection check

This page lists the different kinds of inflection tables found for each language. When wiktextract parses word heads and tables, it assigns each form it encounters tags that describe grammatical or contextual information. The forms and tags found in one head section or table are first kept separate from those of other head sections and tables. Later they are merged into table types, where every table of a given type contains the same number of word forms with the same tags for those forms.
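The grouping into table types can be sketched as follows. This is only an illustration under stated assumptions, not wiktextract's actual code: the `{"form": ..., "tags": [...]}` shape, the helper names `table_signature` and `group_table_types`, and the sample words are all hypothetical.

```python
from collections import defaultdict


def table_signature(forms):
    """Return a hashable signature for one table: the tag set of every form.

    Tags are sorted inside each form, and the forms themselves are sorted,
    so two tables with the same tags in a different order compare equal.
    """
    return tuple(sorted(tuple(sorted(f["tags"])) for f in forms))


def group_table_types(tables):
    """Group words whose tables share a signature into one table type."""
    types = defaultdict(list)
    for word, forms in tables:
        types[table_signature(forms)].append(word)
    return dict(types)


# Hypothetical sample data: (word, forms of its inflection table).
tables = [
    ("Hund", [{"form": "Hund", "tags": ["nominative", "singular"]},
              {"form": "Hunde", "tags": ["nominative", "plural"]}]),
    ("Tag", [{"form": "Tag", "tags": ["singular", "nominative"]},
             {"form": "Tage", "tags": ["plural", "nominative"]}]),
    ("Milch", [{"form": "Milch", "tags": ["nominative", "singular"]}]),
]

table_types = group_table_types(tables)
for signature, words in table_types.items():
    print(len(signature), "forms:", words)
```

In this sketch "Hund" and "Tag" end up in the same table type because they have the same number of forms with the same tags, while "Milch" forms a type of its own.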

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typos, and badly formatted Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.
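One way to act on this heuristic is to count how many words use each table type and flag the rare ones for manual review. The sketch below assumes a hypothetical mapping from word to a table-type label. The function name `rare_table_types`, the threshold, and the sample data are illustrative and not part of wiktextract.

```python
from collections import Counter


def rare_table_types(type_of_word, max_instances=2):
    """Return the words whose table type occurs at most `max_instances` times."""
    counts = Counter(type_of_word.values())
    return {
        word: table_type
        for word, table_type in type_of_word.items()
        if counts[table_type] <= max_instances
    }


# Hypothetical sample: three words share type "A", one word has type "B".
type_of_word = {"Hund": "A", "Tag": "A", "Berg": "A", "Milch": "B"}

rare = rare_table_types(type_of_word, max_instances=1)
print(rare)
```

A rare type is only a hint: it may point to a genuinely unusual paradigm rather than an error, so flagged entries still need to be checked by hand.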

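The table is shown twice, side by side: on the left sorted by language name (ascending), on the right sorted by number of table forms (descending).

Language  Table forms  Errors (% affected words)  Language  Table forms  Errors (% affected words)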
Abasinisch 2 0 (0.00%) Deutsch 31571 4 (0.00%)
Abchasisch 3 0 (0.00%) Latein 249 546 (2.78%)
Acehnesisch 2 0 (0.00%) Polnisch 211 8 (0.03%)
Adygeisch 3 0 (0.00%) Altgriechisch 158 738 (6.44%)
Afrikaans 16 78 (10.13%) Schwedisch 149 0 (0.00%)
Akkadisch 14 162 (3.16%) Englisch 121 0 (0.00%)
Albanisch 43 254 (3.61%) Französisch 80 0 (0.00%)
Altaisch 3 0 (0.00%) Niederländisch 78 66 (7.67%)
Altenglisch 19 12 (28.57%) Italienisch 78 50 (0.02%)
Altfranzösisch 3 0 (0.00%) Tschechisch 78 0 (0.00%)
Altgriechisch 158 738 (6.44%) Ukrainisch 74 0 (0.00%)
Althochdeutsch 19 0 (0.00%) Russisch 61 0 (0.00%)
Altirisch 5 0 (0.00%) Niedersorbisch 56 0 (0.00%)
Altkirchenslawisch 16 56 (21.21%) Neugriechisch 54 118 (13.04%)
Altnordisch 7 0 (0.00%) Arabisch 51 194 (7.65%)
Altsächsisch 1 0 (0.00%) Armenisch 49 134 (2.51%)
Alttschechisch 1 0 (0.00%) Isländisch 45 50 (9.34%)
Amharisch 1 0 (0.00%) Albanisch 43 254 (3.61%)
Arabisch 51 194 (7.65%) Obersorbisch 42 0 (0.00%)
Aragonesisch 1 0 (0.00%) Weißrussisch 41 0 (0.00%)
Armenisch 49 134 (2.51%) Serbisch 41 14 (0.36%)
Aserbaidschanisch 14 8 (2.84%) Kroatisch 40 0 (0.00%)
Assamesisch 1 0 (0.00%) Spanisch 38 0 (0.00%)
Asturisch 1 0 (0.00%) Bosnisch 34 0 (0.00%)
Awarisch 2 0 (0.00%) Niederdeutsch 33 102 (6.80%)
Bairisch 1 0 (0.00%) Slowakisch 33 0 (0.00%)
Baktrisch 1 0 (0.00%) Slowenisch 33 0 (0.00%)
Balinesisch 1 0 (0.00%) Portugiesisch 32 36 (5.08%)
Baschkirisch 3 0 (0.00%) Gotisch 32 24 (20.97%)
Baskisch 15 22 (4.97%) Finnisch 31 82 (4.56%)
Belutschi 4 4 (0.00%) Prußisch 31 266 (13.49%)
Bengalisch 2 0 (0.00%) Ido 29 32 (40.55%)
Birmanisch 3 0 (0.00%) Mazedonisch 29 0 (0.00%)
Bosnisch 34 0 (0.00%) Irisch 27 2 (3.12%)
Brahui 1 0 (0.00%) Bulgarisch 27 134 (3.69%)
Bretonisch 5 0 (0.00%) Hebräisch 27 16 (1.85%)
Bulgarisch 27 134 (3.69%) Dänisch 26 2 (10.85%)
Burjatisch 3 0 (0.00%) Norwegisch 26 2 (0.26%)
Catawba 1 0 (0.00%) Persisch 26 16 (0.72%)
Chakassisch 1 0 (0.00%) Esperanto 25 0 (0.00%)
Chantisch 2 0 (0.00%) Rumänisch 25 110 (2.54%)
Chinesisch 23 0 (0.00%) Chinesisch 23 0 (0.00%)
Deutsch 31571 4 (0.00%) Türkisch 23 80 (1.50%)
Dunganisch 2 0 (0.00%) Ungarisch 22 42 (4.35%)
Durango-Nahuatl 1 0 (0.00%) Lettisch 21 30 (9.93%)
Dänisch 26 2 (10.85%) Färöisch 21 0 (0.00%)
Englisch 121 0 (0.00%) Jiddisch 20 0 (0.00%)
Esperanto 25 0 (0.00%) Althochdeutsch 19 0 (0.00%)
Estnisch 6 20 (16.83%) Litauisch 19 0 (0.00%)
Faliskisch 6 0 (0.00%) Altenglisch 19 12 (28.57%)
Finnisch 31 82 (4.56%) Katalanisch 17 38 (17.54%)
Französisch 80 0 (0.00%) Usbekisch 17 0 (0.00%)
Friaulisch 5 0 (0.00%) Klassisches Nahuatl 17 98 (40.61%)
Frühneuhochdeutsch 11 0 (0.00%) Urdu 17 124 (2.18%)
Fulfulde 1 0 (0.00%) Okzitanisch 16 14 (0.18%)
Färöisch 21 0 (0.00%) Afrikaans 16 78 (10.13%)
Galicisch 4 0 (0.00%) Altkirchenslawisch 16 56 (21.21%)
Georgisch 11 0 (0.00%) Baskisch 15 22 (4.97%)
Gotisch 32 24 (20.97%) Kurdisch 14 30 (0.00%)
Guerrero-Nahuatl 1 2 (0.00%) Levantinisches Arabisch 14 18 (0.41%)
Gujarati 1 0 (0.00%) Aserbaidschanisch 14 8 (2.84%)
Gurage 1 0 (0.00%) Akkadisch 14 162 (3.16%)
Haitianisch 1 0 (0.00%) Walisisch 12 0 (0.00%)
Hausa 5 12 (16.85%) Mittelhochdeutsch 12 0 (0.00%)
Hawaiianisch 3 0 (0.00%) Koreanisch 12 0 (0.00%)
Hebräisch 27 16 (1.85%) Maltesisch 11 0 (0.00%)
Hethitisch 8 0 (0.00%) Westfriesisch 11 34 (31.68%)
Hindi 6 0 (0.00%) Georgisch 11 0 (0.00%)
Huastekisches Ost-Nahuatl 1 0 (0.00%) Zentral-Nahuatl 11 50 (10.00%)
Huastekisches West-Nahuatl 1 2 (0.00%) Frühneuhochdeutsch 11 0 (0.00%)
Huastekisches Zentral-Nahuatl 8 34 (38.89%) Sumerisch 11 80 (25.00%)
Hurritisch 1 0 (0.00%) Vietnamesisch 10 0 (0.00%)
Ido 29 32 (40.55%) International 9 0 (0.00%)
Indonesisch 6 0 (0.00%) Thai 9 0 (0.00%)
Interlingua 6 0 (0.00%) Huastekisches Zentral-Nahuatl 8 34 (38.89%)
International 9 0 (0.00%) Südpikenisch 8 0 (0.00%)
Inuktitut 4 0 (0.00%) Hethitisch 8 0 (0.00%)
Inupiaq 1 0 (0.00%) Kaschubisch 7 0 (0.00%)
Irisch 27 2 (3.12%) Krimtatarisch 7 0 (0.00%)
Isländisch 45 50 (9.34%) Altnordisch 7 0 (0.00%)
Italienisch 78 50 (0.02%) Interlingua 6 0 (0.00%)
Jakutisch 2 0 (0.00%) Japanisch 6 0 (0.00%)
Jamaika-Kreolisch 3 0 (0.00%) Estnisch 6 20 (16.83%)
Japanisch 6 0 (0.00%) Indonesisch 6 0 (0.00%)
Jiddisch 20 0 (0.00%) Paschtu 6 60 (0.78%)
Kabardinisch 3 0 (0.00%) Hindi 6 0 (0.00%)
Kannada 1 0 (0.00%) Serbokroatisch 6 0 (0.00%)
Kantonesisch 1 0 (0.00%) Sindhi 6 6 (0.00%)
Karatschai-Balkarisch 3 0 (0.00%) Faliskisch 6 0 (0.00%)
Kasachisch 4 0 (0.00%) Koptisch 6 0 (0.00%)
Kaschubisch 7 0 (0.00%) Luxemburgisch 5 16 (0.00%)
Katalanisch 17 38 (17.54%) Friaulisch 5 0 (0.00%)
Khmer 1 0 (0.00%) Shona 5 0 (0.00%)
Khowar 1 0 (0.00%) Hausa 5 12 (16.85%)
Kikuyu 2 0 (0.00%) Bretonisch 5 0 (0.00%)
Kirchenslawisch 1 0 (0.00%) Altirisch 5 0 (0.00%)
Kirgisisch 4 0 (0.00%) Tagalog 5 4 (62.50%)
Klamath 1 0 (0.00%) Mongolisch 5 2 (0.00%)
Klassisches Nahuatl 17 98 (40.61%) Tadschikisch 5 0 (0.00%)
Komi 3 0 (0.00%) Venezianisch 5 0 (0.00%)
Konkani 1 0 (0.00%) West-Pandschabi 5 30 (3.33%)
Koptisch 6 0 (0.00%) Umbrisch 5 0 (0.00%)
Koreanisch 12 0 (0.00%) Galicisch 4 0 (0.00%)
Kornisch 1 0 (0.00%) Nauruisch 4 0 (0.00%)
Korsisch 1 0 (0.00%) Sardisch 4 0 (0.00%)
Kotava 1 2 (100.00%) Scots 4 16 (48.84%)
Krimtatarisch 7 0 (0.00%) Suaheli 4 34 (77.27%)
Kroatisch 40 0 (0.00%) Tatarisch 4 0 (0.00%)
Kumükisch 1 0 (0.00%) Kasachisch 4 0 (0.00%)
Kurdisch 14 30 (0.00%) Kirgisisch 4 0 (0.00%)
Ladinisch 2 0 (0.00%) Tetelcingo-Nahuatl 4 4 (25.00%)
Laotisch 1 0 (0.00%) Inuktitut 4 0 (0.00%)
Latein 249 546 (2.78%) Oskisch 4 0 (0.00%)
Lettgallisch 3 0 (0.00%) Nepalesisch 4 0 (0.00%)
Lettisch 21 30 (9.93%) Belutschi 4 4 (0.00%)
Levantinisches Arabisch 14 18 (0.41%) Lettgallisch 3 0 (0.00%)
Litauisch 19 0 (0.00%) Hawaiianisch 3 0 (0.00%)
Luwisch 2 2 (50.00%) Maori 3 0 (0.00%)
Luxemburgisch 5 16 (0.00%) Volapük 3 0 (0.00%)
Láadan 1 0 (0.00%) Baschkirisch 3 0 (0.00%)
Malaiisch 2 0 (0.00%) Altfranzösisch 3 0 (0.00%)
Malayalam 1 0 (0.00%) Abchasisch 3 0 (0.00%)
Maledivisch 1 0 (0.00%) Adygeisch 3 0 (0.00%)
Maltesisch 11 0 (0.00%) Altaisch 3 0 (0.00%)
Mandschurisch 1 0 (0.00%) Burjatisch 3 0 (0.00%)
Manx 1 0 (0.00%) Kabardinisch 3 0 (0.00%)
Maori 3 0 (0.00%) Karatschai-Balkarisch 3 0 (0.00%)
Marathi 3 0 (0.00%) Komi 3 0 (0.00%)
Mari 2 0 (0.00%) Ossetisch 3 0 (0.00%)
Marsisch 2 0 (0.00%) Tschetschenisch 3 0 (0.00%)
Mazedonisch 29 0 (0.00%) Tschuktschisch 3 0 (0.00%)
Mittelenglisch 2 0 (0.00%) Temascaltepec-Nahuatl 3 10 (25.00%)
Mittelgriechisch 1 0 (0.00%) Rätoromanisch 3 0 (0.00%)
Mittelhochdeutsch 12 0 (0.00%) Marathi 3 0 (0.00%)
Mittelniederdeutsch 2 4 (0.00%) Sanskrit 3 0 (0.00%)
Mokscha 2 0 (0.00%) Jamaika-Kreolisch 3 0 (0.00%)
Mongolisch 5 2 (0.00%) Birmanisch 3 0 (0.00%)
Morisien 1 0 (0.00%) Malaiisch 2 0 (0.00%)
Nahuatl 1 0 (0.00%) Mittelenglisch 2 0 (0.00%)
Nauruisch 4 0 (0.00%) Kikuyu 2 0 (0.00%)
Nepalesisch 4 0 (0.00%) Tetum 2 0 (0.00%)
Neugriechisch 54 118 (13.04%) Mittelniederdeutsch 2 4 (0.00%)
Niederdeutsch 33 102 (6.80%) Schottisch-Gälisch 2 0 (0.00%)
Niederländisch 78 66 (7.67%) Westflämisch 2 0 (0.00%)
Niedersorbisch 56 0 (0.00%) Abasinisch 2 0 (0.00%)
Niueanisch 1 0 (0.00%) Awarisch 2 0 (0.00%)
Nordfriesisch 1 0 (0.00%) Chantisch 2 0 (0.00%)
Norwegisch 26 2 (0.26%) Dunganisch 2 0 (0.00%)
Novial 1 0 (0.00%) Jakutisch 2 0 (0.00%)
Obersorbisch 42 0 (0.00%) Mari 2 0 (0.00%)
Okzitanisch 16 14 (0.18%) Mokscha 2 0 (0.00%)
Orizaba-Nahuatl 2 0 (0.00%) Tschuwaschisch 2 0 (0.00%)
Oromo 1 0 (0.00%) Tuwinisch 2 0 (0.00%)
Oskisch 4 0 (0.00%) Udmurtisch 2 0 (0.00%)
Osmanisches Türkisch 1 2 (0.00%) Urum 2 0 (0.00%)
Ossetisch 3 0 (0.00%) Sizilianisch 2 0 (0.00%)
Pali 1 0 (0.00%) Marsisch 2 0 (0.00%)
Pandschabi 1 0 (0.00%) Acehnesisch 2 0 (0.00%)
Papiamentu 1 0 (0.00%) Luwisch 2 2 (50.00%)
Paschtu 6 60 (0.78%) Telugu 2 0 (0.00%)
Pennsylvaniadeutsch 1 0 (0.00%) Uigurisch 2 0 (0.00%)
Persisch 26 16 (0.72%) Sogdisch 2 0 (0.00%)
Piemontesisch 1 0 (0.00%) Bengalisch 2 0 (0.00%)
Polabisch 2 0 (0.00%) Orizaba-Nahuatl 2 0 (0.00%)
Polnisch 211 8 (0.03%) Polabisch 2 0 (0.00%)
Portugiesisch 32 36 (5.08%) Ladinisch 2 0 (0.00%)
Prußisch 31 266 (13.49%) Asturisch 1 0 (0.00%)
Rumänisch 25 110 (2.54%) Haitianisch 1 0 (0.00%)
Russisch 61 0 (0.00%) Pennsylvaniadeutsch 1 0 (0.00%)
Rätoromanisch 3 0 (0.00%) Nordfriesisch 1 0 (0.00%)
Sami 1 0 (0.00%) Balinesisch 1 0 (0.00%)
Samoanisch 1 0 (0.00%) Kornisch 1 0 (0.00%)
Sanskrit 3 0 (0.00%) Huastekisches Ost-Nahuatl 1 0 (0.00%)
Sardisch 4 0 (0.00%) Tok Pisin 1 0 (0.00%)
Schottisch-Gälisch 2 0 (0.00%) Niueanisch 1 0 (0.00%)
Schwedisch 149 0 (0.00%) Papiamentu 1 0 (0.00%)
Scots 4 16 (48.84%) Malayalam 1 0 (0.00%)
Serbisch 41 14 (0.36%) Kannada 1 0 (0.00%)
Serbokroatisch 6 0 (0.00%) Urartäisch 1 0 (0.00%)
Sesotho 1 0 (0.00%) Tuvaluisch 1 0 (0.00%)
Shona 5 0 (0.00%) Chakassisch 1 0 (0.00%)
Sindarin 1 0 (0.00%) Turkmenisch 1 0 (0.00%)
Sindhi 6 6 (0.00%) Kumükisch 1 0 (0.00%)
Singhalesisch 1 0 (0.00%) Nahuatl 1 0 (0.00%)
Sizilianisch 2 0 (0.00%) Huastekisches West-Nahuatl 1 2 (0.00%)
Slowakisch 33 0 (0.00%) Aragonesisch 1 0 (0.00%)
Slowenisch 33 0 (0.00%) Zentrales Puebla-Nahuatl 1 0 (0.00%)
Sogdisch 2 0 (0.00%) Korsisch 1 0 (0.00%)
Somalisch 1 0 (0.00%) Tamil 1 0 (0.00%)
Spanisch 38 0 (0.00%) Sesotho 1 0 (0.00%)
Suaheli 4 34 (77.27%) Manx 1 0 (0.00%)
Sumerisch 11 80 (25.00%) Samoanisch 1 0 (0.00%)
Swanisch 1 0 (0.00%) Somalisch 1 0 (0.00%)
Südpikenisch 8 0 (0.00%) isiZulu 1 0 (0.00%)
Tadschikisch 5 0 (0.00%) Sindarin 1 0 (0.00%)
Tagalog 5 4 (62.50%) Hurritisch 1 0 (0.00%)
Tahitianisch 1 0 (0.00%) Fulfulde 1 0 (0.00%)
Tamil 1 0 (0.00%) Bairisch 1 0 (0.00%)
Tatarisch 4 0 (0.00%) Pali 1 0 (0.00%)
Telugu 2 0 (0.00%) Sami 1 0 (0.00%)
Temascaltepec-Nahuatl 3 10 (25.00%) Altsächsisch 1 0 (0.00%)
Tetelcingo-Nahuatl 4 4 (25.00%) Twi 1 0 (0.00%)
Tetum 2 0 (0.00%) Novial 1 0 (0.00%)
Thai 9 0 (0.00%) Zentral-Alaska-Yupik 1 0 (0.00%)
Tibetisch 1 0 (0.00%) Oromo 1 0 (0.00%)
Tigrinya 1 0 (0.00%) Swanisch 1 0 (0.00%)
Tok Pisin 1 0 (0.00%) Gurage 1 0 (0.00%)
Torwali 1 0 (0.00%) Inupiaq 1 0 (0.00%)
Tschechisch 78 0 (0.00%) Khowar 1 0 (0.00%)
Tschetschenisch 3 0 (0.00%) Torwali 1 0 (0.00%)
Tschuktschisch 3 0 (0.00%) Assamesisch 1 0 (0.00%)
Tschuwaschisch 2 0 (0.00%) Gujarati 1 0 (0.00%)
Turkmenisch 1 0 (0.00%) Pandschabi 1 0 (0.00%)
Tuvaluisch 1 0 (0.00%) Laotisch 1 0 (0.00%)
Tuwinisch 2 0 (0.00%) Catawba 1 0 (0.00%)
Twi 1 0 (0.00%) Mittelgriechisch 1 0 (0.00%)
Türkisch 23 80 (1.50%) Guerrero-Nahuatl 1 2 (0.00%)
Udmurtisch 2 0 (0.00%) Klamath 1 0 (0.00%)
Ugaritisch 1 0 (0.00%) Amharisch 1 0 (0.00%)
Uigurisch 2 0 (0.00%) Durango-Nahuatl 1 0 (0.00%)
Ukrainisch 74 0 (0.00%) Piemontesisch 1 0 (0.00%)
Umbrisch 5 0 (0.00%) Alttschechisch 1 0 (0.00%)
Ungarisch 22 42 (4.35%) Kotava 1 2 (100.00%)
Urartäisch 1 0 (0.00%) Láadan 1 0 (0.00%)
Urdu 17 124 (2.18%) Brahui 1 0 (0.00%)
Urum 2 0 (0.00%) Morisien 1 0 (0.00%)
Usbekisch 17 0 (0.00%) Tahitianisch 1 0 (0.00%)
Venezianisch 5 0 (0.00%) Kirchenslawisch 1 0 (0.00%)
Vietnamesisch 10 0 (0.00%) Osmanisches Türkisch 1 2 (0.00%)
Volapük 3 0 (0.00%) Kantonesisch 1 0 (0.00%)
Walisisch 12 0 (0.00%) Ugaritisch 1 0 (0.00%)
Weißrussisch 41 0 (0.00%) Mandschurisch 1 0 (0.00%)
West-Pandschabi 5 30 (3.33%) Tibetisch 1 0 (0.00%)
Westflämisch 2 0 (0.00%) Konkani 1 0 (0.00%)
Westfriesisch 11 34 (31.68%) Maledivisch 1 0 (0.00%)
Zentral-Alaska-Yupik 1 0 (0.00%) Baktrisch 1 0 (0.00%)
Zentral-Nahuatl 11 50 (10.00%) Tigrinya 1 0 (0.00%)
Zentrales Puebla-Nahuatl 1 0 (0.00%) Khmer 1 0 (0.00%)
isiZulu 1 0 (0.00%) Singhalesisch 1 0 (0.00%)
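For further analysis, the rows above can be read into structured records. The following is a minimal sketch that assumes each row has the shape `<language> <table forms> <errors> (<percent>%)` and that one text line may hold more than one row. The names `Row` and `parse_rows` are illustrative.

```python
import re
from typing import List, NamedTuple

# One row: language name (may contain spaces), two integers, a percentage.
ROW = re.compile(r"(\S.*?)\s+(\d+)\s+(\d+)\s+\((\d+(?:\.\d+)?)%\)")


class Row(NamedTuple):
    language: str
    table_forms: int
    errors: int
    pct_affected: float


def parse_rows(line: str) -> List[Row]:
    """Parse every row found on one text line of the table."""
    return [
        Row(m.group(1), int(m.group(2)), int(m.group(3)), float(m.group(4)))
        for m in ROW.finditer(line)
    ]


lines = [
    "Afrikaans 16 78 (10.13%) Schwedisch 149 0 (0.00%)",
    "Huastekisches Ost-Nahuatl 1 0 (0.00%) Tok Pisin 1 0 (0.00%)",
    "Klassisches Nahuatl 17 98 (40.61%)",
]
rows = [row for line in lines for row in parse_rows(line)]

# Language with the highest share of affected words in this sample.
worst = max(rows, key=lambda r: r.pct_affected)
print(worst.language, worst.pct_affected)
```

Because the same data appears in both halves of the table, deduplicating the parsed rows (for example with `set(rows)`) would be needed before computing totals over the full table.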

This page is part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-11-19 from the dewiktionary dump dated 2025-11-02 using wiktextract (2f66b98 and a050b89). The data shown on this site has been post-processed: various details (e.g., extra categories) have been removed, some information has been disambiguated, and additional data has been merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.