Wiktionary data extraction errors and warnings

Inflection check

This page lists the different kinds of inflection tables. When wiktextract parses word heads and inflection tables, it assigns tags describing grammatical or contextual information to the forms it encounters. The tags and forms found in each head section or table are first kept separate from those of other head sections and tables. Later they are merged into table types, so that every table of a given type contains the same number of word forms, with the same tags for those forms.
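The grouping described above can be illustrated with a minimal sketch. This is not wiktextract's actual implementation; the function names `table_signature` and `group_by_table_type` and the sample entries are hypothetical, and the only idea taken from the text is that tables with the same number of forms and the same tags end up in the same table type.

```python
from collections import defaultdict


def table_signature(forms):
    """Return a hashable signature for a table: one sorted tag tuple per form.

    Two tables get the same signature only if they have the same number of
    forms and the same tags on those forms (order of tags and forms ignored).
    """
    return tuple(sorted(tuple(sorted(f["tags"])) for f in forms))


def group_by_table_type(tables):
    """Map each signature to the list of words whose table has that signature."""
    groups = defaultdict(list)
    for word, forms in tables:
        groups[table_signature(forms)].append(word)
    return dict(groups)


# Hypothetical sample data, invented for illustration only.
tables = [
    ("Hund", [{"form": "Hund", "tags": ["nominative", "singular"]},
              {"form": "Hunde", "tags": ["nominative", "plural"]}]),
    ("Tag", [{"form": "Tag", "tags": ["nominative", "singular"]},
             {"form": "Tage", "tags": ["plural", "nominative"]}]),
    ("Milch", [{"form": "Milch", "tags": ["nominative", "singular"]}]),
]

groups = group_by_table_type(tables)
for signature, words in groups.items():
    print(len(signature), "forms:", words)
```

Here "Hund" and "Tag" fall into one table type even though the tags are listed in a different order, while "Milch" forms its own type because its table has fewer forms.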

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typos, and badly formatted Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.
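As a rough sketch of that heuristic, the snippet below counts how often each table type occurs and reports the ones with very few instances. The function name `rare_table_types`, the threshold `max_instances`, and the string labels standing in for table types are all assumptions made for illustration; the page itself does not state a cutoff.

```python
from collections import Counter


def rare_table_types(signatures, max_instances=2):
    """Return table types seen at most `max_instances` times, sorted by name.

    `signatures` holds one entry per parsed table; rare types are candidates
    for manual review, not confirmed errors.
    """
    counts = Counter(signatures)
    return sorted(sig for sig, n in counts.items() if n <= max_instances)


# Hypothetical labels standing in for real table types.
signatures = ["sg+pl"] * 50 + ["sg-only"] * 12 + ["sg+pl+dual"]

print(rare_table_types(signatures))  # ['sg+pl+dual']
```

A type that appears only once, like the last one here, is worth checking against the original Wiktionary entry before concluding that anything is wrong.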

Language (ascending) Table forms Errors (% affected words) Language Table forms (descending) Errors (% affected words)
Abasinisch 2 0 (0.00%) Deutsch 31303 4 (0.00%)
Abchasisch 3 0 (0.00%) Latein 248 556 (2.83%)
Acehnesisch 2 0 (0.00%) Polnisch 211 8 (0.03%)
Adygeisch 3 0 (0.00%) Altgriechisch 158 738 (6.44%)
Afrikaans 16 82 (10.13%) Schwedisch 149 0 (0.00%)
Akkadisch 14 162 (3.16%) Englisch 107 192 (42.57%)
Albanisch 43 308 (3.61%) Italienisch 78 42 (0.02%)
Altaisch 3 0 (0.00%) Tschechisch 78 0 (0.00%)
Altenglisch 19 64 (28.57%) Französisch 75 214 (28.99%)
Altfranzösisch 3 0 (0.00%) Ukrainisch 74 0 (0.00%)
Altgriechisch 158 738 (6.44%) Niederländisch 72 148 (7.67%)
Althochdeutsch 19 24 (0.00%) Russisch 61 0 (0.00%)
Altirisch 5 0 (0.00%) Niedersorbisch 56 0 (0.00%)
Altkirchenslawisch 16 80 (21.21%) Neugriechisch 54 224 (13.39%)
Altnordisch 7 0 (0.00%) Arabisch 51 186 (7.65%)
Altsächsisch 1 0 (0.00%) Armenisch 49 134 (2.51%)
Alttschechisch 1 0 (0.00%) Isländisch 45 56 (9.34%)
Amharisch 1 0 (0.00%) Albanisch 43 308 (3.61%)
Arabisch 51 186 (7.65%) Obersorbisch 42 0 (0.00%)
Aragonesisch 1 0 (0.00%) Weißrussisch 41 0 (0.00%)
Armenisch 49 134 (2.51%) Serbisch 41 14 (0.36%)
Aserbaidschanisch 14 8 (2.84%) Kroatisch 40 0 (0.00%)
Assamesisch 1 0 (0.00%) Spanisch 37 60 (30.16%)
Asturisch 1 0 (0.00%) Bosnisch 34 0 (0.00%)
Awarisch 2 0 (0.00%) Niederdeutsch 33 152 (6.80%)
Bairisch 1 0 (0.00%) Slowakisch 33 0 (0.00%)
Baktrisch 1 0 (0.00%) Slowenisch 33 0 (0.00%)
Balinesisch 1 0 (0.00%) Portugiesisch 32 40 (5.08%)
Baschkirisch 3 0 (0.00%) Gotisch 32 32 (20.97%)
Baskisch 15 22 (4.97%) Finnisch 31 82 (4.56%)
Belutschi 4 4 (0.00%) Prußisch 31 316 (13.49%)
Bengalisch 2 0 (0.00%) Ido 29 32 (40.55%)
Birmanisch 3 0 (0.00%) Mazedonisch 29 0 (0.00%)
Bosnisch 34 0 (0.00%) Irisch 27 2 (3.12%)
Brahui 1 0 (0.00%) Bulgarisch 27 124 (3.69%)
Bretonisch 5 0 (0.00%) Hebräisch 27 16 (1.85%)
Bulgarisch 27 124 (3.69%) Dänisch 26 2 (10.85%)
Burjatisch 3 0 (0.00%) Norwegisch 26 2 (0.26%)
Catawba 1 0 (0.00%) Persisch 26 16 (0.72%)
Chakassisch 1 0 (0.00%) Esperanto 25 0 (0.00%)
Chantisch 2 0 (0.00%) Rumänisch 25 122 (2.54%)
Chinesisch 23 0 (0.00%) Chinesisch 23 0 (0.00%)
Deutsch 31303 4 (0.00%) Ungarisch 22 42 (4.52%)
Dunganisch 2 0 (0.00%) Türkisch 22 80 (1.57%)
Durango-Nahuatl 1 0 (0.00%) Lettisch 21 32 (9.93%)
Dänisch 26 2 (10.85%) Färöisch 21 0 (0.00%)
Englisch 107 192 (42.57%) Jiddisch 20 0 (0.00%)
Esperanto 25 0 (0.00%) Althochdeutsch 19 24 (0.00%)
Estnisch 6 22 (16.83%) Litauisch 19 0 (0.00%)
Faliskisch 6 0 (0.00%) Altenglisch 19 64 (28.57%)
Finnisch 31 82 (4.56%) Usbekisch 17 0 (0.00%)
Französisch 75 214 (28.99%) Okzitanisch 17 18 (0.19%)
Friaulisch 5 0 (0.00%) Klassisches Nahuatl 17 98 (40.61%)
Frühneuhochdeutsch 11 0 (0.00%) Urdu 17 124 (2.18%)
Fulfulde 1 0 (0.00%) Katalanisch 16 44 (17.98%)
Färöisch 21 0 (0.00%) Afrikaans 16 82 (10.13%)
Galicisch 4 0 (0.00%) Altkirchenslawisch 16 80 (21.21%)
Georgisch 11 0 (0.00%) Baskisch 15 22 (4.97%)
Gotisch 32 32 (20.97%) Kurdisch 14 30 (0.00%)
Guerrero-Nahuatl 1 2 (0.00%) Levantinisches Arabisch 14 18 (0.41%)
Gujarati 1 0 (0.00%) Aserbaidschanisch 14 8 (2.84%)
Gurage 1 0 (0.00%) Akkadisch 14 162 (3.16%)
Haitianisch 1 0 (0.00%) Walisisch 12 0 (0.00%)
Hausa 5 12 (16.85%) Mittelhochdeutsch 12 0 (0.00%)
Hawaiianisch 3 0 (0.00%) Koreanisch 12 0 (0.00%)
Hebräisch 27 16 (1.85%) Maltesisch 11 0 (0.00%)
Hethitisch 8 0 (0.00%) Georgisch 11 0 (0.00%)
Hindi 6 0 (0.00%) Zentral-Nahuatl 11 50 (10.00%)
Huastekisches Ost-Nahuatl 1 0 (0.00%) Frühneuhochdeutsch 11 0 (0.00%)
Huastekisches West-Nahuatl 1 2 (0.00%) Sumerisch 11 80 (25.00%)
Huastekisches Zentral-Nahuatl 8 40 (38.89%) Vietnamesisch 10 0 (0.00%)
Hurritisch 1 0 (0.00%) International 9 0 (0.00%)
Ido 29 32 (40.55%) Westfriesisch 9 60 (31.68%)
Indonesisch 6 0 (0.00%) Thai 9 0 (0.00%)
Interlingua 6 0 (0.00%) Huastekisches Zentral-Nahuatl 8 40 (38.89%)
International 9 0 (0.00%) Südpikenisch 8 0 (0.00%)
Inuktitut 4 0 (0.00%) Hethitisch 8 0 (0.00%)
Inupiaq 1 0 (0.00%) Kaschubisch 7 0 (0.00%)
Irisch 27 2 (3.12%) Krimtatarisch 7 0 (0.00%)
Isländisch 45 56 (9.34%) Altnordisch 7 0 (0.00%)
Italienisch 78 42 (0.02%) Interlingua 6 0 (0.00%)
Jakutisch 2 0 (0.00%) Japanisch 6 0 (0.00%)
Jamaika-Kreolisch 5 12 (28.57%) Estnisch 6 22 (16.83%)
Japanisch 6 0 (0.00%) Indonesisch 6 0 (0.00%)
Jiddisch 20 0 (0.00%) Paschtu 6 60 (0.78%)
Kabardinisch 3 0 (0.00%) Hindi 6 0 (0.00%)
Kannada 1 0 (0.00%) Serbokroatisch 6 0 (0.00%)
Kantonesisch 1 0 (0.00%) Sindhi 6 6 (0.00%)
Karatschai-Balkarisch 3 0 (0.00%) Faliskisch 6 0 (0.00%)
Kasachisch 4 0 (0.00%) Koptisch 6 0 (0.00%)
Kaschubisch 7 0 (0.00%) Luxemburgisch 5 92 (28.57%)
Katalanisch 16 44 (17.98%) Friaulisch 5 0 (0.00%)
Khmer 1 0 (0.00%) Shona 5 0 (0.00%)
Khowar 1 0 (0.00%) Hausa 5 12 (16.85%)
Kikuyu 2 0 (0.00%) Bretonisch 5 0 (0.00%)
Kirchenslawisch 1 0 (0.00%) Altirisch 5 0 (0.00%)
Kirgisisch 4 0 (0.00%) Tagalog 5 6 (62.50%)
Klamath 1 0 (0.00%) Mongolisch 5 2 (0.00%)
Klassisches Nahuatl 17 98 (40.61%) Tadschikisch 5 0 (0.00%)
Komi 3 0 (0.00%) Venezianisch 5 0 (0.00%)
Konkani 1 0 (0.00%) West-Pandschabi 5 30 (3.33%)
Koptisch 6 0 (0.00%) Jamaika-Kreolisch 5 12 (28.57%)
Koreanisch 12 0 (0.00%) Umbrisch 5 0 (0.00%)
Kornisch 1 0 (0.00%) Galicisch 4 0 (0.00%)
Korsisch 1 0 (0.00%) Nauruisch 4 0 (0.00%)
Kotava 1 2 (100.00%) Sardisch 4 0 (0.00%)
Krimtatarisch 7 0 (0.00%) Scots 4 18 (48.84%)
Kroatisch 40 0 (0.00%) Suaheli 4 38 (77.27%)
Kumükisch 1 0 (0.00%) Tatarisch 4 0 (0.00%)
Kurdisch 14 30 (0.00%) Kasachisch 4 0 (0.00%)
Ladinisch 2 0 (0.00%) Kirgisisch 4 0 (0.00%)
Laotisch 1 0 (0.00%) Tetelcingo-Nahuatl 4 4 (25.00%)
Latein 248 556 (2.83%) Inuktitut 4 0 (0.00%)
Lettgallisch 3 0 (0.00%) Oskisch 4 0 (0.00%)
Lettisch 21 32 (9.93%) Nepalesisch 4 0 (0.00%)
Levantinisches Arabisch 14 18 (0.41%) Belutschi 4 4 (0.00%)
Litauisch 19 0 (0.00%) Lettgallisch 3 0 (0.00%)
Luwisch 2 2 (50.00%) Hawaiianisch 3 0 (0.00%)
Luxemburgisch 5 92 (28.57%) Maori 3 0 (0.00%)
Láadan 1 0 (0.00%) Volapük 3 0 (0.00%)
Malaiisch 2 0 (0.00%) Baschkirisch 3 0 (0.00%)
Malayalam 1 0 (0.00%) Altfranzösisch 3 0 (0.00%)
Maledivisch 1 0 (0.00%) Abchasisch 3 0 (0.00%)
Maltesisch 11 0 (0.00%) Adygeisch 3 0 (0.00%)
Mandschurisch 1 0 (0.00%) Altaisch 3 0 (0.00%)
Manx 1 0 (0.00%) Burjatisch 3 0 (0.00%)
Maori 3 0 (0.00%) Kabardinisch 3 0 (0.00%)
Marathi 3 0 (0.00%) Karatschai-Balkarisch 3 0 (0.00%)
Mari 2 0 (0.00%) Komi 3 0 (0.00%)
Marsisch 2 0 (0.00%) Ossetisch 3 0 (0.00%)
Mazedonisch 29 0 (0.00%) Tschetschenisch 3 0 (0.00%)
Mittelenglisch 2 0 (0.00%) Tschuktschisch 3 0 (0.00%)
Mittelgriechisch 1 0 (0.00%) Temascaltepec-Nahuatl 3 10 (25.00%)
Mittelhochdeutsch 12 0 (0.00%) Rätoromanisch 3 0 (0.00%)
Mittelniederdeutsch 2 4 (0.00%) Marathi 3 0 (0.00%)
Mokscha 2 0 (0.00%) Sanskrit 3 0 (0.00%)
Mongolisch 5 2 (0.00%) Birmanisch 3 0 (0.00%)
Morisien 1 0 (0.00%) Malaiisch 2 0 (0.00%)
Nahuatl 1 0 (0.00%) Mittelenglisch 2 0 (0.00%)
Nauruisch 4 0 (0.00%) Kikuyu 2 0 (0.00%)
Nepalesisch 4 0 (0.00%) Tetum 2 0 (0.00%)
Neugriechisch 54 224 (13.39%) Mittelniederdeutsch 2 4 (0.00%)
Niederdeutsch 33 152 (6.80%) Schottisch-Gälisch 2 0 (0.00%)
Niederländisch 72 148 (7.67%) Westflämisch 2 0 (0.00%)
Niedersorbisch 56 0 (0.00%) Abasinisch 2 0 (0.00%)
Niueanisch 1 0 (0.00%) Awarisch 2 0 (0.00%)
Nordfriesisch 1 0 (0.00%) Chantisch 2 0 (0.00%)
Norwegisch 26 2 (0.26%) Dunganisch 2 0 (0.00%)
Novial 1 0 (0.00%) Jakutisch 2 0 (0.00%)
Obersorbisch 42 0 (0.00%) Mari 2 0 (0.00%)
Okzitanisch 17 18 (0.19%) Mokscha 2 0 (0.00%)
Orizaba-Nahuatl 2 0 (0.00%) Tschuwaschisch 2 0 (0.00%)
Oromo 1 0 (0.00%) Tuwinisch 2 0 (0.00%)
Oskisch 4 0 (0.00%) Udmurtisch 2 0 (0.00%)
Osmanisches Türkisch 1 2 (0.00%) Urum 2 0 (0.00%)
Ossetisch 3 0 (0.00%) Sizilianisch 2 0 (0.00%)
Pali 1 0 (0.00%) Marsisch 2 0 (0.00%)
Pandschabi 1 0 (0.00%) Acehnesisch 2 0 (0.00%)
Papiamentu 1 0 (0.00%) Luwisch 2 2 (50.00%)
Paschtu 6 60 (0.78%) Telugu 2 0 (0.00%)
Pennsylvaniadeutsch 1 0 (0.00%) Uigurisch 2 0 (0.00%)
Persisch 26 16 (0.72%) Sogdisch 2 0 (0.00%)
Piemontesisch 1 0 (0.00%) Bengalisch 2 0 (0.00%)
Polabisch 2 0 (0.00%) Orizaba-Nahuatl 2 0 (0.00%)
Polnisch 211 8 (0.03%) Polabisch 2 0 (0.00%)
Portugiesisch 32 40 (5.08%) Ladinisch 2 0 (0.00%)
Prußisch 31 316 (13.49%) Asturisch 1 0 (0.00%)
Rumänisch 25 122 (2.54%) Haitianisch 1 0 (0.00%)
Russisch 61 0 (0.00%) Pennsylvaniadeutsch 1 0 (0.00%)
Rätoromanisch 3 0 (0.00%) Nordfriesisch 1 0 (0.00%)
Sami 1 0 (0.00%) Balinesisch 1 0 (0.00%)
Samoanisch 1 0 (0.00%) Kornisch 1 0 (0.00%)
Sanskrit 3 0 (0.00%) Huastekisches Ost-Nahuatl 1 0 (0.00%)
Sardisch 4 0 (0.00%) Tok Pisin 1 0 (0.00%)
Schottisch-Gälisch 2 0 (0.00%) Niueanisch 1 0 (0.00%)
Schwedisch 149 0 (0.00%) Papiamentu 1 0 (0.00%)
Scots 4 18 (48.84%) Malayalam 1 0 (0.00%)
Serbisch 41 14 (0.36%) Kannada 1 0 (0.00%)
Serbokroatisch 6 0 (0.00%) Urartäisch 1 0 (0.00%)
Sesotho 1 0 (0.00%) Tuvaluisch 1 0 (0.00%)
Shona 5 0 (0.00%) Chakassisch 1 0 (0.00%)
Sindarin 1 0 (0.00%) Turkmenisch 1 0 (0.00%)
Sindhi 6 6 (0.00%) Kumükisch 1 0 (0.00%)
Singhalesisch 1 0 (0.00%) Nahuatl 1 0 (0.00%)
Sizilianisch 2 0 (0.00%) Huastekisches West-Nahuatl 1 2 (0.00%)
Slowakisch 33 0 (0.00%) Aragonesisch 1 0 (0.00%)
Slowenisch 33 0 (0.00%) Zentrales Puebla-Nahuatl 1 0 (0.00%)
Sogdisch 2 0 (0.00%) Korsisch 1 0 (0.00%)
Somalisch 1 0 (0.00%) Tamil 1 0 (0.00%)
Spanisch 37 60 (30.16%) Sesotho 1 0 (0.00%)
Suaheli 4 38 (77.27%) Manx 1 0 (0.00%)
Sumerisch 11 80 (25.00%) Samoanisch 1 0 (0.00%)
Swanisch 1 0 (0.00%) Somalisch 1 0 (0.00%)
Südpikenisch 8 0 (0.00%) isiZulu 1 0 (0.00%)
Tadschikisch 5 0 (0.00%) Sindarin 1 0 (0.00%)
Tagalog 5 6 (62.50%) Hurritisch 1 0 (0.00%)
Tahitianisch 1 0 (0.00%) Fulfulde 1 0 (0.00%)
Tamil 1 0 (0.00%) Bairisch 1 0 (0.00%)
Tatarisch 4 0 (0.00%) Pali 1 0 (0.00%)
Telugu 2 0 (0.00%) Sami 1 0 (0.00%)
Temascaltepec-Nahuatl 3 10 (25.00%) Altsächsisch 1 0 (0.00%)
Tetelcingo-Nahuatl 4 4 (25.00%) Twi 1 0 (0.00%)
Tetum 2 0 (0.00%) Novial 1 0 (0.00%)
Thai 9 0 (0.00%) Zentral-Alaska-Yupik 1 0 (0.00%)
Tibetisch 1 0 (0.00%) Oromo 1 0 (0.00%)
Tigrinya 1 0 (0.00%) Swanisch 1 0 (0.00%)
Tok Pisin 1 0 (0.00%) Gurage 1 0 (0.00%)
Torwali 1 0 (0.00%) Inupiaq 1 0 (0.00%)
Tschechisch 78 0 (0.00%) Khowar 1 0 (0.00%)
Tschetschenisch 3 0 (0.00%) Torwali 1 0 (0.00%)
Tschuktschisch 3 0 (0.00%) Assamesisch 1 0 (0.00%)
Tschuwaschisch 2 0 (0.00%) Gujarati 1 0 (0.00%)
Turkmenisch 1 0 (0.00%) Pandschabi 1 0 (0.00%)
Tuvaluisch 1 0 (0.00%) Laotisch 1 0 (0.00%)
Tuwinisch 2 0 (0.00%) Catawba 1 0 (0.00%)
Twi 1 0 (0.00%) Mittelgriechisch 1 0 (0.00%)
Türkisch 22 80 (1.57%) Guerrero-Nahuatl 1 2 (0.00%)
Udmurtisch 2 0 (0.00%) Klamath 1 0 (0.00%)
Ugaritisch 1 0 (0.00%) Amharisch 1 0 (0.00%)
Uigurisch 2 0 (0.00%) Durango-Nahuatl 1 0 (0.00%)
Ukrainisch 74 0 (0.00%) Piemontesisch 1 0 (0.00%)
Umbrisch 5 0 (0.00%) Alttschechisch 1 0 (0.00%)
Ungarisch 22 42 (4.52%) Kotava 1 2 (100.00%)
Urartäisch 1 0 (0.00%) Láadan 1 0 (0.00%)
Urdu 17 124 (2.18%) Brahui 1 0 (0.00%)
Urum 2 0 (0.00%) Morisien 1 0 (0.00%)
Usbekisch 17 0 (0.00%) Tahitianisch 1 0 (0.00%)
Venezianisch 5 0 (0.00%) Kirchenslawisch 1 0 (0.00%)
Vietnamesisch 10 0 (0.00%) Osmanisches Türkisch 1 2 (0.00%)
Volapük 3 0 (0.00%) Kantonesisch 1 0 (0.00%)
Walisisch 12 0 (0.00%) Ugaritisch 1 0 (0.00%)
Weißrussisch 41 0 (0.00%) Mandschurisch 1 0 (0.00%)
West-Pandschabi 5 30 (3.33%) Tibetisch 1 0 (0.00%)
Westflämisch 2 0 (0.00%) Konkani 1 0 (0.00%)
Westfriesisch 9 60 (31.68%) Maledivisch 1 0 (0.00%)
Zentral-Alaska-Yupik 1 0 (0.00%) Baktrisch 1 0 (0.00%)
Zentral-Nahuatl 11 50 (10.00%) Tigrinya 1 0 (0.00%)
Zentrales Puebla-Nahuatl 1 0 (0.00%) Khmer 1 0 (0.00%)
isiZulu 1 0 (0.00%) Singhalesisch 1 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-10-01 from the dewiktionary dump dated 2025-09-20 using wiktextract (ea0d853 and 1ab82da). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
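For readers who want to inspect the raw data themselves, the following is a minimal sketch of reading wiktextract-style JSON Lines output and listing each word's forms with their tags. It assumes one JSON object per line with `word` and `forms` keys, where each form has `form` and `tags`; the helper `iter_forms` and the two sample records are hypothetical and invented for illustration, so check the field names against the actual download.

```python
import io
import json


def iter_forms(lines):
    """Yield (word, form, tags) for every form in JSON Lines input.

    Entries without a "forms" list are skipped; missing tags become ().
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue
        entry = json.loads(line)
        for form in entry.get("forms", []):
            yield entry.get("word"), form.get("form"), tuple(form.get("tags", []))


# Invented sample records standing in for a real downloaded file.
sample = io.StringIO(
    '{"word": "Hund", "lang": "Deutsch", '
    '"forms": [{"form": "Hunde", "tags": ["nominative", "plural"]}]}\n'
    '{"word": "und", "lang": "Deutsch"}\n'
)

rows = list(iter_forms(sample))
for word, form, tags in rows:
    print(word, form, tags)
```

With a real file, the same function can be fed an open file handle instead of the `io.StringIO` sample.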

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.