Wiktionary data extraction errors and warnings

Inflection check

This page lists the different kinds of inflection tables. When wiktextract parses word heads and tables, it assigns tags describing grammatical or contextual information to the forms it encounters. The forms and tags found in each head section or table are first kept separate from those of other head sections and tables; later they are merged into table types, each of which groups the heads and tables that contain the same number of word forms with the same tags for those forms.
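The grouping described above can be illustrated with a minimal sketch. It is not wiktextract's actual implementation: the function names, the `(word, forms)` input shape, and the sample words are assumptions made for illustration. Each table is reduced to a signature (one sorted tag tuple per form, in table order), and tables with identical signatures fall into the same table type.

```python
from collections import defaultdict


def table_signature(forms):
    """Return a hashable signature: one sorted tag tuple per form, in table order."""
    return tuple(tuple(sorted(form["tags"])) for form in forms)


def group_into_table_types(tables):
    """Map each signature to the list of words whose table has that signature."""
    types = defaultdict(list)
    for word, forms in tables:
        types[table_signature(forms)].append(word)
    return dict(types)


# Hypothetical sample data; real tables carry many more forms and tags.
tables = [
    ("Hund", [{"form": "Hund", "tags": ["nominative", "singular"]},
              {"form": "Hunde", "tags": ["nominative", "plural"]}]),
    ("Tag", [{"form": "Tag", "tags": ["singular", "nominative"]},
             {"form": "Tage", "tags": ["plural", "nominative"]}]),
    ("Milch", [{"form": "Milch", "tags": ["nominative", "singular"]}]),
]

table_types = group_into_table_types(tables)
for signature, words in table_types.items():
    print(len(signature), "forms:", words)
```

Sorting the tags inside each form makes the signature independent of tag order, so "Hund" and "Tag" end up in one table type while "Milch", with a single form, gets its own.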

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typos, and badly formatted Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some minor error in the original data.
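As a rough sketch of that heuristic, the snippet below filters table types down to those with very few instances. The threshold `max_instances`, the function name, and the string keys standing in for table signatures are all assumptions for illustration; the flagged entries are only candidates for manual review, not confirmed errors.

```python
def rare_table_types(table_types, max_instances=1):
    """Return the table types used by at most max_instances words."""
    return {
        signature: words
        for signature, words in table_types.items()
        if len(words) <= max_instances
    }


# Hypothetical mapping from a table signature to the words that use it.
table_types = {
    "nom-sg|nom-pl": ["Hund", "Tag", "Haus"],
    "nom-sg": ["Milch"],
    "nom-sg|nom-pl|nom-pl": ["Wort"],
}

suspects = rare_table_types(table_types)
for signature, words in suspects.items():
    print(signature, "->", words)
```

A rare table type is not necessarily wrong (a singular-only noun is legitimate), so the output is best treated as a worklist to check against the original Wiktionary entries.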

Left columns, sorted by language: Language, Table forms, Errors (% affected words). Right columns, sorted by table forms (descending): Language, Table forms, Errors (% affected words).
Abasinisch 1 0 (0.00%) Deutsch 30845 4 (0.00%)
Abchasisch 2 0 (0.00%) Latein 240 536 (2.91%)
Acehnesisch 2 0 (0.00%) Polnisch 210 8 (0.03%)
Adygeisch 2 0 (0.00%) Schwedisch 147 154 (0.00%)
Afrikaans 16 82 (10.13%) Altgriechisch 144 738 (6.44%)
Akkadisch 14 162 (3.16%) Englisch 97 160 (42.59%)
Albanisch 43 308 (3.61%) Tschechisch 76 0 (0.00%)
Altaisch 2 0 (0.00%) Italienisch 75 42 (0.02%)
Altenglisch 19 64 (28.57%) Ukrainisch 73 0 (0.00%)
Altfranzösisch 3 0 (0.00%) Französisch 72 212 (28.97%)
Altgriechisch 144 738 (6.44%) Niederländisch 72 148 (7.67%)
Althochdeutsch 19 24 (0.00%) Russisch 57 0 (0.00%)
Altirisch 5 0 (0.00%) Niedersorbisch 55 0 (0.00%)
Altkirchenslawisch 16 78 (21.21%) Neugriechisch 54 222 (13.39%)
Altnordisch 7 0 (0.00%) Armenisch 49 134 (2.51%)
Altsächsisch 1 0 (0.00%) Arabisch 48 210 (6.19%)
Alttschechisch 1 0 (0.00%) Albanisch 43 308 (3.61%)
Amharisch 1 0 (0.00%) Isländisch 43 56 (9.34%)
Arabisch 48 210 (6.19%) Obersorbisch 42 0 (0.00%)
Aragonesisch 1 0 (0.00%) Kroatisch 40 0 (0.00%)
Armenisch 49 134 (2.51%) Weißrussisch 39 0 (0.00%)
Aserbaidschanisch 14 12 (2.84%) Serbisch 39 158 (3.65%)
Assamesisch 1 0 (0.00%) Spanisch 35 60 (30.16%)
Asturisch 1 0 (0.00%) Bosnisch 34 0 (0.00%)
Awarisch 1 0 (0.00%) Niederdeutsch 33 152 (6.80%)
Bairisch 1 0 (0.00%) Gotisch 32 30 (20.97%)
Baktrisch 1 0 (0.00%) Slowakisch 31 0 (0.00%)
Balinesisch 1 0 (0.00%) Portugiesisch 31 40 (5.08%)
Baschkirisch 2 0 (0.00%) Slowenisch 31 0 (0.00%)
Baskisch 15 22 (4.97%) Finnisch 31 82 (4.56%)
Belutschi 4 4 (0.00%) Prußisch 31 302 (13.49%)
Bengalisch 2 0 (0.00%) Ido 29 40 (40.55%)
Birmanisch 3 0 (0.00%) Irisch 27 4 (3.12%)
Bosnisch 34 0 (0.00%) Mazedonisch 27 0 (0.00%)
Brahui 1 0 (0.00%) Hebräisch 27 16 (1.85%)
Bretonisch 5 0 (0.00%) Persisch 25 16 (0.72%)
Bulgarisch 19 140 (29.64%) Dänisch 24 2 (10.85%)
Burjatisch 2 0 (0.00%) Norwegisch 24 2 (0.26%)
Catawba 1 0 (0.00%) Esperanto 23 8 (0.00%)
Chakassisch 1 0 (0.00%) Rumänisch 23 112 (2.54%)
Chantisch 1 0 (0.00%) Türkisch 22 80 (1.64%)
Chinesisch 12 0 (0.00%) Ungarisch 21 42 (4.52%)
Deutsch 30845 4 (0.00%) Färöisch 20 14 (25.74%)
Dunganisch 1 0 (0.00%) Jiddisch 20 0 (0.00%)
Durango-Nahuatl 1 0 (0.00%) Lettisch 19 32 (9.93%)
Dänisch 24 2 (10.85%) Bulgarisch 19 140 (29.64%)
Englisch 97 160 (42.59%) Althochdeutsch 19 24 (0.00%)
Esperanto 23 8 (0.00%) Altenglisch 19 64 (28.57%)
Estnisch 5 22 (16.83%) Litauisch 18 0 (0.00%)
Faliskisch 6 0 (0.00%) Okzitanisch 18 18 (0.19%)
Finnisch 31 82 (4.56%) Klassisches Nahuatl 17 88 (40.37%)
Französisch 72 212 (28.97%) Urdu 17 142 (2.18%)
Friaulisch 5 0 (0.00%) Katalanisch 16 44 (18.90%)
Frühneuhochdeutsch 11 0 (0.00%) Afrikaans 16 82 (10.13%)
Fulfulde 1 0 (0.00%) Altkirchenslawisch 16 78 (21.21%)
Färöisch 20 14 (25.74%) Usbekisch 15 0 (0.00%)
Galicisch 4 0 (0.00%) Baskisch 15 22 (4.97%)
Georgisch 10 0 (0.00%) Kurdisch 14 30 (0.00%)
Gotisch 32 30 (20.97%) Levantinisches Arabisch 14 18 (0.41%)
Guerrero-Nahuatl 1 2 (0.00%) Aserbaidschanisch 14 12 (2.84%)
Gujarati 1 0 (0.00%) Akkadisch 14 162 (3.16%)
Gurage 1 0 (0.00%) Walisisch 12 0 (0.00%)
Haitianisch 1 0 (0.00%) Mittelhochdeutsch 12 0 (0.00%)
Hausa 5 10 (16.85%) Chinesisch 12 0 (0.00%)
Hawaiianisch 3 0 (0.00%) Koreanisch 12 0 (0.00%)
Hebräisch 27 16 (1.85%) Zentral-Nahuatl 11 50 (10.00%)
Hethitisch 8 0 (0.00%) Frühneuhochdeutsch 11 0 (0.00%)
Hindi 6 0 (0.00%) Sumerisch 11 78 (25.00%)
Huastekisches Ost-Nahuatl 1 0 (0.00%) Maltesisch 10 0 (0.00%)
Huastekisches West-Nahuatl 1 2 (0.00%) Vietnamesisch 10 0 (0.00%)
Huastekisches Zentral-Nahuatl 8 40 (38.89%) Georgisch 10 0 (0.00%)
Hurritisch 1 0 (0.00%) Westfriesisch 9 60 (31.68%)
Ido 29 40 (40.55%) Thai 9 0 (0.00%)
Indonesisch 6 0 (0.00%) Huastekisches Zentral-Nahuatl 8 40 (38.89%)
Interlingua 6 0 (0.00%) Südpikenisch 8 0 (0.00%)
International 5 0 (0.00%) Hethitisch 8 0 (0.00%)
Inuktitut 4 0 (0.00%) Altnordisch 7 0 (0.00%)
Inupiaq 1 0 (0.00%) Interlingua 6 0 (0.00%)
Irisch 27 4 (3.12%) Japanisch 6 0 (0.00%)
Isländisch 43 56 (9.34%) Kaschubisch 6 0 (0.00%)
Italienisch 75 42 (0.02%) Indonesisch 6 0 (0.00%)
Jakutisch 1 0 (0.00%) Paschtu 6 60 (0.78%)
Jamaika-Kreolisch 5 12 (28.57%) Hindi 6 0 (0.00%)
Japanisch 6 0 (0.00%) Sindhi 6 6 (0.00%)
Jiddisch 20 0 (0.00%) Faliskisch 6 0 (0.00%)
Kabardinisch 2 0 (0.00%) Koptisch 6 0 (0.00%)
Kannada 1 0 (0.00%) Luxemburgisch 5 92 (28.57%)
Kantonesisch 1 0 (0.00%) International 5 0 (0.00%)
Karatschai-Balkarisch 2 0 (0.00%) Friaulisch 5 0 (0.00%)
Kasachisch 3 0 (0.00%) Estnisch 5 22 (16.83%)
Kaschubisch 6 0 (0.00%) Krimtatarisch 5 0 (0.00%)
Katalanisch 16 44 (18.90%) Hausa 5 10 (16.85%)
Khmer 1 0 (0.00%) Bretonisch 5 0 (0.00%)
Khowar 1 0 (0.00%) Altirisch 5 0 (0.00%)
Kikuyu 1 0 (0.00%) Tagalog 5 4 (62.50%)
Kirchenslawisch 1 0 (0.00%) Serbokroatisch 5 0 (0.00%)
Kirgisisch 3 0 (0.00%) Venezianisch 5 0 (0.00%)
Klamath 1 0 (0.00%) West-Pandschabi 5 30 (3.33%)
Klassisches Nahuatl 17 88 (40.37%) Jamaika-Kreolisch 5 12 (28.57%)
Klassisches Nahuatl‎ 3 16 (50.00%) Umbrisch 5 0 (0.00%)
Komi 2 0 (0.00%) Galicisch 4 0 (0.00%)
Konkani 1 0 (0.00%) Nauruisch 4 0 (0.00%)
Koptisch 6 0 (0.00%) Sardisch 4 0 (0.00%)
Koreanisch 12 0 (0.00%) Scots 4 20 (48.84%)
Kornisch 1 0 (0.00%) Suaheli 4 38 (77.27%)
Korsisch 1 0 (0.00%) Mongolisch 4 2 (0.00%)
Kotava 1 2 (100.00%) Tadschikisch 4 0 (0.00%)
Krimtatarisch 5 0 (0.00%) Tetelcingo-Nahuatl 4 4 (25.00%)
Kroatisch 40 0 (0.00%) Inuktitut 4 0 (0.00%)
Kumükisch 1 0 (0.00%) Oskisch 4 0 (0.00%)
Kurdisch 14 30 (0.00%) Nepalesisch 4 0 (0.00%)
Ladinisch 2 0 (0.00%) Belutschi 4 4 (0.00%)
Laotisch 1 0 (0.00%) Shona 3 0 (0.00%)
Latein 240 536 (2.91%) Hawaiianisch 3 0 (0.00%)
Lettgallisch 2 0 (0.00%) Maori 3 0 (0.00%)
Lettisch 19 32 (9.93%) Volapük 3 0 (0.00%)
Levantinisches Arabisch 14 18 (0.41%) Altfranzösisch 3 0 (0.00%)
Litauisch 18 0 (0.00%) Tatarisch 3 0 (0.00%)
Luwisch 2 2 (50.00%) Kasachisch 3 0 (0.00%)
Luxemburgisch 5 92 (28.57%) Kirgisisch 3 0 (0.00%)
Láadan 1 0 (0.00%) Temascaltepec-Nahuatl 3 10 (25.00%)
Malaiisch 2 0 (0.00%) Klassisches Nahuatl‎ 3 16 (50.00%)
Malayalam 1 0 (0.00%) Rätoromanisch 3 0 (0.00%)
Maledivisch 1 0 (0.00%) Marathi 3 0 (0.00%)
Maltesisch 10 0 (0.00%) Sanskrit 3 0 (0.00%)
Mandschurisch 1 0 (0.00%) Birmanisch 3 0 (0.00%)
Manx 1 0 (0.00%) Malaiisch 2 0 (0.00%)
Maori 3 0 (0.00%) Mittelenglisch 2 0 (0.00%)
Marathi 3 0 (0.00%) Lettgallisch 2 0 (0.00%)
Mari 1 0 (0.00%) Tetum 2 0 (0.00%)
Marsisch 2 0 (0.00%) Mittelniederdeutsch 2 4 (0.00%)
Mazedonisch 27 0 (0.00%) Schottisch-Gälisch 2 0 (0.00%)
Mittelenglisch 2 0 (0.00%) Westflämisch 2 0 (0.00%)
Mittelgriechisch 1 0 (0.00%) Baschkirisch 2 0 (0.00%)
Mittelhochdeutsch 12 0 (0.00%) Abchasisch 2 0 (0.00%)
Mittelniederdeutsch 2 4 (0.00%) Adygeisch 2 0 (0.00%)
Mokscha 1 0 (0.00%) Altaisch 2 0 (0.00%)
Mongolisch 4 2 (0.00%) Burjatisch 2 0 (0.00%)
Morisien 1 0 (0.00%) Kabardinisch 2 0 (0.00%)
Nahuatl 1 0 (0.00%) Karatschai-Balkarisch 2 0 (0.00%)
Nauruisch 4 0 (0.00%) Komi 2 0 (0.00%)
Nepalesisch 4 0 (0.00%) Ossetisch 2 0 (0.00%)
Neugriechisch 54 222 (13.39%) Tschetschenisch 2 0 (0.00%)
Niederdeutsch 33 152 (6.80%) Tschuktschisch 2 0 (0.00%)
Niederländisch 72 148 (7.67%) Sizilianisch 2 0 (0.00%)
Niedersorbisch 55 0 (0.00%) Marsisch 2 0 (0.00%)
Niueanisch 1 0 (0.00%) Acehnesisch 2 0 (0.00%)
Nordfriesisch 1 0 (0.00%) Luwisch 2 2 (50.00%)
Norwegisch 24 2 (0.26%) Telugu 2 0 (0.00%)
Novial 1 0 (0.00%) Uigurisch 2 0 (0.00%)
Obersorbisch 42 0 (0.00%) Sogdisch 2 0 (0.00%)
Okzitanisch 18 18 (0.19%) Bengalisch 2 0 (0.00%)
Orizaba-Nahuatl 2 0 (0.00%) Orizaba-Nahuatl 2 0 (0.00%)
Oromo 1 0 (0.00%) Polabisch 2 0 (0.00%)
Oskisch 4 0 (0.00%) Ladinisch 2 0 (0.00%)
Osmanisches Türkisch 1 2 (0.00%) Asturisch 1 0 (0.00%)
Ossetisch 2 0 (0.00%) Haitianisch 1 0 (0.00%)
Pali 1 0 (0.00%) Kikuyu 1 0 (0.00%)
Pandschabi 1 0 (0.00%) Nordfriesisch 1 0 (0.00%)
Papiamentu 1 0 (0.00%) Balinesisch 1 0 (0.00%)
Paschtu 6 60 (0.78%) Kornisch 1 0 (0.00%)
Persisch 25 16 (0.72%) Huastekisches Ost-Nahuatl 1 0 (0.00%)
Piemontesisch 1 0 (0.00%) Tok Pisin 1 0 (0.00%)
Polabisch 2 0 (0.00%) Niueanisch 1 0 (0.00%)
Polnisch 210 8 (0.03%) Papiamentu 1 0 (0.00%)
Portugiesisch 31 40 (5.08%) Malayalam 1 0 (0.00%)
Prußisch 31 302 (13.49%) Kannada 1 0 (0.00%)
Rumänisch 23 112 (2.54%) Urartäisch 1 0 (0.00%)
Russisch 57 0 (0.00%) Tuvaluisch 1 0 (0.00%)
Rätoromanisch 3 0 (0.00%) Chakassisch 1 0 (0.00%)
Sami 1 0 (0.00%) Turkmenisch 1 0 (0.00%)
Samoanisch 1 0 (0.00%) Kumükisch 1 0 (0.00%)
Sanskrit 3 0 (0.00%) Abasinisch 1 0 (0.00%)
Sardisch 4 0 (0.00%) Awarisch 1 0 (0.00%)
Schottisch-Gälisch 2 0 (0.00%) Chantisch 1 0 (0.00%)
Schwedisch 147 154 (0.00%) Dunganisch 1 0 (0.00%)
Scots 4 20 (48.84%) Jakutisch 1 0 (0.00%)
Serbisch 39 158 (3.65%) Mari 1 0 (0.00%)
Serbokroatisch 5 0 (0.00%) Mokscha 1 0 (0.00%)
Sesotho 1 0 (0.00%) Tschuwaschisch 1 0 (0.00%)
Shona 3 0 (0.00%) Tuwinisch 1 0 (0.00%)
Sindarin 1 0 (0.00%) Udmurtisch 1 0 (0.00%)
Sindhi 6 6 (0.00%) Urum 1 0 (0.00%)
Singhalesisch 1 0 (0.00%) Nahuatl 1 0 (0.00%)
Sizilianisch 2 0 (0.00%) Huastekisches West-Nahuatl 1 2 (0.00%)
Slowakisch 31 0 (0.00%) Aragonesisch 1 0 (0.00%)
Slowenisch 31 0 (0.00%) Zentrales Puebla-Nahuatl 1 0 (0.00%)
Sogdisch 2 0 (0.00%) Korsisch 1 0 (0.00%)
Somalisch 1 0 (0.00%) Tamil 1 0 (0.00%)
Spanisch 35 60 (30.16%) Sesotho 1 0 (0.00%)
Suaheli 4 38 (77.27%) Manx 1 0 (0.00%)
Sumerisch 11 78 (25.00%) Samoanisch 1 0 (0.00%)
Swanisch 1 0 (0.00%) Somalisch 1 0 (0.00%)
Südpikenisch 8 0 (0.00%) isiZulu 1 0 (0.00%)
Tadschikisch 4 0 (0.00%) Sindarin 1 0 (0.00%)
Tagalog 5 4 (62.50%) Hurritisch 1 0 (0.00%)
Tahitianisch 1 0 (0.00%) Fulfulde 1 0 (0.00%)
Tamil 1 0 (0.00%) Bairisch 1 0 (0.00%)
Tatarisch 3 0 (0.00%) Pali 1 0 (0.00%)
Telugu 2 0 (0.00%) Sami 1 0 (0.00%)
Temascaltepec-Nahuatl 3 10 (25.00%) Altsächsisch 1 0 (0.00%)
Tetelcingo-Nahuatl 4 4 (25.00%) Twi 1 0 (0.00%)
Tetum 2 0 (0.00%) Novial 1 0 (0.00%)
Thai 9 0 (0.00%) Zentral-Alaska-Yupik 1 0 (0.00%)
Tibetisch 1 0 (0.00%) Oromo 1 0 (0.00%)
Tigrinya 1 0 (0.00%) Swanisch 1 0 (0.00%)
Tok Pisin 1 0 (0.00%) Gurage 1 0 (0.00%)
Torwali 1 0 (0.00%) Inupiaq 1 0 (0.00%)
Tschechisch 76 0 (0.00%) Khowar 1 0 (0.00%)
Tschetschenisch 2 0 (0.00%) Torwali 1 0 (0.00%)
Tschuktschisch 2 0 (0.00%) Assamesisch 1 0 (0.00%)
Tschuwaschisch 1 0 (0.00%) Gujarati 1 0 (0.00%)
Turkmenisch 1 0 (0.00%) Pandschabi 1 0 (0.00%)
Tuvaluisch 1 0 (0.00%) Laotisch 1 0 (0.00%)
Tuwinisch 1 0 (0.00%) Catawba 1 0 (0.00%)
Twi 1 0 (0.00%) Mittelgriechisch 1 0 (0.00%)
Türkisch 22 80 (1.64%) Guerrero-Nahuatl 1 2 (0.00%)
Udmurtisch 1 0 (0.00%) Klamath 1 0 (0.00%)
Ugaritisch 1 0 (0.00%) Amharisch 1 0 (0.00%)
Uigurisch 2 0 (0.00%) Durango-Nahuatl 1 0 (0.00%)
Ukrainisch 73 0 (0.00%) Piemontesisch 1 0 (0.00%)
Umbrisch 5 0 (0.00%) Alttschechisch 1 0 (0.00%)
Ungarisch 21 42 (4.52%) Kotava 1 2 (100.00%)
Urartäisch 1 0 (0.00%) Láadan 1 0 (0.00%)
Urdu 17 142 (2.18%) Brahui 1 0 (0.00%)
Urum 1 0 (0.00%) Morisien 1 0 (0.00%)
Usbekisch 15 0 (0.00%) Tahitianisch 1 0 (0.00%)
Venezianisch 5 0 (0.00%) Kirchenslawisch 1 0 (0.00%)
Vietnamesisch 10 0 (0.00%) Osmanisches Türkisch 1 2 (0.00%)
Volapük 3 0 (0.00%) Kantonesisch 1 0 (0.00%)
Walisisch 12 0 (0.00%) Ugaritisch 1 0 (0.00%)
Weißrussisch 39 0 (0.00%) Mandschurisch 1 0 (0.00%)
West-Pandschabi 5 30 (3.33%) Tibetisch 1 0 (0.00%)
Westflämisch 2 0 (0.00%) Konkani 1 0 (0.00%)
Westfriesisch 9 60 (31.68%) Maledivisch 1 0 (0.00%)
Zentral-Alaska-Yupik 1 0 (0.00%) Baktrisch 1 0 (0.00%)
Zentral-Nahuatl 11 50 (10.00%) Tigrinya 1 0 (0.00%)
Zentrales Puebla-Nahuatl 1 0 (0.00%) Khmer 1 0 (0.00%)
isiZulu 1 0 (0.00%) Singhalesisch 1 0 (0.00%)
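For readers who want to work with the rows above programmatically, here is a minimal parsing sketch. It assumes each entry has the shape "language name, table forms, errors, percentage in parentheses" and that two such entries share one line; the function name and tuple layout are illustrative and not part of kaikki.org or wiktextract.

```python
import re

# One entry: a language name (may contain spaces), two integers, and a percentage.
ENTRY = re.compile(r"\s*(.+?) (\d+) (\d+) \((\d+(?:\.\d+)?)%\)")


def parse_row(line):
    """Split one flattened table line into (language, table_forms, errors, percent) tuples."""
    return [
        (name.strip(), int(forms), int(errors), float(percent))
        for name, forms, errors, percent in ENTRY.findall(line)
    ]


rows = parse_row("Klassisches Nahuatl 17 88 (40.37%) Urdu 17 142 (2.18%)")
for language, table_forms, errors, percent in rows:
    print(language, table_forms, errors, percent)
```

Because the left and right halves of the table list the same languages in different orders, keeping only the first tuple of each line is enough to recover every language once.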

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-07-28 from the dewiktionary dump dated 2025-07-20 using wiktextract (c280bfc and daf64d0). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.