Wiktionary data extraction errors and warnings

Inflection check

List of different kinds of inflection tables. When wiktextract parses word heads and tables, it assigns the forms it encounters with tags that describe grammatical or contextual information. The tags and forms that are found in head sections and tables are kept separate from other head section and table tags, and later they are merged with other heads and tables into table types that all contain the same number of word forms with the same tags for those forms.

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typoes and badly formated Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.

Language ⏶ Table forms Errors (% affected words) Language Table forms ⏷ Errors (% affected words)
Abasinisch 2 0 (0.00%) Deutsch 31912 6 (0.00%)
Abchasisch 3 0 (0.00%) Latein 167 640 (2.71%)
Acehnesisch 2 0 (0.00%) Altgriechisch 167 848 (6.65%)
Adygeisch 3 0 (0.00%) Polnisch 141 8 (0.03%)
Afrikaans 16 82 (10.13%) Englisch 114 0 (0.00%)
Akkadisch 14 162 (3.16%) Schwedisch 104 0 (0.00%)
Albanisch 43 282 (3.61%) Französisch 79 0 (0.00%)
Altaisch 3 0 (0.00%) Italienisch 77 50 (0.02%)
Altenglisch 10 12 (28.57%) Tschechisch 74 0 (0.00%)
Altfranzösisch 2 0 (0.00%) Niederländisch 73 42 (7.76%)
Altgriechisch 167 848 (6.65%) Russisch 61 0 (0.00%)
Althochdeutsch 18 0 (0.00%) Arabisch 53 220 (8.31%)
Altirisch 4 0 (0.00%) Armenisch 51 134 (2.51%)
Altkirchenslawisch 12 80 (21.21%) Niedersorbisch 49 0 (0.00%)
Altnordisch 6 0 (0.00%) Ukrainisch 48 0 (0.00%)
Altsächsisch 1 0 (0.00%) Neugriechisch 45 140 (13.39%)
Alttschechisch 1 0 (0.00%) Albanisch 43 282 (3.61%)
Amharisch 1 0 (0.00%) Serbisch 40 62 (0.36%)
Arabisch 53 220 (8.31%) Spanisch 38 0 (0.00%)
Aragonesisch 1 0 (0.00%) Niederdeutsch 33 108 (6.76%)
Armenisch 51 134 (2.51%) Isländisch 33 52 (9.32%)
Aserbaidschanisch 14 8 (2.84%) Slowakisch 32 0 (0.00%)
Assamesisch 1 0 (0.00%) Prußisch 31 280 (13.49%)
Asturisch 1 0 (0.00%) Portugiesisch 30 36 (5.08%)
Awarisch 2 0 (0.00%) Ido 29 32 (40.19%)
Bairisch 1 0 (0.00%) Obersorbisch 29 0 (0.00%)
Baktrisch 1 0 (0.00%) Weißrussisch 28 0 (0.00%)
Balinesisch 1 0 (0.00%) Gotisch 28 24 (20.97%)
Baschkirisch 3 0 (0.00%) Bulgarisch 27 126 (3.67%)
Baskisch 13 22 (4.97%) Türkisch 27 80 (1.24%)
Belutschi 4 4 (0.00%) Hebräisch 27 16 (1.85%)
Bengalisch 2 0 (0.00%) Finnisch 26 82 (4.54%)
Birmanisch 3 0 (0.00%) Persisch 26 16 (0.72%)
Bosnisch 21 0 (0.00%) Dänisch 25 2 (10.85%)
Brahui 1 0 (0.00%) Rumänisch 25 122 (2.54%)
Bretonisch 5 0 (0.00%) Mazedonisch 25 0 (0.00%)
Bulgarisch 27 126 (3.67%) Chinesisch 24 0 (0.00%)
Burjatisch 3 0 (0.00%) Slowenisch 23 0 (0.00%)
Catawba 1 0 (0.00%) Kroatisch 23 0 (0.00%)
Chakassisch 1 0 (0.00%) Norwegisch 22 6 (0.26%)
Chantisch 2 0 (0.00%) Ungarisch 22 44 (4.03%)
Chinesisch 24 0 (0.00%) Bosnisch 21 0 (0.00%)
Deutsch 31912 6 (0.00%) Lettisch 20 32 (9.93%)
Dunganisch 2 0 (0.00%) Färöisch 20 0 (0.00%)
Durango-Nahuatl 1 2 (0.00%) Jiddisch 20 0 (0.00%)
Dänisch 25 2 (10.85%) Esperanto 19 0 (0.00%)
Englisch 114 0 (0.00%) Althochdeutsch 18 0 (0.00%)
Esperanto 19 0 (0.00%) Irisch 18 2 (2.99%)
Estnisch 6 22 (16.83%) Katalanisch 17 34 (15.58%)
Faliskisch 6 0 (0.00%) Usbekisch 17 0 (0.00%)
Finnisch 26 82 (4.54%) Litauisch 17 0 (0.00%)
Französisch 79 0 (0.00%) Urdu 17 124 (2.18%)
Friaulisch 5 0 (0.00%) Afrikaans 16 82 (10.13%)
Frühneuhochdeutsch 11 0 (0.00%) Okzitanisch 15 14 (0.17%)
Fulfulde 1 0 (0.00%) Kurdisch 15 30 (0.00%)
Färöisch 20 0 (0.00%) Levantinisches Arabisch 14 18 (0.41%)
Galicisch 4 0 (0.00%) Aserbaidschanisch 14 8 (2.84%)
Georgisch 11 0 (0.00%) Südpikenisch 14 0 (0.00%)
Gotisch 28 24 (20.97%) Akkadisch 14 162 (3.16%)
Guerrero-Nahuatl 1 4 (0.00%) Baskisch 13 22 (4.97%)
Gujarati 1 0 (0.00%) Walisisch 12 0 (0.00%)
Gurage 1 0 (0.00%) Koreanisch 12 0 (0.00%)
Haitianisch 1 0 (0.00%) Klassisches Nahuatl 12 76 (47.11%)
Hausa 5 12 (16.85%) Altkirchenslawisch 12 80 (21.21%)
Hawaiianisch 3 0 (0.00%) Maltesisch 11 0 (0.00%)
Hebräisch 27 16 (1.85%) Georgisch 11 0 (0.00%)
Hethitisch 8 0 (0.00%) Frühneuhochdeutsch 11 0 (0.00%)
Hindi 6 0 (0.00%) Sumerisch 11 78 (25.00%)
Huastekisches Ost-Nahuatl 1 2 (0.00%) Mittelhochdeutsch 10 0 (0.00%)
Huastekisches West-Nahuatl 1 4 (0.00%) Vietnamesisch 10 0 (0.00%)
Huastekisches Zentral-Nahuatl 6 36 (38.89%) Zentral-Nahuatl 10 50 (30.56%)
Hurritisch 1 0 (0.00%) Altenglisch 10 12 (28.57%)
Ido 29 32 (40.19%) International 9 0 (0.00%)
Indonesisch 6 0 (0.00%) Westfriesisch 9 26 (28.83%)
Interlingua 6 0 (0.00%) Thai 9 0 (0.00%)
International 9 0 (0.00%) Marsisch 8 0 (0.00%)
Inuktitut 4 0 (0.00%) Hethitisch 8 0 (0.00%)
Inupiaq 1 0 (0.00%) Interlingua 6 0 (0.00%)
Irisch 18 2 (2.99%) Japanisch 6 0 (0.00%)
Isländisch 33 52 (9.32%) Estnisch 6 22 (16.83%)
Italienisch 77 50 (0.02%) Kaschubisch 6 0 (0.00%)
Jakutisch 2 0 (0.00%) Krimtatarisch 6 0 (0.00%)
Jamaika-Kreolisch 5 4 (28.57%) Altnordisch 6 0 (0.00%)
Japanisch 6 0 (0.00%) Indonesisch 6 0 (0.00%)
Jiddisch 20 0 (0.00%) Paschtu 6 60 (0.78%)
Kabardinisch 3 0 (0.00%) Huastekisches Zentral-Nahuatl 6 36 (38.89%)
Kannada 1 0 (0.00%) Volskisch 6 0 (0.00%)
Kantonesisch 1 0 (0.00%) Hindi 6 0 (0.00%)
Karatschai-Balkarisch 3 0 (0.00%) Serbokroatisch 6 0 (0.00%)
Kasachisch 4 0 (0.00%) Sindhi 6 6 (0.00%)
Kaschubisch 6 0 (0.00%) Faliskisch 6 0 (0.00%)
Katalanisch 17 34 (15.58%) Umbrisch 6 0 (0.00%)
Khmer 1 0 (0.00%) Koptisch 6 0 (0.00%)
Khowar 1 0 (0.00%) Luxemburgisch 5 16 (0.00%)
Kikuyu 2 0 (0.00%) Friaulisch 5 0 (0.00%)
Kirchenslawisch 1 0 (0.00%) Shona 5 0 (0.00%)
Kirgisisch 4 0 (0.00%) Hausa 5 12 (16.85%)
Klamath 1 0 (0.00%) Bretonisch 5 0 (0.00%)
Klassisches Nahuatl 12 76 (47.11%) Tagalog 5 6 (62.50%)
Komi 3 0 (0.00%) Mongolisch 5 2 (0.00%)
Konkani 1 0 (0.00%) Tadschikisch 5 0 (0.00%)
Koptisch 6 0 (0.00%) Tetelcingo-Nahuatl 5 10 (77.78%)
Koreanisch 12 0 (0.00%) Venezianisch 5 0 (0.00%)
Kornisch 1 0 (0.00%) Oskisch 5 0 (0.00%)
Korsisch 1 0 (0.00%) West-Pandschabi 5 30 (3.33%)
Kotava 1 2 (100.00%) Jamaika-Kreolisch 5 4 (28.57%)
Krimtatarisch 6 0 (0.00%) Galicisch 4 0 (0.00%)
Kroatisch 23 0 (0.00%) Altirisch 4 0 (0.00%)
Kumükisch 1 0 (0.00%) Nauruisch 4 0 (0.00%)
Kurdisch 15 30 (0.00%) Sardisch 4 0 (0.00%)
Ladinisch 2 0 (0.00%) Scots 4 16 (48.84%)
Laotisch 1 0 (0.00%) Suaheli 4 34 (77.27%)
Latein 167 640 (2.71%) Tatarisch 4 0 (0.00%)
Lettgallisch 3 0 (0.00%) Kasachisch 4 0 (0.00%)
Lettisch 20 32 (9.93%) Kirgisisch 4 0 (0.00%)
Levantinisches Arabisch 14 18 (0.41%) Inuktitut 4 0 (0.00%)
Litauisch 17 0 (0.00%) Nepalesisch 4 0 (0.00%)
Luwisch 2 2 (50.00%) Belutschi 4 4 (0.00%)
Luxemburgisch 5 16 (0.00%) Vestinisch 4 0 (0.00%)
Láadan 1 0 (0.00%) Lettgallisch 3 0 (0.00%)
Malaiisch 2 0 (0.00%) Hawaiianisch 3 0 (0.00%)
Malayalam 1 0 (0.00%) Maori 3 0 (0.00%)
Maledivisch 1 0 (0.00%) Volapük 3 0 (0.00%)
Maltesisch 11 0 (0.00%) Baschkirisch 3 0 (0.00%)
Mandschurisch 1 0 (0.00%) Abchasisch 3 0 (0.00%)
Manx 1 0 (0.00%) Adygeisch 3 0 (0.00%)
Maori 3 0 (0.00%) Altaisch 3 0 (0.00%)
Marathi 3 0 (0.00%) Burjatisch 3 0 (0.00%)
Mari 2 0 (0.00%) Kabardinisch 3 0 (0.00%)
Marsisch 8 0 (0.00%) Karatschai-Balkarisch 3 0 (0.00%)
Mazedonisch 25 0 (0.00%) Komi 3 0 (0.00%)
Mittelenglisch 2 0 (0.00%) Ossetisch 3 0 (0.00%)
Mittelgriechisch 1 0 (0.00%) Tschetschenisch 3 0 (0.00%)
Mittelhochdeutsch 10 0 (0.00%) Tschuktschisch 3 0 (0.00%)
Mittelniederdeutsch 2 4 (0.00%) Temascaltepec-Nahuatl 3 16 (25.00%)
Mokscha 2 0 (0.00%) Rätoromanisch 3 0 (0.00%)
Mongolisch 5 2 (0.00%) Marathi 3 0 (0.00%)
Morisien 1 0 (0.00%) Sanskrit 3 0 (0.00%)
Nahuatl 1 0 (0.00%) Orizaba-Nahuatl 3 6 (20.00%)
Nauruisch 4 0 (0.00%) Birmanisch 3 0 (0.00%)
Nepalesisch 4 0 (0.00%) Malaiisch 2 0 (0.00%)
Neugriechisch 45 140 (13.39%) Mittelenglisch 2 0 (0.00%)
Niederdeutsch 33 108 (6.76%) Kikuyu 2 0 (0.00%)
Niederländisch 73 42 (7.76%) Tetum 2 0 (0.00%)
Niedersorbisch 49 0 (0.00%) Mittelniederdeutsch 2 4 (0.00%)
Niueanisch 1 0 (0.00%) Schottisch-Gälisch 2 0 (0.00%)
Nordfriesisch 1 0 (0.00%) Tuvaluisch 2 0 (0.00%)
Norwegisch 22 6 (0.26%) Westflämisch 2 0 (0.00%)
Novial 1 0 (0.00%) Altfranzösisch 2 0 (0.00%)
Obersorbisch 29 0 (0.00%) Abasinisch 2 0 (0.00%)
Okzitanisch 15 14 (0.17%) Awarisch 2 0 (0.00%)
Orizaba-Nahuatl 3 6 (20.00%) Chantisch 2 0 (0.00%)
Oromo 1 0 (0.00%) Dunganisch 2 0 (0.00%)
Oskisch 5 0 (0.00%) Jakutisch 2 0 (0.00%)
Osmanisches Türkisch 1 2 (0.00%) Mari 2 0 (0.00%)
Ossetisch 3 0 (0.00%) Mokscha 2 0 (0.00%)
Pali 1 0 (0.00%) Tschuwaschisch 2 0 (0.00%)
Pandschabi 1 0 (0.00%) Tuwinisch 2 0 (0.00%)
Papiamentu 1 0 (0.00%) Udmurtisch 2 0 (0.00%)
Paschtu 6 60 (0.78%) Urum 2 0 (0.00%)
Pennsylvaniadeutsch 1 0 (0.00%) Sizilianisch 2 0 (0.00%)
Persisch 26 16 (0.72%) Zentrales Puebla-Nahuatl 2 4 (100.00%)
Piemontesisch 1 0 (0.00%) Acehnesisch 2 0 (0.00%)
Polabisch 2 0 (0.00%) Luwisch 2 2 (50.00%)
Polnisch 141 8 (0.03%) Telugu 2 0 (0.00%)
Portugiesisch 30 36 (5.08%) Uigurisch 2 0 (0.00%)
Prußisch 31 280 (13.49%) Sogdisch 2 0 (0.00%)
Rumänisch 25 122 (2.54%) Bengalisch 2 0 (0.00%)
Russisch 61 0 (0.00%) Polabisch 2 0 (0.00%)
Rätoromanisch 3 0 (0.00%) Ladinisch 2 0 (0.00%)
Sami 1 0 (0.00%) Asturisch 1 0 (0.00%)
Samoanisch 1 0 (0.00%) Haitianisch 1 0 (0.00%)
Sanskrit 3 0 (0.00%) Pennsylvaniadeutsch 1 0 (0.00%)
Sardisch 4 0 (0.00%) Nordfriesisch 1 0 (0.00%)
Schottisch-Gälisch 2 0 (0.00%) Balinesisch 1 0 (0.00%)
Schwedisch 104 0 (0.00%) Kornisch 1 0 (0.00%)
Scots 4 16 (48.84%) Huastekisches Ost-Nahuatl 1 2 (0.00%)
Serbisch 40 62 (0.36%) Tok Pisin 1 0 (0.00%)
Serbokroatisch 6 0 (0.00%) Niueanisch 1 0 (0.00%)
Sesotho 1 0 (0.00%) Papiamentu 1 0 (0.00%)
Shona 5 0 (0.00%) Malayalam 1 0 (0.00%)
Sindarin 1 0 (0.00%) Kannada 1 0 (0.00%)
Sindhi 6 6 (0.00%) Urartäisch 1 0 (0.00%)
Singhalesisch 1 0 (0.00%) Chakassisch 1 0 (0.00%)
Sizilianisch 2 0 (0.00%) Turkmenisch 1 0 (0.00%)
Slowakisch 32 0 (0.00%) Kumükisch 1 0 (0.00%)
Slowenisch 23 0 (0.00%) Nahuatl 1 0 (0.00%)
Sogdisch 2 0 (0.00%) Huastekisches West-Nahuatl 1 4 (0.00%)
Somalisch 1 0 (0.00%) Aragonesisch 1 0 (0.00%)
Spanisch 38 0 (0.00%) Korsisch 1 0 (0.00%)
Suaheli 4 34 (77.27%) Tamil 1 0 (0.00%)
Sumerisch 11 78 (25.00%) Sesotho 1 0 (0.00%)
Swanisch 1 0 (0.00%) Manx 1 0 (0.00%)
Südpikenisch 14 0 (0.00%) Samoanisch 1 0 (0.00%)
Tadschikisch 5 0 (0.00%) Somalisch 1 0 (0.00%)
Tagalog 5 6 (62.50%) isiZulu 1 0 (0.00%)
Tahitianisch 1 0 (0.00%) Sindarin 1 0 (0.00%)
Tamil 1 0 (0.00%) Hurritisch 1 0 (0.00%)
Tatarisch 4 0 (0.00%) Fulfulde 1 0 (0.00%)
Telugu 2 0 (0.00%) Bairisch 1 0 (0.00%)
Temascaltepec-Nahuatl 3 16 (25.00%) Pali 1 0 (0.00%)
Tetelcingo-Nahuatl 5 10 (77.78%) Sami 1 0 (0.00%)
Tetum 2 0 (0.00%) Altsächsisch 1 0 (0.00%)
Thai 9 0 (0.00%) Twi 1 0 (0.00%)
Tibetisch 1 0 (0.00%) Novial 1 0 (0.00%)
Tigrinya 1 0 (0.00%) Zentral-Alaska-Yupik 1 0 (0.00%)
Tok Pisin 1 0 (0.00%) Oromo 1 0 (0.00%)
Torwali 1 0 (0.00%) Swanisch 1 0 (0.00%)
Tschechisch 74 0 (0.00%) Gurage 1 0 (0.00%)
Tschetschenisch 3 0 (0.00%) Inupiaq 1 0 (0.00%)
Tschuktschisch 3 0 (0.00%) Khowar 1 0 (0.00%)
Tschuwaschisch 2 0 (0.00%) Torwali 1 0 (0.00%)
Turkmenisch 1 0 (0.00%) Assamesisch 1 0 (0.00%)
Tuvaluisch 2 0 (0.00%) Gujarati 1 0 (0.00%)
Tuwinisch 2 0 (0.00%) Pandschabi 1 0 (0.00%)
Twi 1 0 (0.00%) Laotisch 1 0 (0.00%)
Türkisch 27 80 (1.24%) Catawba 1 0 (0.00%)
Udmurtisch 2 0 (0.00%) Mittelgriechisch 1 0 (0.00%)
Ugaritisch 1 0 (0.00%) Guerrero-Nahuatl 1 4 (0.00%)
Uigurisch 2 0 (0.00%) Klamath 1 0 (0.00%)
Ukrainisch 48 0 (0.00%) Amharisch 1 0 (0.00%)
Umbrisch 6 0 (0.00%) Durango-Nahuatl 1 2 (0.00%)
Ungarisch 22 44 (4.03%) Piemontesisch 1 0 (0.00%)
Urartäisch 1 0 (0.00%) Alttschechisch 1 0 (0.00%)
Urdu 17 124 (2.18%) Kotava 1 2 (100.00%)
Urum 2 0 (0.00%) Láadan 1 0 (0.00%)
Usbekisch 17 0 (0.00%) Brahui 1 0 (0.00%)
Venezianisch 5 0 (0.00%) Morisien 1 0 (0.00%)
Vestinisch 4 0 (0.00%) Tahitianisch 1 0 (0.00%)
Vietnamesisch 10 0 (0.00%) Kirchenslawisch 1 0 (0.00%)
Volapük 3 0 (0.00%) Osmanisches Türkisch 1 2 (0.00%)
Volskisch 6 0 (0.00%) Kantonesisch 1 0 (0.00%)
Walisisch 12 0 (0.00%) Ugaritisch 1 0 (0.00%)
Weißrussisch 28 0 (0.00%) Mandschurisch 1 0 (0.00%)
West-Pandschabi 5 30 (3.33%) Tibetisch 1 0 (0.00%)
Westflämisch 2 0 (0.00%) Konkani 1 0 (0.00%)
Westfriesisch 9 26 (28.83%) Maledivisch 1 0 (0.00%)
Zentral-Alaska-Yupik 1 0 (0.00%) Baktrisch 1 0 (0.00%)
Zentral-Nahuatl 10 50 (30.56%) Tigrinya 1 0 (0.00%)
Zentrales Puebla-Nahuatl 2 4 (100.00%) Khmer 1 0 (0.00%)
isiZulu 1 0 (0.00%) Singhalesisch 1 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-02-20 from the dewiktionary dump dated 2026-02-02 using wiktextract (f492ef9 and 59dc20b). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.