Wiktionary data extraction errors and warnings

Inflection check

List of different kinds of inflection tables. When wiktextract parses word heads and tables, it assigns the forms it encounters with tags that describe grammatical or contextual information. The tags and forms that are found in head sections and tables are kept separate from other head section and table tags, and later they are merged with other heads and tables into table types that all contain the same number of word forms with the same tags for those forms.

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typoes and badly formated Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.

Language ⏶ Table forms Errors (% affected words) Language Table forms ⏷ Errors (% affected words)
Abasinisch 2 0 (0.00%) Deutsch 31818 6 (0.00%)
Abchasisch 3 0 (0.00%) Altgriechisch 167 862 (6.65%)
Acehnesisch 2 0 (0.00%) Latein 165 638 (2.75%)
Adygeisch 3 0 (0.00%) Polnisch 141 8 (0.03%)
Afrikaans 16 82 (10.13%) Englisch 114 0 (0.00%)
Akkadisch 14 152 (3.16%) Schwedisch 104 0 (0.00%)
Albanisch 43 280 (3.61%) Französisch 79 0 (0.00%)
Altaisch 3 0 (0.00%) Italienisch 77 50 (0.02%)
Altenglisch 10 12 (28.57%) Tschechisch 74 0 (0.00%)
Altfranzösisch 2 0 (0.00%) Niederländisch 73 44 (7.76%)
Altgriechisch 167 862 (6.65%) Russisch 61 0 (0.00%)
Althochdeutsch 18 0 (0.00%) Arabisch 53 222 (7.87%)
Altirisch 4 0 (0.00%) Armenisch 51 134 (2.51%)
Altkirchenslawisch 12 80 (21.21%) Niedersorbisch 49 0 (0.00%)
Altnordisch 6 0 (0.00%) Ukrainisch 48 0 (0.00%)
Altsächsisch 1 0 (0.00%) Neugriechisch 45 128 (13.39%)
Alttschechisch 1 0 (0.00%) Albanisch 43 280 (3.61%)
Amharisch 1 0 (0.00%) Serbisch 40 62 (0.36%)
Arabisch 53 222 (7.87%) Spanisch 38 0 (0.00%)
Aragonesisch 1 0 (0.00%) Niederdeutsch 33 108 (6.76%)
Armenisch 51 134 (2.51%) Isländisch 33 56 (9.32%)
Aserbaidschanisch 14 8 (2.84%) Slowakisch 32 0 (0.00%)
Assamesisch 1 0 (0.00%) Prußisch 31 332 (13.49%)
Asturisch 1 0 (0.00%) Portugiesisch 30 36 (5.08%)
Awarisch 2 0 (0.00%) Obersorbisch 29 0 (0.00%)
Bairisch 1 0 (0.00%) Ido 28 32 (40.99%)
Baktrisch 1 0 (0.00%) Weißrussisch 28 0 (0.00%)
Balinesisch 1 0 (0.00%) Gotisch 28 24 (20.97%)
Baschkirisch 3 0 (0.00%) Bulgarisch 27 130 (3.67%)
Baskisch 13 22 (4.97%) Türkisch 27 80 (1.24%)
Belutschi 4 4 (0.00%) Hebräisch 27 16 (1.85%)
Bengalisch 2 0 (0.00%) Persisch 26 16 (0.72%)
Birmanisch 3 0 (0.00%) Dänisch 25 2 (10.85%)
Bosnisch 21 0 (0.00%) Rumänisch 25 122 (2.54%)
Brahui 1 0 (0.00%) Finnisch 25 82 (4.55%)
Bretonisch 5 0 (0.00%) Mazedonisch 25 0 (0.00%)
Bulgarisch 27 130 (3.67%) Chinesisch 23 0 (0.00%)
Burjatisch 3 0 (0.00%) Slowenisch 23 0 (0.00%)
Catawba 1 0 (0.00%) Kroatisch 23 0 (0.00%)
Chakassisch 1 0 (0.00%) Norwegisch 22 6 (0.26%)
Chantisch 2 0 (0.00%) Ungarisch 22 44 (4.03%)
Chinesisch 23 0 (0.00%) Bosnisch 21 0 (0.00%)
Deutsch 31818 6 (0.00%) Lettisch 20 32 (9.93%)
Dunganisch 2 0 (0.00%) Färöisch 20 0 (0.00%)
Durango-Nahuatl 1 2 (0.00%) Jiddisch 20 0 (0.00%)
Dänisch 25 2 (10.85%) Esperanto 19 0 (0.00%)
Englisch 114 0 (0.00%) Althochdeutsch 18 0 (0.00%)
Esperanto 19 0 (0.00%) Irisch 18 2 (2.99%)
Estnisch 6 22 (16.83%) Katalanisch 17 34 (16.28%)
Faliskisch 6 0 (0.00%) Usbekisch 17 0 (0.00%)
Finnisch 25 82 (4.55%) Litauisch 17 0 (0.00%)
Französisch 79 0 (0.00%) Urdu 17 124 (2.18%)
Friaulisch 5 0 (0.00%) Afrikaans 16 82 (10.13%)
Frühneuhochdeutsch 11 0 (0.00%) Okzitanisch 15 14 (0.18%)
Fulfulde 1 0 (0.00%) Kurdisch 15 30 (0.00%)
Färöisch 20 0 (0.00%) Levantinisches Arabisch 14 18 (0.41%)
Galicisch 4 0 (0.00%) Aserbaidschanisch 14 8 (2.84%)
Georgisch 11 0 (0.00%) Südpikenisch 14 0 (0.00%)
Gotisch 28 24 (20.97%) Akkadisch 14 152 (3.16%)
Guerrero-Nahuatl 1 4 (0.00%) Baskisch 13 22 (4.97%)
Gujarati 1 0 (0.00%) Walisisch 12 0 (0.00%)
Gurage 1 0 (0.00%) Koreanisch 12 0 (0.00%)
Haitianisch 1 0 (0.00%) Klassisches Nahuatl 12 76 (47.11%)
Hausa 5 12 (16.85%) Altkirchenslawisch 12 80 (21.21%)
Hawaiianisch 3 0 (0.00%) Maltesisch 11 0 (0.00%)
Hebräisch 27 16 (1.85%) Georgisch 11 0 (0.00%)
Hethitisch 8 0 (0.00%) Frühneuhochdeutsch 11 0 (0.00%)
Hindi 6 0 (0.00%) Sumerisch 11 80 (25.00%)
Huastekisches Ost-Nahuatl 1 2 (0.00%) Mittelhochdeutsch 10 0 (0.00%)
Huastekisches West-Nahuatl 1 4 (0.00%) Vietnamesisch 10 0 (0.00%)
Huastekisches Zentral-Nahuatl 6 32 (38.89%) Zentral-Nahuatl 10 50 (30.56%)
Hurritisch 1 0 (0.00%) Altenglisch 10 12 (28.57%)
Ido 28 32 (40.99%) International 9 0 (0.00%)
Indonesisch 6 0 (0.00%) Westfriesisch 9 26 (28.83%)
Interlingua 6 0 (0.00%) Thai 9 0 (0.00%)
International 9 0 (0.00%) Marsisch 8 0 (0.00%)
Inuktitut 4 0 (0.00%) Hethitisch 8 0 (0.00%)
Inupiaq 1 0 (0.00%) Interlingua 6 0 (0.00%)
Irisch 18 2 (2.99%) Japanisch 6 0 (0.00%)
Isländisch 33 56 (9.32%) Estnisch 6 22 (16.83%)
Italienisch 77 50 (0.02%) Kaschubisch 6 0 (0.00%)
Jakutisch 2 0 (0.00%) Krimtatarisch 6 0 (0.00%)
Jamaika-Kreolisch 5 4 (28.57%) Altnordisch 6 0 (0.00%)
Japanisch 6 0 (0.00%) Indonesisch 6 0 (0.00%)
Jiddisch 20 0 (0.00%) Paschtu 6 60 (0.78%)
Kabardinisch 3 0 (0.00%) Huastekisches Zentral-Nahuatl 6 32 (38.89%)
Kannada 1 0 (0.00%) Volskisch 6 0 (0.00%)
Kantonesisch 1 0 (0.00%) Hindi 6 0 (0.00%)
Karatschai-Balkarisch 3 0 (0.00%) Serbokroatisch 6 0 (0.00%)
Kasachisch 4 0 (0.00%) Sindhi 6 6 (0.00%)
Kaschubisch 6 0 (0.00%) Faliskisch 6 0 (0.00%)
Katalanisch 17 34 (16.28%) Umbrisch 6 0 (0.00%)
Khmer 1 0 (0.00%) Koptisch 6 0 (0.00%)
Khowar 1 0 (0.00%) Luxemburgisch 5 16 (0.00%)
Kikuyu 2 0 (0.00%) Friaulisch 5 0 (0.00%)
Kirchenslawisch 1 0 (0.00%) Shona 5 0 (0.00%)
Kirgisisch 4 0 (0.00%) Hausa 5 12 (16.85%)
Klamath 1 0 (0.00%) Bretonisch 5 0 (0.00%)
Klassisches Nahuatl 12 76 (47.11%) Tagalog 5 6 (62.50%)
Komi 3 0 (0.00%) Mongolisch 5 2 (0.00%)
Konkani 1 0 (0.00%) Tadschikisch 5 0 (0.00%)
Koptisch 6 0 (0.00%) Tetelcingo-Nahuatl 5 10 (77.78%)
Koreanisch 12 0 (0.00%) Venezianisch 5 0 (0.00%)
Kornisch 1 0 (0.00%) Oskisch 5 0 (0.00%)
Korsisch 1 0 (0.00%) West-Pandschabi 5 30 (3.33%)
Kotava 1 2 (100.00%) Jamaika-Kreolisch 5 4 (28.57%)
Krimtatarisch 6 0 (0.00%) Galicisch 4 0 (0.00%)
Kroatisch 23 0 (0.00%) Altirisch 4 0 (0.00%)
Kumükisch 1 0 (0.00%) Nauruisch 4 0 (0.00%)
Kurdisch 15 30 (0.00%) Sardisch 4 0 (0.00%)
Ladinisch 2 0 (0.00%) Scots 4 20 (48.84%)
Laotisch 1 0 (0.00%) Suaheli 4 34 (77.27%)
Latein 165 638 (2.75%) Tatarisch 4 0 (0.00%)
Lettgallisch 3 0 (0.00%) Kasachisch 4 0 (0.00%)
Lettisch 20 32 (9.93%) Kirgisisch 4 0 (0.00%)
Levantinisches Arabisch 14 18 (0.41%) Inuktitut 4 0 (0.00%)
Litauisch 17 0 (0.00%) Nepalesisch 4 0 (0.00%)
Luwisch 2 2 (50.00%) Belutschi 4 4 (0.00%)
Luxemburgisch 5 16 (0.00%) Vestinisch 4 0 (0.00%)
Láadan 1 0 (0.00%) Lettgallisch 3 0 (0.00%)
Malaiisch 2 0 (0.00%) Hawaiianisch 3 0 (0.00%)
Malayalam 1 0 (0.00%) Maori 3 0 (0.00%)
Maledivisch 1 0 (0.00%) Volapük 3 0 (0.00%)
Maltesisch 11 0 (0.00%) Baschkirisch 3 0 (0.00%)
Mandschurisch 1 0 (0.00%) Abchasisch 3 0 (0.00%)
Manx 1 0 (0.00%) Adygeisch 3 0 (0.00%)
Maori 3 0 (0.00%) Altaisch 3 0 (0.00%)
Marathi 3 0 (0.00%) Burjatisch 3 0 (0.00%)
Mari 2 0 (0.00%) Kabardinisch 3 0 (0.00%)
Marsisch 8 0 (0.00%) Karatschai-Balkarisch 3 0 (0.00%)
Mazedonisch 25 0 (0.00%) Komi 3 0 (0.00%)
Mittelenglisch 2 0 (0.00%) Ossetisch 3 0 (0.00%)
Mittelgriechisch 1 0 (0.00%) Tschetschenisch 3 0 (0.00%)
Mittelhochdeutsch 10 0 (0.00%) Tschuktschisch 3 0 (0.00%)
Mittelniederdeutsch 2 4 (0.00%) Temascaltepec-Nahuatl 3 16 (25.00%)
Mokscha 2 0 (0.00%) Rätoromanisch 3 0 (0.00%)
Mongolisch 5 2 (0.00%) Marathi 3 0 (0.00%)
Morisien 1 0 (0.00%) Sanskrit 3 0 (0.00%)
Nahuatl 1 0 (0.00%) Orizaba-Nahuatl 3 6 (20.00%)
Nauruisch 4 0 (0.00%) Birmanisch 3 0 (0.00%)
Nepalesisch 4 0 (0.00%) Malaiisch 2 0 (0.00%)
Neugriechisch 45 128 (13.39%) Mittelenglisch 2 0 (0.00%)
Niederdeutsch 33 108 (6.76%) Kikuyu 2 0 (0.00%)
Niederländisch 73 44 (7.76%) Tetum 2 0 (0.00%)
Niedersorbisch 49 0 (0.00%) Mittelniederdeutsch 2 4 (0.00%)
Niueanisch 1 0 (0.00%) Schottisch-Gälisch 2 0 (0.00%)
Nordfriesisch 1 0 (0.00%) Tuvaluisch 2 0 (0.00%)
Norwegisch 22 6 (0.26%) Westflämisch 2 0 (0.00%)
Novial 1 0 (0.00%) Altfranzösisch 2 0 (0.00%)
Obersorbisch 29 0 (0.00%) Abasinisch 2 0 (0.00%)
Okzitanisch 15 14 (0.18%) Awarisch 2 0 (0.00%)
Orizaba-Nahuatl 3 6 (20.00%) Chantisch 2 0 (0.00%)
Oromo 1 0 (0.00%) Dunganisch 2 0 (0.00%)
Oskisch 5 0 (0.00%) Jakutisch 2 0 (0.00%)
Osmanisches Türkisch 1 2 (0.00%) Mari 2 0 (0.00%)
Ossetisch 3 0 (0.00%) Mokscha 2 0 (0.00%)
Pali 1 0 (0.00%) Tschuwaschisch 2 0 (0.00%)
Pandschabi 1 0 (0.00%) Tuwinisch 2 0 (0.00%)
Papiamentu 1 0 (0.00%) Udmurtisch 2 0 (0.00%)
Paschtu 6 60 (0.78%) Urum 2 0 (0.00%)
Pennsylvaniadeutsch 1 0 (0.00%) Sizilianisch 2 0 (0.00%)
Persisch 26 16 (0.72%) Zentrales Puebla-Nahuatl 2 4 (100.00%)
Piemontesisch 1 0 (0.00%) Acehnesisch 2 0 (0.00%)
Polabisch 2 0 (0.00%) Luwisch 2 2 (50.00%)
Polnisch 141 8 (0.03%) Telugu 2 0 (0.00%)
Portugiesisch 30 36 (5.08%) Uigurisch 2 0 (0.00%)
Prußisch 31 332 (13.49%) Sogdisch 2 0 (0.00%)
Rumänisch 25 122 (2.54%) Bengalisch 2 0 (0.00%)
Russisch 61 0 (0.00%) Polabisch 2 0 (0.00%)
Rätoromanisch 3 0 (0.00%) Ladinisch 2 0 (0.00%)
Sami 1 0 (0.00%) Asturisch 1 0 (0.00%)
Samoanisch 1 0 (0.00%) Haitianisch 1 0 (0.00%)
Sanskrit 3 0 (0.00%) Pennsylvaniadeutsch 1 0 (0.00%)
Sardisch 4 0 (0.00%) Nordfriesisch 1 0 (0.00%)
Schottisch-Gälisch 2 0 (0.00%) Balinesisch 1 0 (0.00%)
Schwedisch 104 0 (0.00%) Kornisch 1 0 (0.00%)
Scots 4 20 (48.84%) Huastekisches Ost-Nahuatl 1 2 (0.00%)
Serbisch 40 62 (0.36%) Tok Pisin 1 0 (0.00%)
Serbokroatisch 6 0 (0.00%) Niueanisch 1 0 (0.00%)
Sesotho 1 0 (0.00%) Papiamentu 1 0 (0.00%)
Shona 5 0 (0.00%) Malayalam 1 0 (0.00%)
Sindarin 1 0 (0.00%) Kannada 1 0 (0.00%)
Sindhi 6 6 (0.00%) Urartäisch 1 0 (0.00%)
Singhalesisch 1 0 (0.00%) Chakassisch 1 0 (0.00%)
Sizilianisch 2 0 (0.00%) Turkmenisch 1 0 (0.00%)
Slowakisch 32 0 (0.00%) Kumükisch 1 0 (0.00%)
Slowenisch 23 0 (0.00%) Nahuatl 1 0 (0.00%)
Sogdisch 2 0 (0.00%) Huastekisches West-Nahuatl 1 4 (0.00%)
Somalisch 1 0 (0.00%) Aragonesisch 1 0 (0.00%)
Spanisch 38 0 (0.00%) Korsisch 1 0 (0.00%)
Suaheli 4 34 (77.27%) Tamil 1 0 (0.00%)
Sumerisch 11 80 (25.00%) Sesotho 1 0 (0.00%)
Swanisch 1 0 (0.00%) Manx 1 0 (0.00%)
Südpikenisch 14 0 (0.00%) Samoanisch 1 0 (0.00%)
Tadschikisch 5 0 (0.00%) Somalisch 1 0 (0.00%)
Tagalog 5 6 (62.50%) isiZulu 1 0 (0.00%)
Tahitianisch 1 0 (0.00%) Sindarin 1 0 (0.00%)
Tamil 1 0 (0.00%) Hurritisch 1 0 (0.00%)
Tatarisch 4 0 (0.00%) Fulfulde 1 0 (0.00%)
Telugu 2 0 (0.00%) Bairisch 1 0 (0.00%)
Temascaltepec-Nahuatl 3 16 (25.00%) Pali 1 0 (0.00%)
Tetelcingo-Nahuatl 5 10 (77.78%) Sami 1 0 (0.00%)
Tetum 2 0 (0.00%) Altsächsisch 1 0 (0.00%)
Thai 9 0 (0.00%) Twi 1 0 (0.00%)
Tibetisch 1 0 (0.00%) Novial 1 0 (0.00%)
Tigrinya 1 0 (0.00%) Zentral-Alaska-Yupik 1 0 (0.00%)
Tok Pisin 1 0 (0.00%) Oromo 1 0 (0.00%)
Torwali 1 0 (0.00%) Swanisch 1 0 (0.00%)
Tschechisch 74 0 (0.00%) Gurage 1 0 (0.00%)
Tschetschenisch 3 0 (0.00%) Inupiaq 1 0 (0.00%)
Tschuktschisch 3 0 (0.00%) Khowar 1 0 (0.00%)
Tschuwaschisch 2 0 (0.00%) Torwali 1 0 (0.00%)
Turkmenisch 1 0 (0.00%) Assamesisch 1 0 (0.00%)
Tuvaluisch 2 0 (0.00%) Gujarati 1 0 (0.00%)
Tuwinisch 2 0 (0.00%) Pandschabi 1 0 (0.00%)
Twi 1 0 (0.00%) Laotisch 1 0 (0.00%)
Türkisch 27 80 (1.24%) Catawba 1 0 (0.00%)
Udmurtisch 2 0 (0.00%) Mittelgriechisch 1 0 (0.00%)
Ugaritisch 1 0 (0.00%) Guerrero-Nahuatl 1 4 (0.00%)
Uigurisch 2 0 (0.00%) Klamath 1 0 (0.00%)
Ukrainisch 48 0 (0.00%) Amharisch 1 0 (0.00%)
Umbrisch 6 0 (0.00%) Durango-Nahuatl 1 2 (0.00%)
Ungarisch 22 44 (4.03%) Piemontesisch 1 0 (0.00%)
Urartäisch 1 0 (0.00%) Alttschechisch 1 0 (0.00%)
Urdu 17 124 (2.18%) Kotava 1 2 (100.00%)
Urum 2 0 (0.00%) Láadan 1 0 (0.00%)
Usbekisch 17 0 (0.00%) Brahui 1 0 (0.00%)
Venezianisch 5 0 (0.00%) Morisien 1 0 (0.00%)
Vestinisch 4 0 (0.00%) Tahitianisch 1 0 (0.00%)
Vietnamesisch 10 0 (0.00%) Kirchenslawisch 1 0 (0.00%)
Volapük 3 0 (0.00%) Osmanisches Türkisch 1 2 (0.00%)
Volskisch 6 0 (0.00%) Kantonesisch 1 0 (0.00%)
Walisisch 12 0 (0.00%) Ugaritisch 1 0 (0.00%)
Weißrussisch 28 0 (0.00%) Mandschurisch 1 0 (0.00%)
West-Pandschabi 5 30 (3.33%) Tibetisch 1 0 (0.00%)
Westflämisch 2 0 (0.00%) Konkani 1 0 (0.00%)
Westfriesisch 9 26 (28.83%) Maledivisch 1 0 (0.00%)
Zentral-Alaska-Yupik 1 0 (0.00%) Baktrisch 1 0 (0.00%)
Zentral-Nahuatl 10 50 (30.56%) Tigrinya 1 0 (0.00%)
Zentrales Puebla-Nahuatl 2 4 (100.00%) Khmer 1 0 (0.00%)
isiZulu 1 0 (0.00%) Singhalesisch 1 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-01-08 from the dewiktionary dump dated 2026-01-02 using wiktextract (96027d6 and 9905b1f). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.