Wiktionary data extraction errors and warnings

Inflection check

List of different kinds of inflection tables. When wiktextract parses word heads and tables, it assigns the forms it encounters with tags that describe grammatical or contextual information. The tags and forms that are found in head sections and tables are kept separate from other head section and table tags, and later they are merged with other heads and tables into table types that all contain the same number of word forms with the same tags for those forms.

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typoes and badly formated Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.

Language ⏶ Table forms Errors (% affected words) Language Table forms ⏷ Errors (% affected words)
Afrikáans 2 0 (0.00%) Inglés 39 2 (0.58%)
Ainu 1 2 (100.00%) Español 35 50 (0.03%)
Albanés 3 0 (0.00%) Alemán 32 18 (0.77%)
Alemán 32 18 (0.77%) Sueco 21 8 (1.25%)
Alemán antiguo 1 0 (0.00%) Italiano 9 24 (0.99%)
Allentiac 1 0 (0.00%) Neerlandés 9 18 (1.62%)
Amárico 1 0 (0.00%) Portugués 9 2 (0.31%)
Aragonés 3 0 (0.00%) Francés 8 2 (0.27%)
Arameo 1 0 (0.00%) Bretón 8 18 (30.15%)
Armenio 1 0 (0.00%) Asturiano 7 2 (1.34%)
Arrumano 1 0 (0.00%) Manés 7 32 (7.32%)
Asturiano 7 2 (1.34%) Griego 7 2 (1.41%)
Avéstico 1 0 (0.00%) Catalán 6 2 (0.24%)
Azerí 2 0 (0.00%) Galés 6 22 (10.84%)
Bretón 8 18 (30.15%) Noruego bokmål 6 12 (1.64%)
Búlgaro 1 0 (0.00%) Polaco 6 0 (0.00%)
Caló 2 0 (0.00%) Irlandés 6 164 (18.64%)
Castellano antiguo 5 0 (0.00%) Ruso 6 2 (23.08%)
Catalán 6 2 (0.24%) Gallego 5 0 (0.00%)
Catalán antiguo 1 0 (0.00%) Occitano 5 0 (0.00%)
Chaná 1 2 (100.00%) Castellano antiguo 5 0 (0.00%)
Charrúa 1 2 (100.00%) Francés antiguo 5 2 (1.92%)
Checo 2 0 (0.00%) Francés medio 5 2 (5.03%)
Cheroqui 1 0 (0.00%) Gaélico escocés 5 32 (3.17%)
Chewa 1 0 (0.00%) Danés 5 6 (11.76%)
Coreano 1 2 (100.00%) Irlandés antiguo 5 0 (0.00%)
Corso 2 0 (0.00%) Rumano 4 0 (0.00%)
Cupeño 1 0 (0.00%) Náhuatl central 4 0 (0.00%)
Córnico 1 0 (0.00%) Hebreo 4 2 (13.36%)
Córnico (Kernewek Kemmyn) 1 0 (0.00%) Interlingua 3 4 (4.00%)
Córnico (Standard Written Form) 1 0 (0.00%) Islandés 3 0 (0.00%)
Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%) Aragonés 3 0 (0.00%)
Danés 5 6 (11.76%) Latín 3 6 (0.00%)
Dálmata 2 2 (6.67%) Extremeño 3 0 (0.00%)
Emiliano-romañol 2 2 (66.67%) Judeoespañol 3 2 (0.80%)
Escocés 2 8 (0.00%) Valón 3 2 (5.56%)
Eslovaco 1 0 (0.00%) Véneto 3 0 (0.00%)
Español 35 50 (0.03%) Esperanto 3 0 (0.00%)
Español (Ortografía de Bello) 1 0 (0.00%) Noruego nynorsk 3 2 (0.00%)
Esperanto 3 0 (0.00%) Normando 3 2 (2.44%)
Extremeño 3 0 (0.00%) Guaraní 3 4 (20.00%)
Feroés 1 0 (0.00%) Albanés 3 0 (0.00%)
Finés 1 0 (0.00%) Siciliano 3 2 (8.33%)
Francés 8 2 (0.27%) Klingon 3 2 (1.12%)
Francés antiguo 5 2 (1.92%) Árabe 3 2 (7.14%)
Francés medio 5 2 (5.03%) Maya yucateco 2 0 (0.00%)
Frisón 1 0 (0.00%) Náhuatl de la Huasteca central 2 0 (0.00%)
Friulano 2 2 (4.76%) Afrikáans 2 0 (0.00%)
Galaicoportugués 2 0 (0.00%) Azerí 2 0 (0.00%)
Gallego 5 0 (0.00%) Checo 2 0 (0.00%)
Galés 6 22 (10.84%) Escocés 2 8 (0.00%)
Galó 1 0 (0.00%) Japonés (Romaji) 2 20 (0.00%)
Gaélico escocés 5 32 (3.17%) Náhuatl clásico 2 0 (0.00%)
Georgiano 1 0 (0.00%) Náhuatl de la Huasteca oriental 2 0 (0.00%)
Griego 7 2 (1.41%) Inglés medio 2 0 (0.00%)
Griego antiguo 2 0 (0.00%) Náhuatl de la Huasteca occidental 2 0 (0.00%)
Groenlandés 1 0 (0.00%) Friulano 2 2 (4.76%)
Guaraní 3 4 (20.00%) Galaicoportugués 2 0 (0.00%)
Gótico 2 0 (0.00%) Italiano antiguo 2 0 (0.00%)
Gǀwi 1 0 (0.00%) Romanche 2 0 (0.00%)
Haida 1 0 (0.00%) Sardo 2 0 (0.00%)
Hausa 1 0 (0.00%) Maltés 2 0 (0.00%)
Hawaiano 1 0 (0.00%) Náhuatl de Guerrero 2 0 (0.00%)
Hebreo 4 2 (13.36%) Corso 2 0 (0.00%)
Hebreo antiguo 1 0 (0.00%) Persa 2 0 (0.00%)
Herero 1 0 (0.00%) Náhuatl clásico (Grafía normalizada) 2 0 (0.00%)
Hindi 1 0 (0.00%) Quechua cuzqueño 2 24 (66.67%)
Húngaro 1 0 (0.00%) Quechua de Huaylas Ancash 2 0 (0.00%)
Ido 1 0 (0.00%) Dálmata 2 2 (6.67%)
Inglés 39 2 (0.58%) Japonés (Hiragana) 2 20 (0.00%)
Inglés antiguo 1 0 (0.00%) Japonés (Mixta) 2 20 (0.00%)
Inglés medio 2 0 (0.00%) Griego antiguo 2 0 (0.00%)
Interlingua 3 4 (4.00%) Gótico 2 0 (0.00%)
Inuktitut 1 0 (0.00%) Caló 2 0 (0.00%)
Inuktitut (Alfabeto latino) 1 0 (0.00%) Emiliano-romañol 2 2 (66.67%)
Irlandés 6 164 (18.64%) Tamil 2 0 (0.00%)
Irlandés antiguo 5 0 (0.00%) Telugú 2 2 (0.00%)
Islandés 3 0 (0.00%) Eslovaco 1 0 (0.00%)
Istriorrumano 1 0 (0.00%) Vasco 1 0 (0.00%)
Italiano 9 24 (0.99%) Finés 1 0 (0.00%)
Italiano antiguo 2 0 (0.00%) Húngaro 1 0 (0.00%)
Japonés 1 20 (0.00%) Lituano 1 0 (0.00%)
Japonés (Hiragana) 2 20 (0.00%) Luxemburgués 1 0 (0.00%)
Japonés (Mixta) 2 20 (0.00%) Malgache 1 2 (0.00%)
Japonés (Romaji) 2 20 (0.00%) Serbocroata (Latino) 1 0 (0.00%)
Judeoespañol 3 2 (0.80%) Tagalo 1 0 (0.00%)
Judeoespañol (Ortografía turca) 1 0 (0.00%) Turco 1 0 (0.00%)
Kikapú 1 0 (0.00%) Navarro-aragonés 1 0 (0.00%)
Klingon 3 2 (1.12%) Volapuk 1 0 (0.00%)
Ladino 1 0 (0.00%) Frisón 1 0 (0.00%)
Latín 3 6 (0.00%) Translingüístico 1 2 (100.00%)
Leonés antiguo 1 0 (0.00%) Provenzal antiguo 1 0 (0.00%)
Letón 1 14 (0.00%) Córnico 1 0 (0.00%)
Ligur 1 0 (0.00%) Catalán antiguo 1 0 (0.00%)
Limburgués 1 0 (0.00%) Lingua franca nova 1 0 (0.00%)
Lingua franca nova 1 0 (0.00%) Napolitano 1 0 (0.00%)
Lituano 1 0 (0.00%) Hawaiano 1 0 (0.00%)
Luxemburgués 1 0 (0.00%) Letón 1 14 (0.00%)
Macedonio 1 0 (0.00%) Mirandés 1 0 (0.00%)
Malgache 1 2 (0.00%) Romaní (macrolengua) 1 0 (0.00%)
Maltés 2 0 (0.00%) Serbocroata 1 0 (0.00%)
Manés 7 32 (7.32%) Ladino 1 0 (0.00%)
Maorí 1 0 (0.00%) Groenlandés 1 0 (0.00%)
Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 1 0 (0.00%) Náhuatl de Orizaba 1 0 (0.00%)
Mapuche (Ortografía de Luis de Valdivia) 1 0 (0.00%) Suajili 1 0 (0.00%)
Maya yucateco 2 0 (0.00%) Prusiano antiguo 1 0 (0.00%)
Mirandés 1 0 (0.00%) Ido 1 0 (0.00%)
Mixteco del sur de Puebla 1 0 (0.00%) Alemán antiguo 1 0 (0.00%)
Napolitano 1 0 (0.00%) Leonés antiguo 1 0 (0.00%)
Navajo 1 0 (0.00%) Ligur 1 0 (0.00%)
Navarro-aragonés 1 0 (0.00%) Náhuatl del norte de Puebla 1 0 (0.00%)
Neerlandés 9 18 (1.62%) Náhuatl de Durango 1 0 (0.00%)
Neoarameo asirio 1 0 (0.00%) Navajo 1 0 (0.00%)
Normando 3 2 (2.44%) Cheroqui 1 0 (0.00%)
Noruego 1 0 (0.00%) Ucraniano 1 0 (0.00%)
Noruego bokmål 6 12 (1.64%) Inuktitut 1 0 (0.00%)
Noruego nynorsk 3 2 (0.00%) Haida 1 0 (0.00%)
Náhuatl central 4 0 (0.00%) Xhosa 1 0 (0.00%)
Náhuatl clásico 2 0 (0.00%) Hindi 1 0 (0.00%)
Náhuatl clásico (Grafía normalizada) 2 0 (0.00%) Náhuatl de Tetelcingo 1 0 (0.00%)
Náhuatl de Durango 1 0 (0.00%) Galó 1 0 (0.00%)
Náhuatl de Guerrero 2 0 (0.00%) Japonés 1 20 (0.00%)
Náhuatl de Orizaba 1 0 (0.00%) Coreano 1 2 (100.00%)
Náhuatl de Tetelcingo 1 0 (0.00%) Mixteco del sur de Puebla 1 0 (0.00%)
Náhuatl de la Huasteca central 2 0 (0.00%) Quechua ayacuchano 1 24 (100.00%)
Náhuatl de la Huasteca occidental 2 0 (0.00%) Feroés 1 0 (0.00%)
Náhuatl de la Huasteca oriental 2 0 (0.00%) Hausa 1 0 (0.00%)
Náhuatl del norte de Puebla 1 0 (0.00%) Nórdico antiguo 1 0 (0.00%)
Nórdico antiguo 1 0 (0.00%) Georgiano 1 0 (0.00%)
Occitano 5 0 (0.00%) Rapanuí 1 0 (0.00%)
Persa 2 0 (0.00%) Allentiac 1 0 (0.00%)
Polaco 6 0 (0.00%) Ainu 1 2 (100.00%)
Portugués 9 2 (0.31%) Arrumano 1 0 (0.00%)
Protoindoeuropeo 1 2 (0.00%) Istriorrumano 1 0 (0.00%)
Provenzal antiguo 1 0 (0.00%) Charrúa 1 2 (100.00%)
Prusiano antiguo 1 0 (0.00%) Maorí 1 0 (0.00%)
Quechua ayacuchano 1 24 (100.00%) Inuktitut (Alfabeto latino) 1 0 (0.00%)
Quechua cuzqueño 2 24 (66.67%) Limburgués 1 0 (0.00%)
Quechua de Huaylas Ancash 2 0 (0.00%) Búlgaro 1 0 (0.00%)
Rapanuí 1 0 (0.00%) Córnico (Standard Written Form) 1 0 (0.00%)
Romanche 2 0 (0.00%) Inglés antiguo 1 0 (0.00%)
Romaní (macrolengua) 1 0 (0.00%) Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 1 0 (0.00%)
Rumano 4 0 (0.00%) Español (Ortografía de Bello) 1 0 (0.00%)
Ruso 6 2 (23.08%) Córnico (Kernewek Kemmyn) 1 0 (0.00%)
Sardo 2 0 (0.00%) Chewa 1 0 (0.00%)
Serbocroata 1 0 (0.00%) Hebreo antiguo 1 0 (0.00%)
Serbocroata (Cirílico) 1 0 (0.00%) Arameo 1 0 (0.00%)
Serbocroata (Círilico) 1 0 (0.00%) Ídish 1 0 (0.00%)
Serbocroata (Latino) 1 0 (0.00%) Avéstico 1 0 (0.00%)
Shawi 1 0 (0.00%) Urdu 1 0 (0.00%)
Siciliano 3 2 (8.33%) Amárico 1 0 (0.00%)
Siríaco clásico 1 0 (0.00%) Macedonio 1 0 (0.00%)
Suajili 1 0 (0.00%) Armenio 1 0 (0.00%)
Sueco 21 8 (1.25%) Protoindoeuropeo 1 2 (0.00%)
Tagalo 1 0 (0.00%) Serbocroata (Cirílico) 1 0 (0.00%)
Tamil 2 0 (0.00%) Tehuelche 1 0 (0.00%)
Tehuelche 1 0 (0.00%) Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%)
Telugú 2 2 (0.00%) Noruego 1 0 (0.00%)
Translingüístico 1 2 (100.00%) Neoarameo asirio 1 0 (0.00%)
Turco 1 0 (0.00%) Siríaco clásico 1 0 (0.00%)
Ucraniano 1 0 (0.00%) Cupeño 1 0 (0.00%)
Urdu 1 0 (0.00%) Kikapú 1 0 (0.00%)
Valón 3 2 (5.56%) Gǀwi 1 0 (0.00%)
Vasco 1 0 (0.00%) Herero 1 0 (0.00%)
Volapuk 1 0 (0.00%) Chaná 1 2 (100.00%)
Véneto 3 0 (0.00%) Mapuche (Ortografía de Luis de Valdivia) 1 0 (0.00%)
Xhosa 1 0 (0.00%) Judeoespañol (Ortografía turca) 1 0 (0.00%)
unknown 1 0 (0.00%) Shawi 1 0 (0.00%)
Árabe 3 2 (7.14%) unknown 1 0 (0.00%)
Ídish 1 0 (0.00%) Serbocroata (Círilico) 1 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-12-31 from the eswiktionary dump dated 2025-12-21 using wiktextract (e97c820 and 9905b1f). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.