Wiktionary data extraction errors and warnings

Inflection check

List of different kinds of inflection tables. When wiktextract parses word heads and tables, it assigns the forms it encounters with tags that describe grammatical or contextual information. The tags and forms that are found in head sections and tables are kept separate from other head section and table tags, and later they are merged with other heads and tables into table types that all contain the same number of word forms with the same tags for those forms.

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typoes and badly formated Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.

Language ⏶ Table forms Errors (% affected words) Language Table forms ⏷ Errors (% affected words)
Afrikáans 2 0 (0.00%) Español 39 1138 (27.58%)
Ainu 1 2 (100.00%) Latín 31 140 (0.09%)
Albanés 3 20 (91.43%) Alemán 21 88 (1.28%)
Alemán 21 88 (1.28%) Sueco 17 230 (99.89%)
Alemán antiguo 1 0 (0.00%) Inglés 11 16 (3.25%)
Allentiac 1 0 (0.00%) Checo 11 120 (0.00%)
Amárico 1 0 (0.00%) Irlandés 11 182 (82.30%)
Aragonés 3 0 (0.00%) Francés 10 4 (3.81%)
Arameo 1 0 (0.00%) Rumano 10 114 (21.74%)
Armenio 1 0 (0.00%) Polaco 9 14 (0.00%)
Arrumano 1 0 (0.00%) Eslovaco 8 62 (6.33%)
Asturiano 6 2 (3.62%) Italiano 8 46 (5.28%)
Avéstico 1 0 (0.00%) Neerlandés 8 18 (27.24%)
Azerí 2 0 (0.00%) Gaélico escocés 8 4 (3.85%)
Bretón 6 20 (30.15%) Vasco 7 226 (26.91%)
Búlgaro 1 0 (0.00%) Asturiano 6 2 (3.62%)
Caló 3 2 (25.00%) Bretón 6 20 (30.15%)
Castellano antiguo 3 0 (0.00%) Noruego bokmål 6 12 (1.64%)
Catalán 5 2 (0.91%) Portugués 6 2 (1.46%)
Catalán antiguo 1 0 (0.00%) Ruso 6 12 (23.08%)
Chaná 1 2 (100.00%) Griego 6 34 (6.84%)
Charrúa 1 2 (100.00%) Griego antiguo 6 68 (44.07%)
Checo 11 120 (0.00%) Catalán 5 2 (0.91%)
Cheroqui 1 0 (0.00%) Manés 5 18 (9.76%)
Chewa 1 0 (0.00%) Francés antiguo 5 2 (1.92%)
Coreano 1 2 (100.00%) Danés 5 30 (96.75%)
Corso 1 0 (0.00%) Irlandés antiguo 5 24 (83.33%)
Cupeño 1 0 (0.00%) Maya yucateco 4 0 (0.00%)
Córnico 1 0 (0.00%) Galés 4 18 (33.33%)
Córnico (Kernewek Kemmyn) 1 0 (0.00%) Francés medio 4 2 (3.95%)
Córnico (Standard Written Form) 1 0 (0.00%) Náhuatl central 4 0 (0.00%)
Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%) Esloveno 3 48 (4.55%)
Córnico (Standard Written Form, Kernewek Kemmyn, Unified Cornish Revised) 1 0 (0.00%) Gallego 3 2 (0.75%)
Córnico (Standard Written Form, Unified Cornish Revised) 1 0 (0.00%) Islandés 3 16 (84.62%)
Danés 5 30 (96.75%) Japonés (Romaji) 3 24 (0.00%)
Deg xinag 1 0 (0.00%) Occitano 3 0 (0.00%)
Dálmata 2 2 (6.67%) Castellano antiguo 3 0 (0.00%)
Emiliano-romañol 2 2 (66.67%) Aragonés 3 0 (0.00%)
Escocés 2 8 (0.00%) Judeoespañol 3 2 (0.81%)
Eslovaco 8 62 (6.33%) Valón 3 2 (2.78%)
Esloveno 3 48 (4.55%) Esperanto 3 0 (0.00%)
Español 39 1138 (27.58%) Noruego nynorsk 3 2 (0.00%)
Español (Ortografía de Bello) 1 0 (0.00%) Normando 3 2 (2.47%)
Esperanto 3 0 (0.00%) Albanés 3 20 (91.43%)
Extremeño 2 0 (0.00%) Siciliano 3 2 (8.33%)
Feroés 1 0 (0.00%) Hebreo 3 2 (12.92%)
Finés 1 0 (0.00%) Gótico 3 8 (0.00%)
Francés 10 4 (3.81%) Caló 3 2 (25.00%)
Francés antiguo 5 2 (1.92%) Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 3 4 (95.24%)
Francés medio 4 2 (3.95%) Árabe 3 2 (6.67%)
Frisón 1 0 (0.00%) Náhuatl de la Huasteca central 2 0 (0.00%)
Friulano 2 2 (4.55%) Afrikáans 2 0 (0.00%)
Galaicoportugués 2 0 (0.00%) Azerí 2 0 (0.00%)
Gallego 3 2 (0.75%) Escocés 2 8 (0.00%)
Galés 4 18 (33.33%) Interlingua 2 4 (4.35%)
Galó 1 0 (0.00%) Lituano 2 0 (0.00%)
Gaélico escocés 8 4 (3.85%) Turco 2 2 (8.33%)
Georgiano 1 0 (0.00%) Náhuatl clásico 2 0 (0.00%)
Griego 6 34 (6.84%) Náhuatl de la Huasteca oriental 2 0 (0.00%)
Griego antiguo 6 68 (44.07%) Extremeño 2 0 (0.00%)
Groenlandés 1 0 (0.00%) Inglés medio 2 2 (10.00%)
Guaraní 2 4 (75.00%) Náhuatl de la Huasteca occidental 2 0 (0.00%)
Gótico 3 8 (0.00%) Volapuk 2 10 (96.23%)
Gǀwi 1 0 (0.00%) Friulano 2 2 (4.55%)
Haida 1 4 (100.00%) Mapuche (Grafemario Raguileo, Grafemario Azümchefe, Alfabeto Unificado) 2 4 (100.00%)
Hausa 1 0 (0.00%) Galaicoportugués 2 0 (0.00%)
Hawaiano 1 0 (0.00%) Véneto 2 0 (0.00%)
Hebreo 3 2 (12.92%) Italiano antiguo 2 0 (0.00%)
Hebreo antiguo 1 0 (0.00%) Romanche 2 0 (0.00%)
Herero 1 0 (0.00%) Sardo 2 0 (0.00%)
Hindi 1 0 (0.00%) Mapuche (Alfabeto Unificado) 2 4 (100.00%)
Húngaro 1 0 (0.00%) Maltés 2 0 (0.00%)
Ido 1 0 (0.00%) Guaraní 2 4 (75.00%)
Inglés 11 16 (3.25%) Náhuatl de Guerrero 2 0 (0.00%)
Inglés antiguo 1 0 (0.00%) Klingon 2 2 (1.13%)
Inglés medio 2 2 (10.00%) Persa 2 0 (0.00%)
Interlingua 2 4 (4.35%) Náhuatl clásico (Grafía normalizada) 2 0 (0.00%)
Inuktitut 1 0 (0.00%) Quechua cuzqueño 2 36 (66.67%)
Inuktitut (Alfabeto latino) 1 0 (0.00%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Grafemario Raguileo) 2 4 (100.00%)
Irlandés 11 182 (82.30%) Mapuche 2 4 (100.00%)
Irlandés antiguo 5 24 (83.33%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe) 2 4 (100.00%)
Islandés 3 16 (84.62%) Dálmata 2 2 (6.67%)
Istriorrumano 1 0 (0.00%) Japonés (Hiragana) 2 24 (0.00%)
Italiano 8 46 (5.28%) Rapa nui 2 6 (8.00%)
Italiano antiguo 2 0 (0.00%) Japonés (Mixta) 2 24 (0.00%)
Japonés 1 24 (0.00%) Emiliano-romañol 2 2 (66.67%)
Japonés (Hiragana) 2 24 (0.00%) Mapuche (Grafemario Raguileo) 2 4 (100.00%)
Japonés (Mixta) 2 24 (0.00%) Tamil 2 2 (16.67%)
Japonés (Romaji) 3 24 (0.00%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Grafemario Azümchefe, Alfabeto Unificado) 2 4 (100.00%)
Judeoespañol 3 2 (0.81%) unknown 2 0 (0.00%)
Kikapú 1 0 (0.00%) Mapuche (Grafemario Raguileo, Grafemario Raguileo) 2 4 (100.00%)
Klingon 2 2 (1.13%) Finés 1 0 (0.00%)
Ladino 1 0 (0.00%) Húngaro 1 0 (0.00%)
Latín 31 140 (0.09%) Luxemburgués 1 0 (0.00%)
Latín (Medieval) 1 2 (0.00%) Malgache 1 2 (0.00%)
Latín medieval 1 2 (0.00%) Serbocroata (Latino) 1 0 (0.00%)
Latín vulgar 1 2 (0.00%) Tagalo 1 0 (0.00%)
Leonés antiguo 1 0 (0.00%) Navarro-aragonés 1 0 (0.00%)
Letón 1 14 (0.00%) Frisón 1 0 (0.00%)
Ligur 1 0 (0.00%) Provenzal antiguo 1 0 (0.00%)
Limburgués 1 0 (0.00%) Córnico 1 0 (0.00%)
Lingua franca nova 1 0 (0.00%) Catalán antiguo 1 0 (0.00%)
Lituano 2 0 (0.00%) Lingua franca nova 1 0 (0.00%)
Luxemburgués 1 0 (0.00%) Napolitano 1 0 (0.00%)
Macedonio 1 0 (0.00%) Hawaiano 1 0 (0.00%)
Malgache 1 2 (0.00%) Letón 1 14 (0.00%)
Maltés 2 0 (0.00%) Mirandés 1 0 (0.00%)
Manés 5 18 (9.76%) Romaní (macrolengua) 1 0 (0.00%)
Maorí 1 0 (0.00%) Serbocroata 1 0 (0.00%)
Mapuche 2 4 (100.00%) Ladino 1 0 (0.00%)
Mapuche (Alfabeto Unificado) 2 4 (100.00%) Groenlandés 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Alfabeto Unificado) 1 2 (100.00%) Náhuatl de Orizaba 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe) 2 4 (100.00%) Suajili 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Alfabeto Unificado) 1 2 (100.00%) Prusiano antiguo 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Grafemario Azümchefe, Alfabeto Unificado) 2 4 (100.00%) Ido 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Grafemario Raguileo) 2 4 (100.00%) Corso 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Raguileo) 1 2 (100.00%) Alemán antiguo 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Raguileo, Alfabeto Unificado) 1 2 (100.00%) Leonés antiguo 1 0 (0.00%)
Mapuche (Grafemario Azümchefe) 1 2 (100.00%) Hindi 1 0 (0.00%)
Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 3 4 (95.24%) Ligur 1 0 (0.00%)
Mapuche (Grafemario Azümchefe, Grafemario Azümchefe) 1 2 (100.00%) Náhuatl del norte de Puebla 1 0 (0.00%)
Mapuche (Grafemario Azümchefe, Grafemario Raguileo) 1 2 (100.00%) Náhuatl de Durango 1 0 (0.00%)
Mapuche (Grafemario Raguileo) 2 4 (100.00%) Navajo 1 0 (0.00%)
Mapuche (Grafemario Raguileo, Grafemario Azümchefe, Alfabeto Unificado) 2 4 (100.00%) Cheroqui 1 0 (0.00%)
Mapuche (Grafemario Raguileo, Grafemario Raguileo) 2 4 (100.00%) Ucraniano 1 0 (0.00%)
Mapuche (Ortografía de Luis de Valdivia) 1 0 (0.00%) Inuktitut 1 0 (0.00%)
Maya yucateco 4 0 (0.00%) Haida 1 4 (100.00%)
Mirandés 1 0 (0.00%) Xhosa 1 0 (0.00%)
Mixteco del sur de Puebla 1 0 (0.00%) Náhuatl de Tetelcingo 1 0 (0.00%)
Mochica 1 2 (0.00%) Translingüístico 1 4 (100.00%)
Napolitano 1 0 (0.00%) Galó 1 0 (0.00%)
Navajo 1 0 (0.00%) Japonés 1 24 (0.00%)
Navarro-aragonés 1 0 (0.00%) Coreano 1 2 (100.00%)
Neerlandés 8 18 (27.24%) Mixteco del sur de Puebla 1 0 (0.00%)
Neoarameo asirio 1 0 (0.00%) Quechua de Huaylas Ancash 1 0 (0.00%)
Neolatín 1 2 (0.00%) Quechua ayacuchano 1 36 (100.00%)
Normando 3 2 (2.47%) Feroés 1 0 (0.00%)
Noruego 1 0 (0.00%) Hausa 1 0 (0.00%)
Noruego bokmål 6 12 (1.64%) Nórdico antiguo 1 0 (0.00%)
Noruego nynorsk 3 2 (0.00%) Córnico (Standard Written Form, Kernewek Kemmyn, Unified Cornish Revised) 1 0 (0.00%)
Náhuatl central 4 0 (0.00%) Georgiano 1 0 (0.00%)
Náhuatl clásico 2 0 (0.00%) Allentiac 1 0 (0.00%)
Náhuatl clásico (Grafía normalizada) 2 0 (0.00%) Ainu 1 2 (100.00%)
Náhuatl de Durango 1 0 (0.00%) Arrumano 1 0 (0.00%)
Náhuatl de Guerrero 2 0 (0.00%) Istriorrumano 1 0 (0.00%)
Náhuatl de Orizaba 1 0 (0.00%) Charrúa 1 2 (100.00%)
Náhuatl de Tetelcingo 1 0 (0.00%) Maorí 1 0 (0.00%)
Náhuatl de la Huasteca central 2 0 (0.00%) Inuktitut (Alfabeto latino) 1 0 (0.00%)
Náhuatl de la Huasteca occidental 2 0 (0.00%) Limburgués 1 0 (0.00%)
Náhuatl de la Huasteca oriental 2 0 (0.00%) Búlgaro 1 0 (0.00%)
Náhuatl del norte de Puebla 1 0 (0.00%) Córnico (Standard Written Form) 1 0 (0.00%)
Nórdico antiguo 1 0 (0.00%) Mochica 1 2 (0.00%)
Occitano 3 0 (0.00%) Inglés antiguo 1 0 (0.00%)
Persa 2 0 (0.00%) Español (Ortografía de Bello) 1 0 (0.00%)
Polaco 9 14 (0.00%) Córnico (Kernewek Kemmyn) 1 0 (0.00%)
Portugués 6 2 (1.46%) Chewa 1 0 (0.00%)
Provenzal antiguo 1 0 (0.00%) Hebreo antiguo 1 0 (0.00%)
Prusiano antiguo 1 0 (0.00%) Arameo 1 0 (0.00%)
Quechua ayacuchano 1 36 (100.00%) Ídish 1 0 (0.00%)
Quechua cuzqueño 2 36 (66.67%) Avéstico 1 0 (0.00%)
Quechua de Huaylas Ancash 1 0 (0.00%) Urdu 1 0 (0.00%)
Rapa nui 2 6 (8.00%) Amárico 1 0 (0.00%)
Romanche 2 0 (0.00%) Macedonio 1 0 (0.00%)
Romaní (macrolengua) 1 0 (0.00%) Armenio 1 0 (0.00%)
Rumano 10 114 (21.74%) Serbocroata (Cirílico) 1 0 (0.00%)
Ruso 6 12 (23.08%) Tehuelche 1 0 (0.00%)
Sardo 2 0 (0.00%) Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%)
Serbocroata 1 0 (0.00%) Mapuche (Grafemario Azümchefe) 1 2 (100.00%)
Serbocroata (Cirílico) 1 0 (0.00%) Mapuche (Grafemario Azümchefe, Grafemario Azümchefe) 1 2 (100.00%)
Serbocroata (Círilico) 1 0 (0.00%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Alfabeto Unificado) 1 2 (100.00%)
Serbocroata (Latino) 1 0 (0.00%) Latín vulgar 1 2 (0.00%)
Shawi 1 0 (0.00%) Córnico (Standard Written Form, Unified Cornish Revised) 1 0 (0.00%)
Siciliano 3 2 (8.33%) Noruego 1 0 (0.00%)
Siríaco clásico 1 0 (0.00%) Neoarameo asirio 1 0 (0.00%)
Suajili 1 0 (0.00%) Siríaco clásico 1 0 (0.00%)
Sueco 17 230 (99.89%) Mapuche (Alfabeto Unificado, Grafemario Raguileo, Alfabeto Unificado) 1 2 (100.00%)
Tagalo 1 0 (0.00%) Mapuche (Grafemario Azümchefe, Grafemario Raguileo) 1 2 (100.00%)
Tamil 2 2 (16.67%) Neolatín 1 2 (0.00%)
Tehuelche 1 0 (0.00%) Cupeño 1 0 (0.00%)
Translingüístico 1 4 (100.00%) Kikapú 1 0 (0.00%)
Turco 2 2 (8.33%) Gǀwi 1 0 (0.00%)
Ucraniano 1 0 (0.00%) Herero 1 0 (0.00%)
Urdu 1 0 (0.00%) Chaná 1 2 (100.00%)
Valón 3 2 (2.78%) Mapuche (Alfabeto Unificado, Alfabeto Unificado) 1 2 (100.00%)
Vasco 7 226 (26.91%) Deg xinag 1 0 (0.00%)
Volapuk 2 10 (96.23%) Mapuche (Ortografía de Luis de Valdivia) 1 0 (0.00%)
Véneto 2 0 (0.00%) Latín (Medieval) 1 2 (0.00%)
Xhosa 1 0 (0.00%) Mapuche (Alfabeto Unificado, Grafemario Raguileo) 1 2 (100.00%)
unknown 2 0 (0.00%) Shawi 1 0 (0.00%)
Árabe 3 2 (6.67%) Latín medieval 1 2 (0.00%)
Ídish 1 0 (0.00%) Serbocroata (Círilico) 1 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-03-08 from the eswiktionary dump dated 2025-03-02 using wiktextract (32c88e6 and 633533e). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.