Wiktionary data extraction errors and warnings

Inflection check

List of different kinds of inflection tables. When wiktextract parses word heads and tables, it assigns the forms it encounters with tags that describe grammatical or contextual information. The tags and forms that are found in head sections and tables are kept separate from other head section and table tags, and later they are merged with other heads and tables into table types that all contain the same number of word forms with the same tags for those forms.

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typoes and badly formated Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.

Language ⏶ Table forms Errors (% affected words) Language Table forms ⏷ Errors (% affected words)
Afrikáans 2 0 (0.00%) Inglés 39 2 (0.59%)
Ainu 1 2 (100.00%) Español 36 50 (0.03%)
Albanés 3 0 (0.00%) Latín 32 76 (0.00%)
Alemán 32 18 (0.77%) Alemán 32 18 (0.77%)
Alemán antiguo 1 0 (0.00%) Sueco 20 8 (1.25%)
Allentiac 1 0 (0.00%) Irlandés 15 174 (1.96%)
Amárico 1 0 (0.00%) Rumano 11 20 (20.95%)
Aragonés 3 0 (0.00%) Gaélico escocés 11 32 (1.54%)
Arameo 1 0 (0.00%) Italiano 9 24 (1.00%)
Armenio 1 0 (0.00%) Neerlandés 9 18 (1.62%)
Arrumano 1 0 (0.00%) Polaco 9 0 (0.00%)
Asturiano 7 2 (1.36%) Portugués 9 2 (0.31%)
Avéstico 1 0 (0.00%) Francés 8 2 (0.27%)
Azerí 2 0 (0.00%) Vasco 8 0 (0.00%)
Bretón 6 18 (30.15%) Asturiano 7 2 (1.36%)
Búlgaro 1 0 (0.00%) Manés 7 32 (7.32%)
Caló 2 0 (0.00%) Danés 7 6 (3.68%)
Castellano antiguo 5 0 (0.00%) Griego 7 2 (1.41%)
Catalán 6 2 (0.24%) Bretón 6 18 (30.15%)
Catalán antiguo 1 0 (0.00%) Catalán 6 2 (0.24%)
Chaná 1 2 (100.00%) Noruego bokmål 6 12 (1.64%)
Charrúa 1 2 (100.00%) Ruso 6 2 (23.08%)
Checo 5 0 (0.00%) Griego antiguo 6 0 (0.00%)
Cheroqui 1 0 (0.00%) Checo 5 0 (0.00%)
Chewa 1 0 (0.00%) Gallego 5 0 (0.00%)
Coreano 1 2 (100.00%) Occitano 5 0 (0.00%)
Corso 2 0 (0.00%) Castellano antiguo 5 0 (0.00%)
Cupeño 1 0 (0.00%) Francés antiguo 5 2 (1.92%)
Córnico 1 0 (0.00%) Francés medio 5 2 (5.03%)
Córnico (Kernewek Kemmyn) 1 0 (0.00%) Esperanto 5 0 (0.00%)
Córnico (Standard Written Form) 1 0 (0.00%) Irlandés antiguo 5 0 (0.00%)
Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%) Galés 4 18 (33.33%)
Danés 7 6 (3.68%) Náhuatl central 4 0 (0.00%)
Dálmata 2 2 (6.67%) Hebreo 4 2 (13.38%)
Emiliano-romañol 2 2 (66.67%) Interlingua 3 4 (4.35%)
Escocés 2 8 (0.00%) Islandés 3 0 (0.00%)
Eslovaco 1 0 (0.00%) Aragonés 3 0 (0.00%)
Español 36 50 (0.03%) Extremeño 3 0 (0.00%)
Español (Ortografía de Bello) 1 0 (0.00%) Judeoespañol 3 2 (0.81%)
Esperanto 5 0 (0.00%) Valón 3 2 (5.56%)
Extremeño 3 0 (0.00%) Véneto 3 0 (0.00%)
Feroés 1 0 (0.00%) Noruego nynorsk 3 2 (0.00%)
Finés 1 0 (0.00%) Normando 3 2 (2.44%)
Francés 8 2 (0.27%) Guaraní 3 4 (20.00%)
Francés antiguo 5 2 (1.92%) Albanés 3 0 (0.00%)
Francés medio 5 2 (5.03%) Siciliano 3 2 (8.33%)
Frisón 1 0 (0.00%) Klingon 3 2 (1.12%)
Friulano 2 2 (4.76%) Gótico 3 0 (0.00%)
Galaicoportugués 2 0 (0.00%) Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 3 4 (95.24%)
Gallego 5 0 (0.00%) Árabe 3 2 (7.14%)
Galés 4 18 (33.33%) Maya yucateco 2 36 (0.00%)
Galó 1 0 (0.00%) Náhuatl de la Huasteca central 2 0 (0.00%)
Gaélico escocés 11 32 (1.54%) Afrikáans 2 0 (0.00%)
Georgiano 1 0 (0.00%) Azerí 2 0 (0.00%)
Griego 7 2 (1.41%) Escocés 2 8 (0.00%)
Griego antiguo 6 0 (0.00%) Japonés (Romaji) 2 20 (0.00%)
Groenlandés 1 0 (0.00%) Lituano 2 0 (0.00%)
Guaraní 3 4 (20.00%) Náhuatl clásico 2 0 (0.00%)
Gótico 3 0 (0.00%) Náhuatl de la Huasteca oriental 2 0 (0.00%)
Gǀwi 1 0 (0.00%) Inglés medio 2 0 (0.00%)
Haida 1 0 (0.00%) Náhuatl de la Huasteca occidental 2 0 (0.00%)
Hausa 1 0 (0.00%) Volapuk 2 10 (98.25%)
Hawaiano 1 0 (0.00%) Friulano 2 2 (4.76%)
Hebreo 4 2 (13.38%) Galaicoportugués 2 0 (0.00%)
Hebreo antiguo 1 0 (0.00%) Italiano antiguo 2 0 (0.00%)
Herero 1 0 (0.00%) Romanche 2 0 (0.00%)
Hindi 1 0 (0.00%) Sardo 2 0 (0.00%)
Húngaro 1 0 (0.00%) Mapuche (Alfabeto Unificado) 2 4 (100.00%)
Ido 1 0 (0.00%) Maltés 2 0 (0.00%)
Inglés 39 2 (0.59%) Náhuatl de Guerrero 2 0 (0.00%)
Inglés antiguo 1 0 (0.00%) Corso 2 0 (0.00%)
Inglés medio 2 0 (0.00%) Persa 2 0 (0.00%)
Interlingua 3 4 (4.35%) Náhuatl clásico (Grafía normalizada) 2 0 (0.00%)
Inuktitut 1 0 (0.00%) Quechua cuzqueño 2 24 (66.67%)
Inuktitut (Alfabeto latino) 1 0 (0.00%) Mapuche 2 4 (100.00%)
Irlandés 15 174 (1.96%) Quechua de Huaylas Ancash 2 0 (0.00%)
Irlandés antiguo 5 0 (0.00%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe) 2 4 (100.00%)
Islandés 3 0 (0.00%) Dálmata 2 2 (6.67%)
Istriorrumano 1 0 (0.00%) Japonés (Hiragana) 2 20 (0.00%)
Italiano 9 24 (1.00%) Japonés (Mixta) 2 20 (0.00%)
Italiano antiguo 2 0 (0.00%) Rapa nui 2 6 (8.00%)
Japonés 1 20 (0.00%) Caló 2 0 (0.00%)
Japonés (Hiragana) 2 20 (0.00%) Emiliano-romañol 2 2 (66.67%)
Japonés (Mixta) 2 20 (0.00%) Mapuche (Grafemario Raguileo) 2 4 (100.00%)
Japonés (Romaji) 2 20 (0.00%) Tamil 2 0 (0.00%)
Judeoespañol 3 2 (0.81%) Telugú 2 2 (0.00%)
Judeoespañol (Ortografía turca) 1 0 (0.00%) Mapuche (Grafemario Raguileo, Grafemario Raguileo) 2 4 (100.00%)
Kikapú 1 0 (0.00%) Eslovaco 1 0 (0.00%)
Klingon 3 2 (1.12%) Finés 1 0 (0.00%)
Ladino 1 0 (0.00%) Húngaro 1 0 (0.00%)
Latín 32 76 (0.00%) Luxemburgués 1 0 (0.00%)
Latín (Medieval) 1 2 (0.00%) Malgache 1 2 (0.00%)
Latín medieval 1 2 (0.00%) Serbocroata (Latino) 1 0 (0.00%)
Latín vulgar 1 2 (0.00%) Tagalo 1 0 (0.00%)
Leonés antiguo 1 0 (0.00%) Turco 1 0 (0.00%)
Letón 1 14 (0.00%) Navarro-aragonés 1 0 (0.00%)
Ligur 1 0 (0.00%) Frisón 1 0 (0.00%)
Limburgués 1 0 (0.00%) Translingüístico 1 4 (100.00%)
Lingua franca nova 1 0 (0.00%) Provenzal antiguo 1 0 (0.00%)
Lituano 2 0 (0.00%) Córnico 1 0 (0.00%)
Luxemburgués 1 0 (0.00%) Catalán antiguo 1 0 (0.00%)
Macedonio 1 0 (0.00%) Lingua franca nova 1 0 (0.00%)
Malgache 1 2 (0.00%) Napolitano 1 0 (0.00%)
Maltés 2 0 (0.00%) Hawaiano 1 0 (0.00%)
Manés 7 32 (7.32%) Letón 1 14 (0.00%)
Maorí 1 0 (0.00%) Mirandés 1 0 (0.00%)
Mapuche 2 4 (100.00%) Romaní (macrolengua) 1 0 (0.00%)
Mapuche (Alfabeto Unificado) 2 4 (100.00%) Serbocroata 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Alfabeto Unificado) 1 2 (100.00%) Ladino 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe) 2 4 (100.00%) Groenlandés 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Raguileo) 1 2 (100.00%) Náhuatl de Orizaba 1 0 (0.00%)
Mapuche (Grafemario Azümchefe) 1 2 (100.00%) Suajili 1 0 (0.00%)
Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 3 4 (95.24%) Prusiano antiguo 1 0 (0.00%)
Mapuche (Grafemario Raguileo) 2 4 (100.00%) Ido 1 0 (0.00%)
Mapuche (Grafemario Raguileo, Grafemario Raguileo) 2 4 (100.00%) Alemán antiguo 1 0 (0.00%)
Mapuche (Ortografía de Luis de Valdivia) 1 0 (0.00%) Leonés antiguo 1 0 (0.00%)
Maya yucateco 2 36 (0.00%) Ligur 1 0 (0.00%)
Mirandés 1 0 (0.00%) Náhuatl del norte de Puebla 1 0 (0.00%)
Mixteco del sur de Puebla 1 0 (0.00%) Náhuatl de Durango 1 0 (0.00%)
Mochica 1 2 (0.00%) Navajo 1 0 (0.00%)
Napolitano 1 0 (0.00%) Cheroqui 1 0 (0.00%)
Navajo 1 0 (0.00%) Ucraniano 1 0 (0.00%)
Navarro-aragonés 1 0 (0.00%) Inuktitut 1 0 (0.00%)
Neerlandés 9 18 (1.62%) Haida 1 0 (0.00%)
Neoarameo asirio 1 0 (0.00%) Xhosa 1 0 (0.00%)
Neolatín 1 2 (0.00%) Hindi 1 0 (0.00%)
Normando 3 2 (2.44%) Náhuatl de Tetelcingo 1 0 (0.00%)
Noruego 1 0 (0.00%) Galó 1 0 (0.00%)
Noruego bokmål 6 12 (1.64%) Japonés 1 20 (0.00%)
Noruego nynorsk 3 2 (0.00%) Coreano 1 2 (100.00%)
Náhuatl central 4 0 (0.00%) Mixteco del sur de Puebla 1 0 (0.00%)
Náhuatl clásico 2 0 (0.00%) Quechua ayacuchano 1 24 (100.00%)
Náhuatl clásico (Grafía normalizada) 2 0 (0.00%) Feroés 1 0 (0.00%)
Náhuatl de Durango 1 0 (0.00%) Hausa 1 0 (0.00%)
Náhuatl de Guerrero 2 0 (0.00%) Nórdico antiguo 1 0 (0.00%)
Náhuatl de Orizaba 1 0 (0.00%) Georgiano 1 0 (0.00%)
Náhuatl de Tetelcingo 1 0 (0.00%) Allentiac 1 0 (0.00%)
Náhuatl de la Huasteca central 2 0 (0.00%) Ainu 1 2 (100.00%)
Náhuatl de la Huasteca occidental 2 0 (0.00%) Arrumano 1 0 (0.00%)
Náhuatl de la Huasteca oriental 2 0 (0.00%) Istriorrumano 1 0 (0.00%)
Náhuatl del norte de Puebla 1 0 (0.00%) Charrúa 1 2 (100.00%)
Nórdico antiguo 1 0 (0.00%) Maorí 1 0 (0.00%)
Occitano 5 0 (0.00%) Inuktitut (Alfabeto latino) 1 0 (0.00%)
Persa 2 0 (0.00%) Limburgués 1 0 (0.00%)
Polaco 9 0 (0.00%) Búlgaro 1 0 (0.00%)
Portugués 9 2 (0.31%) Córnico (Standard Written Form) 1 0 (0.00%)
Protoindoeuropeo 1 2 (0.00%) Mochica 1 2 (0.00%)
Provenzal antiguo 1 0 (0.00%) Inglés antiguo 1 0 (0.00%)
Prusiano antiguo 1 0 (0.00%) Español (Ortografía de Bello) 1 0 (0.00%)
Quechua ayacuchano 1 24 (100.00%) Córnico (Kernewek Kemmyn) 1 0 (0.00%)
Quechua cuzqueño 2 24 (66.67%) Chewa 1 0 (0.00%)
Quechua de Huaylas Ancash 2 0 (0.00%) Hebreo antiguo 1 0 (0.00%)
Rapa nui 2 6 (8.00%) Arameo 1 0 (0.00%)
Romanche 2 0 (0.00%) Ídish 1 0 (0.00%)
Romaní (macrolengua) 1 0 (0.00%) Avéstico 1 0 (0.00%)
Rumano 11 20 (20.95%) Urdu 1 0 (0.00%)
Ruso 6 2 (23.08%) Amárico 1 0 (0.00%)
Sardo 2 0 (0.00%) Macedonio 1 0 (0.00%)
Serbocroata 1 0 (0.00%) Armenio 1 0 (0.00%)
Serbocroata (Cirílico) 1 0 (0.00%) Protoindoeuropeo 1 2 (0.00%)
Serbocroata (Círilico) 1 0 (0.00%) Serbocroata (Cirílico) 1 0 (0.00%)
Serbocroata (Latino) 1 0 (0.00%) Tehuelche 1 0 (0.00%)
Shawi 1 0 (0.00%) Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%)
Siciliano 3 2 (8.33%) Mapuche (Grafemario Azümchefe) 1 2 (100.00%)
Siríaco clásico 1 0 (0.00%) Latín vulgar 1 2 (0.00%)
Suajili 1 0 (0.00%) Noruego 1 0 (0.00%)
Sueco 20 8 (1.25%) Neoarameo asirio 1 0 (0.00%)
Tagalo 1 0 (0.00%) Siríaco clásico 1 0 (0.00%)
Tamil 2 0 (0.00%) Neolatín 1 2 (0.00%)
Tehuelche 1 0 (0.00%) Cupeño 1 0 (0.00%)
Telugú 2 2 (0.00%) Kikapú 1 0 (0.00%)
Translingüístico 1 4 (100.00%) Gǀwi 1 0 (0.00%)
Turco 1 0 (0.00%) Herero 1 0 (0.00%)
Ucraniano 1 0 (0.00%) Chaná 1 2 (100.00%)
Urdu 1 0 (0.00%) Mapuche (Alfabeto Unificado, Alfabeto Unificado) 1 2 (100.00%)
Valón 3 2 (5.56%) Mapuche (Ortografía de Luis de Valdivia) 1 0 (0.00%)
Vasco 8 0 (0.00%) Latín (Medieval) 1 2 (0.00%)
Volapuk 2 10 (98.25%) Judeoespañol (Ortografía turca) 1 0 (0.00%)
Véneto 3 0 (0.00%) Mapuche (Alfabeto Unificado, Grafemario Raguileo) 1 2 (100.00%)
Xhosa 1 0 (0.00%) Shawi 1 0 (0.00%)
unknown 1 0 (0.00%) Latín medieval 1 2 (0.00%)
Árabe 3 2 (7.14%) unknown 1 0 (0.00%)
Ídish 1 0 (0.00%) Serbocroata (Círilico) 1 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-12-05 from the eswiktionary dump dated 2025-12-02 using wiktextract (ddb1505 and 9905b1f). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.