Wiktionary data extraction errors and warnings

Inflection check

List of different kinds of inflection tables. When wiktextract parses word heads and tables, it assigns the forms it encounters with tags that describe grammatical or contextual information. The tags and forms that are found in head sections and tables are kept separate from other head section and table tags, and later they are merged with other heads and tables into table types that all contain the same number of word forms with the same tags for those forms.

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typoes and badly formated Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.

Language ⏶ Table forms Errors (% affected words) Language Table forms ⏷ Errors (% affected words)
Afrikáans 2 0 (0.00%) Latín 33 80 (0.00%)
Ainu 1 2 (100.00%) Español 31 54 (0.03%)
Albanés 3 0 (0.00%) Alemán 24 18 (1.17%)
Alemán 24 18 (1.17%) Sueco 18 8 (1.25%)
Alemán antiguo 1 0 (0.00%) Irlandés 15 176 (1.96%)
Allentiac 1 0 (0.00%) Inglés 13 16 (3.19%)
Amárico 1 0 (0.00%) Checo 13 12 (0.00%)
Aragonés 2 0 (0.00%) Gaélico escocés 11 32 (1.54%)
Arameo 1 0 (0.00%) Francés 10 4 (3.58%)
Armenio 1 0 (0.00%) Eslovaco 10 20 (0.00%)
Arrumano 1 0 (0.00%) Rumano 10 20 (20.87%)
Asturiano 6 2 (3.19%) Italiano 9 24 (5.17%)
Avéstico 1 0 (0.00%) Polaco 9 0 (0.00%)
Azerí 2 0 (0.00%) Portugués 9 2 (1.39%)
Bretón 6 18 (30.15%) Vasco 8 0 (0.00%)
Búlgaro 1 0 (0.00%) Neerlandés 8 12 (27.24%)
Caló 2 0 (0.00%) Manés 7 32 (7.32%)
Castellano antiguo 4 0 (0.00%) Griego 7 2 (1.60%)
Catalán 5 2 (0.90%) Asturiano 6 2 (3.19%)
Catalán antiguo 1 0 (0.00%) Bretón 6 18 (30.15%)
Chaná 1 2 (100.00%) Noruego bokmål 6 12 (1.64%)
Charrúa 1 2 (100.00%) Ruso 6 2 (23.08%)
Checo 13 12 (0.00%) Griego antiguo 6 0 (0.00%)
Cheroqui 1 0 (0.00%) Catalán 5 2 (0.90%)
Chewa 1 0 (0.00%) Francés antiguo 5 2 (1.92%)
Coreano 1 2 (100.00%) Danés 5 6 (4.88%)
Corso 1 0 (0.00%) Irlandés antiguo 5 0 (0.00%)
Cupeño 1 0 (0.00%) Galés 4 18 (33.33%)
Córnico 1 0 (0.00%) Castellano antiguo 4 0 (0.00%)
Córnico (Kernewek Kemmyn) 1 0 (0.00%) Francés medio 4 2 (3.95%)
Córnico (Standard Written Form) 1 0 (0.00%) Náhuatl central 4 0 (0.00%)
Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%) Hebreo 4 2 (13.02%)
Danés 5 6 (4.88%) Maya yucateco 3 36 (0.00%)
Dálmata 2 2 (6.67%) Esloveno 3 12 (0.00%)
Emiliano-romañol 2 2 (66.67%) Gallego 3 2 (0.73%)
Escocés 2 8 (0.00%) Interlingua 3 4 (4.35%)
Eslovaco 10 20 (0.00%) Islandés 3 0 (0.00%)
Esloveno 3 12 (0.00%) Occitano 3 0 (0.00%)
Español 31 54 (0.03%) Náhuatl clásico 3 0 (0.00%)
Español (Ortografía de Bello) 1 0 (0.00%) Judeoespañol 3 2 (0.81%)
Esperanto 3 0 (0.00%) Valón 3 2 (2.78%)
Extremeño 2 0 (0.00%) Esperanto 3 0 (0.00%)
Feroés 1 0 (0.00%) Noruego nynorsk 3 2 (0.00%)
Finés 1 0 (0.00%) Normando 3 2 (2.47%)
Francés 10 4 (3.58%) Guaraní 3 4 (20.00%)
Francés antiguo 5 2 (1.92%) Albanés 3 0 (0.00%)
Francés medio 4 2 (3.95%) Siciliano 3 2 (8.33%)
Frisón 1 0 (0.00%) Klingon 3 2 (1.13%)
Friulano 2 2 (4.55%) Náhuatl clásico (Grafía normalizada) 3 0 (0.00%)
Galaicoportugués 2 0 (0.00%) Quechua cuzqueño 3 24 (57.14%)
Gallego 3 2 (0.73%) Gótico 3 0 (0.00%)
Galés 4 18 (33.33%) Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 3 4 (95.24%)
Galó 1 0 (0.00%) Árabe 3 2 (6.67%)
Gaélico escocés 11 32 (1.54%) Náhuatl de la Huasteca central 2 0 (0.00%)
Georgiano 1 0 (0.00%) Afrikáans 2 0 (0.00%)
Griego 7 2 (1.60%) Azerí 2 0 (0.00%)
Griego antiguo 6 0 (0.00%) Escocés 2 8 (0.00%)
Groenlandés 1 0 (0.00%) Japonés (Romaji) 2 20 (0.00%)
Guaraní 3 4 (20.00%) Lituano 2 0 (0.00%)
Gótico 3 0 (0.00%) Náhuatl de la Huasteca oriental 2 0 (0.00%)
Gǀwi 1 0 (0.00%) Aragonés 2 0 (0.00%)
Haida 1 0 (0.00%) Extremeño 2 0 (0.00%)
Hausa 1 0 (0.00%) Inglés medio 2 2 (10.00%)
Hawaiano 1 0 (0.00%) Náhuatl de la Huasteca occidental 2 0 (0.00%)
Hebreo 4 2 (13.02%) Volapuk 2 10 (98.25%)
Hebreo antiguo 1 0 (0.00%) Friulano 2 2 (4.55%)
Herero 1 0 (0.00%) Galaicoportugués 2 0 (0.00%)
Hindi 1 0 (0.00%) Véneto 2 0 (0.00%)
Húngaro 1 0 (0.00%) Italiano antiguo 2 0 (0.00%)
Ido 1 0 (0.00%) Romanche 2 0 (0.00%)
Inglés 13 16 (3.19%) Sardo 2 0 (0.00%)
Inglés antiguo 1 0 (0.00%) Mapuche (Alfabeto Unificado) 2 4 (100.00%)
Inglés medio 2 2 (10.00%) Maltés 2 0 (0.00%)
Interlingua 3 4 (4.35%) Náhuatl de Guerrero 2 0 (0.00%)
Inuktitut 1 0 (0.00%) Persa 2 0 (0.00%)
Inuktitut (Alfabeto latino) 1 0 (0.00%) Mapuche 2 4 (100.00%)
Irlandés 15 176 (1.96%) Quechua de Huaylas Ancash 2 0 (0.00%)
Irlandés antiguo 5 0 (0.00%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe) 2 4 (100.00%)
Islandés 3 0 (0.00%) Dálmata 2 2 (6.67%)
Istriorrumano 1 0 (0.00%) Japonés (Hiragana) 2 20 (0.00%)
Italiano 9 24 (5.17%) Rapa nui 2 6 (8.00%)
Italiano antiguo 2 0 (0.00%) Japonés (Mixta) 2 20 (0.00%)
Japonés 1 20 (0.00%) Caló 2 0 (0.00%)
Japonés (Hiragana) 2 20 (0.00%) Emiliano-romañol 2 2 (66.67%)
Japonés (Mixta) 2 20 (0.00%) Mapuche (Grafemario Raguileo) 2 4 (100.00%)
Japonés (Romaji) 2 20 (0.00%) Tamil 2 0 (0.00%)
Judeoespañol 3 2 (0.81%) Mapuche (Grafemario Raguileo, Grafemario Raguileo) 2 4 (100.00%)
Kawésqar 1 0 (0.00%) Finés 1 0 (0.00%)
Kikapú 1 0 (0.00%) Húngaro 1 0 (0.00%)
Klingon 3 2 (1.13%) Luxemburgués 1 0 (0.00%)
Ladino 1 0 (0.00%) Malgache 1 2 (0.00%)
Latín 33 80 (0.00%) Serbocroata (Latino) 1 0 (0.00%)
Latín (Medieval) 1 2 (0.00%) Tagalo 1 0 (0.00%)
Latín medieval 1 2 (0.00%) Turco 1 0 (0.00%)
Latín vulgar 1 2 (0.00%) Navarro-aragonés 1 0 (0.00%)
Leonés antiguo 1 0 (0.00%) Frisón 1 0 (0.00%)
Letón 1 14 (0.00%) Provenzal antiguo 1 0 (0.00%)
Ligur 1 0 (0.00%) Córnico 1 0 (0.00%)
Limburgués 1 0 (0.00%) Catalán antiguo 1 0 (0.00%)
Lingua franca nova 1 0 (0.00%) Yagán 1 0 (0.00%)
Lituano 2 0 (0.00%) Lingua franca nova 1 0 (0.00%)
Luxemburgués 1 0 (0.00%) Napolitano 1 0 (0.00%)
Macedonio 1 0 (0.00%) Hawaiano 1 0 (0.00%)
Malgache 1 2 (0.00%) Letón 1 14 (0.00%)
Maltés 2 0 (0.00%) Mirandés 1 0 (0.00%)
Manés 7 32 (7.32%) Romaní (macrolengua) 1 0 (0.00%)
Maorí 1 0 (0.00%) Serbocroata 1 0 (0.00%)
Mapuche 2 4 (100.00%) Ladino 1 0 (0.00%)
Mapuche (Alfabeto Unificado) 2 4 (100.00%) Groenlandés 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Alfabeto Unificado) 1 2 (100.00%) Náhuatl de Orizaba 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe) 2 4 (100.00%) Suajili 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Raguileo) 1 2 (100.00%) Prusiano antiguo 1 0 (0.00%)
Mapuche (Grafemario Azümchefe) 1 2 (100.00%) Ido 1 0 (0.00%)
Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 3 4 (95.24%) Corso 1 0 (0.00%)
Mapuche (Grafemario Raguileo) 2 4 (100.00%) Alemán antiguo 1 0 (0.00%)
Mapuche (Grafemario Raguileo, Grafemario Raguileo) 2 4 (100.00%) Leonés antiguo 1 0 (0.00%)
Mapuche (Ortografía de Luis de Valdivia) 1 0 (0.00%) Ligur 1 0 (0.00%)
Maya yucateco 3 36 (0.00%) Náhuatl del norte de Puebla 1 0 (0.00%)
Mirandés 1 0 (0.00%) Náhuatl de Durango 1 0 (0.00%)
Mixteco del sur de Puebla 1 0 (0.00%) Navajo 1 0 (0.00%)
Mochica 1 2 (0.00%) Cheroqui 1 0 (0.00%)
Napolitano 1 0 (0.00%) Ucraniano 1 0 (0.00%)
Navajo 1 0 (0.00%) Inuktitut 1 0 (0.00%)
Navarro-aragonés 1 0 (0.00%) Haida 1 0 (0.00%)
Neerlandés 8 12 (27.24%) Xhosa 1 0 (0.00%)
Neoarameo asirio 1 0 (0.00%) Hindi 1 0 (0.00%)
Neolatín 1 2 (0.00%) Náhuatl de Tetelcingo 1 0 (0.00%)
Normando 3 2 (2.47%) Translingüístico 1 2 (100.00%)
Noruego 1 0 (0.00%) Toki pona 1 0 (0.00%)
Noruego bokmål 6 12 (1.64%) Galó 1 0 (0.00%)
Noruego nynorsk 3 2 (0.00%) Japonés 1 20 (0.00%)
Náhuatl central 4 0 (0.00%) Coreano 1 2 (100.00%)
Náhuatl clásico 3 0 (0.00%) Mixteco del sur de Puebla 1 0 (0.00%)
Náhuatl clásico (Grafía normalizada) 3 0 (0.00%) Quechua ayacuchano 1 24 (100.00%)
Náhuatl de Durango 1 0 (0.00%) Feroés 1 0 (0.00%)
Náhuatl de Guerrero 2 0 (0.00%) Hausa 1 0 (0.00%)
Náhuatl de Orizaba 1 0 (0.00%) Nórdico antiguo 1 0 (0.00%)
Náhuatl de Tetelcingo 1 0 (0.00%) Georgiano 1 0 (0.00%)
Náhuatl de la Huasteca central 2 0 (0.00%) Allentiac 1 0 (0.00%)
Náhuatl de la Huasteca occidental 2 0 (0.00%) Ainu 1 2 (100.00%)
Náhuatl de la Huasteca oriental 2 0 (0.00%) Arrumano 1 0 (0.00%)
Náhuatl del norte de Puebla 1 0 (0.00%) Istriorrumano 1 0 (0.00%)
Nórdico antiguo 1 0 (0.00%) Charrúa 1 2 (100.00%)
Occitano 3 0 (0.00%) Maorí 1 0 (0.00%)
Persa 2 0 (0.00%) Inuktitut (Alfabeto latino) 1 0 (0.00%)
Polaco 9 0 (0.00%) Limburgués 1 0 (0.00%)
Portugués 9 2 (1.39%) Kawésqar 1 0 (0.00%)
Protoindoeuropeo 1 2 (0.00%) Búlgaro 1 0 (0.00%)
Provenzal antiguo 1 0 (0.00%) Córnico (Standard Written Form) 1 0 (0.00%)
Prusiano antiguo 1 0 (0.00%) Mochica 1 2 (0.00%)
Quechua ayacuchano 1 24 (100.00%) Inglés antiguo 1 0 (0.00%)
Quechua cuzqueño 3 24 (57.14%) Español (Ortografía de Bello) 1 0 (0.00%)
Quechua de Huaylas Ancash 2 0 (0.00%) Córnico (Kernewek Kemmyn) 1 0 (0.00%)
Rapa nui 2 6 (8.00%) Chewa 1 0 (0.00%)
Romanche 2 0 (0.00%) Íbero 1 0 (0.00%)
Romaní (macrolengua) 1 0 (0.00%) Hebreo antiguo 1 0 (0.00%)
Rumano 10 20 (20.87%) Arameo 1 0 (0.00%)
Ruso 6 2 (23.08%) Ídish 1 0 (0.00%)
Sardo 2 0 (0.00%) Avéstico 1 0 (0.00%)
Serbocroata 1 0 (0.00%) Urdu 1 0 (0.00%)
Serbocroata (Cirílico) 1 0 (0.00%) Amárico 1 0 (0.00%)
Serbocroata (Círilico) 1 0 (0.00%) Macedonio 1 0 (0.00%)
Serbocroata (Latino) 1 0 (0.00%) Armenio 1 0 (0.00%)
Shawi 1 0 (0.00%) Telugú 1 2 (0.00%)
Siciliano 3 2 (8.33%) Protoindoeuropeo 1 2 (0.00%)
Siríaco clásico 1 0 (0.00%) Serbocroata (Cirílico) 1 0 (0.00%)
Suajili 1 0 (0.00%) Tehuelche 1 0 (0.00%)
Sueco 18 8 (1.25%) Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%)
Tagalo 1 0 (0.00%) Mapuche (Grafemario Azümchefe) 1 2 (100.00%)
Tamil 2 0 (0.00%) Latín vulgar 1 2 (0.00%)
Tehuelche 1 0 (0.00%) Noruego 1 0 (0.00%)
Telugú 1 2 (0.00%) Neoarameo asirio 1 0 (0.00%)
Toki pona 1 0 (0.00%) Siríaco clásico 1 0 (0.00%)
Translingüístico 1 2 (100.00%) Neolatín 1 2 (0.00%)
Turco 1 0 (0.00%) Cupeño 1 0 (0.00%)
Ucraniano 1 0 (0.00%) Kikapú 1 0 (0.00%)
Urdu 1 0 (0.00%) Gǀwi 1 0 (0.00%)
Valón 3 2 (2.78%) Herero 1 0 (0.00%)
Vasco 8 0 (0.00%) Chaná 1 2 (100.00%)
Volapuk 2 10 (98.25%) Mapuche (Alfabeto Unificado, Alfabeto Unificado) 1 2 (100.00%)
Véneto 2 0 (0.00%) Mapuche (Ortografía de Luis de Valdivia) 1 0 (0.00%)
Xhosa 1 0 (0.00%) Latín (Medieval) 1 2 (0.00%)
Yagán 1 0 (0.00%) unknown 1 0 (0.00%)
unknown 1 0 (0.00%) Mapuche (Alfabeto Unificado, Grafemario Raguileo) 1 2 (100.00%)
Árabe 3 2 (6.67%) Shawi 1 0 (0.00%)
Íbero 1 0 (0.00%) Latín medieval 1 2 (0.00%)
Ídish 1 0 (0.00%) Serbocroata (Círilico) 1 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-07-14 from the eswiktionary dump dated 2025-07-02 using wiktextract (6dade95 and f1c2b61). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.