Wiktionary data extraction errors and warnings

Inflection check

List of different kinds of inflection tables. When wiktextract parses word heads and tables, it assigns the forms it encounters with tags that describe grammatical or contextual information. The tags and forms that are found in head sections and tables are kept separate from other head section and table tags, and later they are merged with other heads and tables into table types that all contain the same number of word forms with the same tags for those forms.

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typoes and badly formated Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.

Language ⏶ Table forms Errors (% affected words) Language Table forms ⏷ Errors (% affected words)
Afrikáans 1 0 (0.00%) Latín 28 110 (0.05%)
Ainu 1 2 (100.00%) Español 20 110 (33.96%)
Albanés 3 20 (91.43%) Sueco 15 230 (99.89%)
Alemán 14 78 (1.03%) Alemán 14 78 (1.03%)
Alemán antiguo 1 0 (0.00%) Inglés 10 6 (3.93%)
Allentiac 1 0 (0.00%) Francés 9 4 (3.45%)
Amárico 1 0 (0.00%) Rumano 9 104 (21.90%)
Aragonés 3 0 (0.00%) Polaco 8 14 (0.00%)
Arameo 1 0 (0.00%) Italiano 8 46 (5.40%)
Armenio 1 0 (0.00%) Neerlandés 8 18 (32.47%)
Arrumano 1 0 (0.00%) Gaélico escocés 8 32 (3.85%)
Asturiano 6 2 (3.30%) Irlandés 8 76 (86.47%)
Avéstico 1 0 (0.00%) Vasco 7 226 (27.54%)
Azerí 2 0 (0.00%) Checo 7 70 (0.00%)
Bretón 5 20 (26.64%) Asturiano 6 2 (3.30%)
Búlgaro 1 0 (0.00%) Ruso 6 12 (16.67%)
Caló 3 2 (25.00%) Griego 6 34 (7.32%)
Castellano antiguo 2 0 (0.00%) Griego antiguo 6 32 (37.25%)
Catalán 5 2 (0.85%) Catalán 5 2 (0.85%)
Catalán antiguo 1 0 (0.00%) Bretón 5 20 (26.64%)
Chaná 1 2 (100.00%) Portugués 5 2 (1.49%)
Charrúa 1 2 (100.00%) Manés 5 32 (9.76%)
Checo 7 70 (0.00%) Danés 5 36 (96.72%)
Cheroqui 1 0 (0.00%) Noruego bokmål 5 16 (3.23%)
Chewa 1 0 (0.00%) Francés antiguo 5 2 (1.94%)
Coreano 1 2 (100.00%) Irlandés antiguo 5 24 (83.33%)
Corso 1 0 (0.00%) Eslovaco 5 40 (0.00%)
Cupeño 1 0 (0.00%) Galés 4 18 (33.33%)
Córnico 1 0 (0.00%) Francés medio 4 2 (3.97%)
Córnico (Kernewek Kemmyn) 1 0 (0.00%) Náhuatl central 4 0 (0.00%)
Córnico (Standard Written Form) 1 0 (0.00%) Albanés 3 20 (91.43%)
Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%) Esperanto 3 0 (0.00%)
Córnico (Standard Written Form, Kernewek Kemmyn, Unified Cornish Revised) 1 0 (0.00%) Gallego 3 2 (0.66%)
Danés 5 36 (96.72%) Noruego nynorsk 3 12 (2.38%)
Deg xinag 1 0 (0.00%) Aragonés 3 0 (0.00%)
Dálmata 2 2 (6.67%) Judeoespañol 3 2 (0.82%)
Emiliano-romañol 2 2 (66.67%) Mapuche 3 4 (99.44%)
Escocés 2 10 (10.00%) Esloveno 3 36 (72.73%)
Eslovaco 5 40 (0.00%) Siciliano 3 2 (8.33%)
Esloveno 3 36 (72.73%) Normando 3 2 (2.44%)
Español 20 110 (33.96%) Valón 3 2 (2.86%)
Español (Ortografía de Bello) 1 0 (0.00%) Islandés 3 16 (83.33%)
Esperanto 3 0 (0.00%) Hebreo 3 2 (14.71%)
Extremeño 2 0 (0.00%) Gótico 3 8 (0.00%)
Feroés 1 0 (0.00%) Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 3 4 (95.24%)
Finés 1 0 (0.00%) Caló 3 2 (25.00%)
Francés 9 4 (3.45%) Árabe 3 2 (6.67%)
Francés antiguo 5 2 (1.94%) Japonés (romaji) 2 72 (66.67%)
Francés medio 4 2 (3.97%) Volapuk 2 10 (96.23%)
Frisón 1 0 (0.00%) Interlingua 2 4 (4.76%)
Friulano 2 2 (4.55%) Mapuche (Grafemario Raguileo, Grafemario Azümchefe, Alfabeto Unificado) 2 4 (100.00%)
Galaicoportugués 2 0 (0.00%) Lituano 2 0 (0.00%)
Gallego 3 2 (0.66%) Maya yucateco 2 0 (0.00%)
Galés 4 18 (33.33%) Náhuatl de la Huasteca occidental 2 0 (0.00%)
Galó 1 0 (0.00%) Castellano antiguo 2 0 (0.00%)
Gaélico escocés 8 32 (3.85%) Azerí 2 0 (0.00%)
Georgiano 1 0 (0.00%) Inglés medio 2 2 (16.67%)
Griego 6 34 (7.32%) Occitano 2 0 (0.00%)
Griego antiguo 6 32 (37.25%) Náhuatl de la Huasteca central 2 0 (0.00%)
Groenlandés 1 0 (0.00%) Romanche 2 0 (0.00%)
Guaraní 1 4 (100.00%) Turco 2 2 (8.33%)
Gótico 3 8 (0.00%) Náhuatl clásico 2 0 (0.00%)
Haida 1 4 (100.00%) Náhuatl de la Huasteca oriental 2 0 (0.00%)
Hausa 1 0 (0.00%) Quechua cuzqueño 2 36 (66.67%)
Hawaiano 1 0 (0.00%) Klingon 2 2 (1.26%)
Hebreo 3 2 (14.71%) Escocés 2 10 (10.00%)
Hebreo antiguo 1 0 (0.00%) Véneto 2 0 (0.00%)
Herero 1 0 (0.00%) Náhuatl de Guerrero 2 0 (0.00%)
Hindi 1 0 (0.00%) Mapuche (Alfabeto Unificado) 2 4 (100.00%)
Húngaro 1 0 (0.00%) Italiano antiguo 2 0 (0.00%)
Ido 1 0 (0.00%) Persa 2 0 (0.00%)
Inglés 10 6 (3.93%) Friulano 2 2 (4.55%)
Inglés antiguo 1 0 (0.00%) Galaicoportugués 2 0 (0.00%)
Inglés medio 2 2 (16.67%) Extremeño 2 0 (0.00%)
Interlingua 2 4 (4.76%) Maltés 2 0 (0.00%)
Inuktitut 1 0 (0.00%) Sardo 2 0 (0.00%)
Inuktitut (alfabeto latino) 1 0 (0.00%) Dálmata 2 2 (6.67%)
Irlandés 8 76 (86.47%) Rapa nui 2 20 (8.00%)
Irlandés antiguo 5 24 (83.33%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe) 2 4 (100.00%)
Islandés 3 16 (83.33%) Emiliano-romañol 2 2 (66.67%)
Istriorrumano 1 0 (0.00%) Mapuche (Grafemario Raguileo) 2 4 (100.00%)
Italiano 8 46 (5.40%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Grafemario Raguileo) 2 4 (100.00%)
Italiano antiguo 2 0 (0.00%) Tamil 2 2 (16.67%)
Japonés 1 72 (100.00%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Grafemario Azümchefe, Alfabeto Unificado) 2 4 (100.00%)
Japonés (Romaji) 1 72 (100.00%) Mapuche (Grafemario Raguileo, Grafemario Raguileo) 2 4 (100.00%)
Japonés (hiragana) 1 72 (100.00%) Frisón 1 0 (0.00%)
Japonés (mixta) 1 8 (100.00%) Córnico 1 0 (0.00%)
Japonés (romaji) 2 72 (66.67%) Hawaiano 1 0 (0.00%)
Judeoespañol 3 2 (0.82%) Napolitano 1 0 (0.00%)
Kikapú 1 0 (0.00%) Finés 1 0 (0.00%)
Klingon 2 2 (1.26%) Afrikáans 1 0 (0.00%)
Ladino 1 0 (0.00%) Prusiano antiguo 1 0 (0.00%)
Latín 28 110 (0.05%) Navarro-aragonés 1 0 (0.00%)
Latín (medieval) 1 2 (0.00%) Guaraní 1 4 (100.00%)
Leonés antiguo 1 0 (0.00%) Inuktitut 1 0 (0.00%)
Letón 1 14 (0.00%) Cheroqui 1 0 (0.00%)
Ligur 1 0 (0.00%) Navajo 1 0 (0.00%)
Limburgués 1 0 (0.00%) Húngaro 1 0 (0.00%)
Lituano 2 0 (0.00%) Serbocroata (latino) 1 0 (0.00%)
Luxemburgués 1 0 (0.00%) Xhosa 1 0 (0.00%)
Macedonio 1 0 (0.00%) Haida 1 4 (100.00%)
Malgache 1 8 (100.00%) Hindi 1 0 (0.00%)
Maltés 2 0 (0.00%) Serbocroata 1 0 (0.00%)
Manés 5 32 (9.76%) Alemán antiguo 1 0 (0.00%)
Maorí 1 0 (0.00%) Náhuatl de Tetelcingo 1 0 (0.00%)
Mapuche 3 4 (99.44%) Leonés antiguo 1 0 (0.00%)
Mapuche (Alfabeto Unificado) 2 4 (100.00%) Translingüístico 1 4 (100.00%)
Mapuche (Alfabeto Unificado, Alfabeto Unificado) 1 2 (100.00%) Náhuatl de Orizaba 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe) 2 4 (100.00%) Japonés 1 72 (100.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Alfabeto Unificado) 1 2 (100.00%) Suajili 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Grafemario Azümchefe, Alfabeto Unificado) 2 4 (100.00%) Mirandés 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Grafemario Raguileo) 2 4 (100.00%) Quechua ayacuchano 1 36 (100.00%)
Mapuche (Alfabeto Unificado, Grafemario Raguileo) 1 2 (100.00%) Quechua de Huaylas Ancash 1 0 (0.00%)
Mapuche (Alfabeto Unificado, Grafemario Raguileo, Alfabeto Unificado) 1 2 (100.00%) Corso 1 0 (0.00%)
Mapuche (Grafemario Azümchefe) 1 2 (100.00%) Náhuatl del norte de Puebla 1 0 (0.00%)
Mapuche (Grafemario Azümchefe, Alfabeto Unificado) 3 4 (95.24%) Ido 1 0 (0.00%)
Mapuche (Grafemario Azümchefe, Grafemario Azümchefe) 1 2 (100.00%) Coreano 1 2 (100.00%)
Mapuche (Grafemario Raguileo) 2 4 (100.00%) Náhuatl de Durango 1 0 (0.00%)
Mapuche (Grafemario Raguileo, Grafemario Azümchefe, Alfabeto Unificado) 2 4 (100.00%) Provenzal antiguo 1 0 (0.00%)
Mapuche (Grafemario Raguileo, Grafemario Raguileo) 2 4 (100.00%) Catalán antiguo 1 0 (0.00%)
Maya yucateco 2 0 (0.00%) Ladino 1 0 (0.00%)
Mirandés 1 0 (0.00%) Galó 1 0 (0.00%)
Mixteco del sur de Puebla 1 0 (0.00%) Tagalo 1 0 (0.00%)
Napolitano 1 0 (0.00%) Feroés 1 0 (0.00%)
Navajo 1 0 (0.00%) Nórdico antiguo 1 0 (0.00%)
Navarro-aragonés 1 0 (0.00%) Luxemburgués 1 0 (0.00%)
Neerlandés 8 18 (32.47%) Malgache 1 8 (100.00%)
Neoarameo asirio 1 0 (0.00%) Japonés (hiragana) 1 72 (100.00%)
Normando 3 2 (2.44%) Georgiano 1 0 (0.00%)
Noruego bokmål 5 16 (3.23%) Japonés (Romaji) 1 72 (100.00%)
Noruego nynorsk 3 12 (2.38%) Romaní (macrolengua) 1 0 (0.00%)
Náhuatl central 4 0 (0.00%) Ligur 1 0 (0.00%)
Náhuatl clásico 2 0 (0.00%) Hausa 1 0 (0.00%)
Náhuatl de Durango 1 0 (0.00%) Córnico (Standard Written Form, Kernewek Kemmyn, Unified Cornish Revised) 1 0 (0.00%)
Náhuatl de Guerrero 2 0 (0.00%) Groenlandés 1 0 (0.00%)
Náhuatl de Orizaba 1 0 (0.00%) Mixteco del sur de Puebla 1 0 (0.00%)
Náhuatl de Tetelcingo 1 0 (0.00%) Allentiac 1 0 (0.00%)
Náhuatl de la Huasteca central 2 0 (0.00%) Letón 1 14 (0.00%)
Náhuatl de la Huasteca occidental 2 0 (0.00%) Ucraniano 1 0 (0.00%)
Náhuatl de la Huasteca oriental 2 0 (0.00%) Inuktitut (alfabeto latino) 1 0 (0.00%)
Náhuatl del norte de Puebla 1 0 (0.00%) Charrúa 1 2 (100.00%)
Nórdico antiguo 1 0 (0.00%) Maorí 1 0 (0.00%)
Occitano 2 0 (0.00%) Arrumano 1 0 (0.00%)
Persa 2 0 (0.00%) Istriorrumano 1 0 (0.00%)
Polaco 8 14 (0.00%) Búlgaro 1 0 (0.00%)
Portugués 5 2 (1.49%) Córnico (Standard Written Form) 1 0 (0.00%)
Provenzal antiguo 1 0 (0.00%) Inglés antiguo 1 0 (0.00%)
Prusiano antiguo 1 0 (0.00%) Japonés (mixta) 1 8 (100.00%)
Quechua ayacuchano 1 36 (100.00%) Español (Ortografía de Bello) 1 0 (0.00%)
Quechua cuzqueño 2 36 (66.67%) Limburgués 1 0 (0.00%)
Quechua de Huaylas Ancash 1 0 (0.00%) Chewa 1 0 (0.00%)
Rapa nui 2 20 (8.00%) Hebreo antiguo 1 0 (0.00%)
Romanche 2 0 (0.00%) Arameo 1 0 (0.00%)
Romaní (macrolengua) 1 0 (0.00%) Ídish 1 0 (0.00%)
Rumano 9 104 (21.90%) Avéstico 1 0 (0.00%)
Ruso 6 12 (16.67%) Urdu 1 0 (0.00%)
Sardo 2 0 (0.00%) Amárico 1 0 (0.00%)
Serbocroata 1 0 (0.00%) Armenio 1 0 (0.00%)
Serbocroata (cirílico) 1 0 (0.00%) Macedonio 1 0 (0.00%)
Serbocroata (latino) 1 0 (0.00%) Ainu 1 2 (100.00%)
Shawi 1 0 (0.00%) Serbocroata (cirílico) 1 0 (0.00%)
Siciliano 3 2 (8.33%) Tehuelche 1 0 (0.00%)
Siríaco clásico 1 0 (0.00%) Córnico (Kernewek Kemmyn) 1 0 (0.00%)
Suajili 1 0 (0.00%) Córnico (Standard Written Form, Kernewek Kemmyn) 1 0 (0.00%)
Sueco 15 230 (99.89%) Mapuche (Grafemario Azümchefe) 1 2 (100.00%)
Tagalo 1 0 (0.00%) Mapuche (Grafemario Azümchefe, Grafemario Azümchefe) 1 2 (100.00%)
Tamil 2 2 (16.67%) Mapuche (Alfabeto Unificado, Grafemario Azümchefe, Alfabeto Unificado) 1 2 (100.00%)
Tehuelche 1 0 (0.00%) Neoarameo asirio 1 0 (0.00%)
Translingüístico 1 4 (100.00%) Siríaco clásico 1 0 (0.00%)
Turco 2 2 (8.33%) Mapuche (Alfabeto Unificado, Grafemario Raguileo, Alfabeto Unificado) 1 2 (100.00%)
Ucraniano 1 0 (0.00%) Cupeño 1 0 (0.00%)
Urdu 1 0 (0.00%) Kikapú 1 0 (0.00%)
Valón 3 2 (2.86%) Herero 1 0 (0.00%)
Vasco 7 226 (27.54%) Chaná 1 2 (100.00%)
Volapuk 2 10 (96.23%) Mapuche (Alfabeto Unificado, Alfabeto Unificado) 1 2 (100.00%)
Véneto 2 0 (0.00%) Deg xinag 1 0 (0.00%)
Xhosa 1 0 (0.00%) Latín (medieval) 1 2 (0.00%)
Árabe 3 2 (6.67%) Mapuche (Alfabeto Unificado, Grafemario Raguileo) 1 2 (100.00%)
Ídish 1 0 (0.00%) Shawi 1 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2024-05-31 from the eswiktionary dump dated 2024-05-02 using wiktextract (91e95e7 and db5a844). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.