Wiktionary data extraction errors and warnings

Inflection check

List of different kinds of inflection tables. When wiktextract parses word heads and tables, it assigns the forms it encounters with tags that describe grammatical or contextual information. The tags and forms that are found in head sections and tables are kept separate from other head section and table tags, and later they are merged with other heads and tables into table types that all contain the same number of word forms with the same tags for those forms.

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typoes and badly formated Wiktionary entries. A table type that has only a few unique instances is quite likely to contain some kind of minor error in the original data.

Language ⏶ Table forms Errors (% affected words) Language Table forms ⏷ Errors (% affected words)
Abkhaz 1 2 (100.00%) Ancient Greek 23 6884 (99.88%)
Adyghe 1 2 (100.00%) Greek 17 2290 (61.91%)
Afar 1 2 (100.00%) English 10 80 (99.37%)
Afrikaans 1 2 (100.00%) French 7 36 (99.39%)
Albanian 1 2 (100.00%) German 6 408 (99.98%)
Issues with parsing articles inside parentheses.
Alemannic 1 2 (100.00%) Translingual 6 36 (99.19%)
Algonquin 1 2 (100.00%) Azerbaijani 6 104 (72.27%)
Amharic 1 2 (100.00%) Turkish 6 2746 (99.98%)
Ancient Egyptian 1 2 (100.00%) Latin 5 236 (100.00%)
Ancient Greek 23 6884 (99.88%) Italian 5 22 (100.00%)
Anglo-Norman 1 2 (100.00%) Old French 5 22 (93.29%)
Arabic 1 2 (100.00%) Spanish 5 274 (99.97%)
Aragonese 1 2 (100.00%) Portuguese 5 24 (99.89%)
Aramaic 1 6 (100.00%) Esperanto 5 28 (99.41%)
Armenian 2 4 (15.44%) Romanian 4 70 (99.85%)
Aromanian 2 4 (85.29%) Slovak 4 58 (99.88%)
Arpitan 1 2 (100.00%) Croatian 3 34 (99.79%)
Assamese 1 2 (100.00%) Japanese 3 12 (100.00%)
Asturian 1 2 (100.00%) Russian 3 60 (99.80%)
Aymara 1 2 (100.00%) Cretan Greek 3 64 (97.06%)
Azerbaijani 6 104 (72.27%) Dutch 2 4 (99.99%)
Bambara 1 2 (100.00%) Polish 2 60 (100.00%)
Bangla 1 2 (100.00%) Medieval Greek 2 6 (98.83%)
Bashkir 1 4 (100.00%) Swedish 2 4 (100.00%)
Basque 1 2 (100.00%) Norwegian 2 4 (99.98%)
Bavarian 1 2 (100.00%) Aromanian 2 4 (85.29%)
Belarusian 1 2 (100.00%) Ido 2 12 (100.00%)
Bemba 1 2 (100.00%) Serbian 2 4 (99.94%)
Biblical Hebrew 1 2 (100.00%) Serbo-Croatian 2 32 (100.00%)
Bikol Central 1 2 (100.00%) Cypriot Greek 2 4 (96.39%)
Bislama 1 2 (100.00%) Lombard 2 10 (100.00%)
Bosnian 1 2 (100.00%) Bulgarian 2 4 (99.75%)
Breton 1 2 (100.00%) Chinese 2 4 (95.47%)
Bulgarian 2 4 (99.75%) Georgian 2 4 (99.69%)
Burmese 1 2 (100.00%) Ladino 2 8 (100.00%)
Cantonese 1 2 (100.00%) Armenian 2 4 (15.44%)
Cappadocian Greek 1 2 (100.00%) Uzbek 2 4 (96.97%)
Carpathian Rusyn 1 2 (100.00%) Kazakh 2 4 (1.11%)
Catalan 1 2 (100.00%) Ottoman Turkish 2 4 (98.55%)
Cebuano 1 2 (100.00%) Jamaican Creole 2 8 (100.00%)
Central Kurdish 1 2 (100.00%) Irish 1 2 (100.00%)
Chamorro 1 2 (100.00%) Scottish Gaelic 1 2 (100.00%)
Chechen 1 2 (100.00%) Finnish 1 2 (100.00%)
Cherokee 1 2 (100.00%) Danish 1 2 (100.00%)
Cheyenne 1 2 (100.00%) Albanian 1 2 (100.00%)
Chinese 2 4 (95.47%) Icelandic 1 2 (100.00%)
Church Slavic 1 2 (100.00%) Afrikaans 1 2 (100.00%)
Chuvash 1 2 (100.00%) Indonesian 1 2 (100.00%)
Classical Nahuatl 1 2 (100.00%) Latvian 1 2 (100.00%)
Colognian 1 2 (100.00%) Luxembourgish 1 2 (100.00%)
Coptic 1 2 (100.00%) Low German 1 2 (100.00%)
Cornish 1 2 (100.00%) Interlingua 1 2 (100.00%)
Corsican 1 2 (100.00%) Catalan 1 2 (100.00%)
Cretan Greek 3 64 (97.06%) Occitan 1 2 (100.00%)
Crimean Tatar 1 2 (100.00%) Czech 1 2 (100.00%)
Croatian 3 34 (99.79%) Papiamento 1 2 (100.00%)
Cypriot Arabic 1 4 (100.00%) Tsakonian 1 2 (100.00%)
Cypriot Greek 2 4 (96.39%) Hungarian 1 2 (100.00%)
Czech 1 2 (100.00%) Novial 1 2 (100.00%)
Danish 1 2 (100.00%) Narom 1 2 (100.00%)
Dhivehi 1 2 (100.00%) West Flemish 1 2 (100.00%)
Dimli 1 2 (100.00%) Slovene 1 2 (100.00%)
Dutch 2 4 (99.99%) Breton 1 2 (100.00%)
Dutch Low Saxon 1 2 (100.00%) Basque 1 2 (100.00%)
Dzongkha 1 4 (100.00%) Venetan 1 2 (100.00%)
Egyptian Arabic 1 2 (100.00%) Ligurian 1 2 (100.00%)
Elamite 1 4 (100.00%) Piedmontese 1 2 (100.00%)
English 10 80 (99.37%) West Frisian 1 2 (100.00%)
Esperanto 5 28 (99.41%) Classical Nahuatl 1 2 (100.00%)
Estonian 1 2 (100.00%) Estonian 1 2 (100.00%)
Ewe 1 2 (100.00%) Norwegian Nynorsk 1 2 (100.00%)
Extremaduran 1 2 (100.00%) Manx 1 2 (100.00%)
Faroese 1 2 (100.00%) Sardinian 1 2 (100.00%)
Fiji Hindi 1 2 (100.00%) Welsh 1 2 (100.00%)
Fijian 1 2 (100.00%) Bosnian 1 2 (100.00%)
Finnish 1 2 (100.00%) Kurdish 1 2 (100.00%)
French 7 36 (99.39%) Limburgish 1 2 (100.00%)
Friulian 1 2 (100.00%) Scots 1 2 (100.00%)
Galician 1 2 (100.00%) Neapolitan 1 2 (100.00%)
Galoli 1 2 (100.00%) Galician 1 2 (100.00%)
Gan 1 2 (100.00%) Lithuanian 1 2 (100.00%)
Georgian 2 4 (99.69%) Corsican 1 2 (100.00%)
German 6 408 (99.98%)
Issues with parsing articles inside parentheses.
Sicilian 1 2 (100.00%)
Ghotuo 1 2 (100.00%) Old English 1 2 (100.00%)
Gothic 1 4 (100.00%) Sranan Tongo 1 2 (100.00%)
Greek 17 2290 (61.91%) Tarantino 1 2 (100.00%)
Greenlandic 1 2 (100.00%) Vietnamese 1 2 (100.00%)
Guarani 1 2 (100.00%) Malay 1 2 (100.00%)
Gujarati 1 2 (100.00%) Bavarian 1 2 (100.00%)
Haitian Creole 1 2 (100.00%) Faroese 1 2 (100.00%)
Hakka 1 2 (100.00%) Cornish 1 2 (100.00%)
Hausa 1 2 (100.00%) Ladin 1 2 (100.00%)
Hawaiian 1 2 (100.00%) Moldovan 1 2 (100.00%)
Hebrew 1 2 (100.00%) Asturian 1 2 (100.00%)
Hindi 1 2 (100.00%) Aragonese 1 2 (100.00%)
Hittite 1 4 (100.00%) Igbo 1 2 (100.00%)
Hungarian 1 2 (100.00%) Ancient Egyptian 1 2 (100.00%)
Icelandic 1 2 (100.00%) Tagalog 1 2 (100.00%)
Ido 2 12 (100.00%) Maltese 1 2 (100.00%)
Igbo 1 2 (100.00%) Ukrainian 1 2 (100.00%)
Ilocano 1 2 (100.00%) Volapük 1 2 (100.00%)
Inari Sami 1 2 (100.00%) Lower Sorbian 1 2 (100.00%)
Indonesian 1 2 (100.00%) Ewe 1 2 (100.00%)
Interlingua 1 2 (100.00%) Korean 1 2 (100.00%)
Interlingue 1 2 (100.00%) Hebrew 1 2 (100.00%)
Inuktitut 1 2 (100.00%) Quechua 1 2 (100.00%)
Irish 1 2 (100.00%) Swahili 1 2 (100.00%)
Because subject class concord and object class concord uses the same form (for example "c5"), we don't have yet a way to distinguish them. The meaning is human-parsable because we can look at the alignment of the headers (horizontal=subject concord, vertical=object concord), but the parser does not (yet) have this kind of memory or spatial awareness. /////// Sep 9th 2022: I've recently implemented a way to transform row or column tags into other tags based on language, but it doesn't solve an underlying issue with Swahili tables that I'd missed or forgotten about: The horizontal column headers for class and person don't get inherited through the whole template because it is chopped up into several subtables that don't directly inherit from above. It's an issue with 'bleed' again. /////// Later (comment in Dec 2022): forgot to put this here, but Swahili tables have now gotten their own big systems that make them work, including a "save to register" feature that can keep column headers in memory to be used in a later subtable within a template.
Italian 5 22 (100.00%) Friulian 1 2 (100.00%)
Italiot Greek 1 6 (100.00%) Ilocano 1 2 (100.00%)
Jamaican Creole 2 8 (100.00%) Cebuano 1 2 (100.00%)
Japanese 3 12 (100.00%) Zealandic 1 2 (100.00%)
Javanese 1 2 (100.00%) Cappadocian Greek 1 2 (100.00%)
Kabardian 1 4 (100.00%) Pontic 1 4 (100.00%)
Kabyle 1 2 (100.00%) Arabic 1 2 (100.00%)
Kannada 1 2 (100.00%) Belarusian 1 2 (100.00%)
Kapampangan 1 2 (100.00%) Macedonian 1 2 (100.00%)
Karachay-Balkar 1 2 (100.00%) Thai 1 2 (100.00%)
Kashmiri 1 2 (100.00%) Old High German 1 2 (100.00%)
Kashubian 1 2 (100.00%) Persian 1 2 (100.00%)
Kazakh 2 4 (1.11%) Hindi 1 2 (100.00%)
Khmer 1 2 (100.00%) Zulu 1 2 (100.00%)
Kinyarwanda 1 2 (100.00%) Kabyle 1 2 (100.00%)
Kongo 1 2 (100.00%) Kotava 1 2 (100.00%)
Korean 1 2 (100.00%) Crimean Tatar 1 2 (100.00%)
Kotava 1 2 (100.00%) Fijian 1 2 (100.00%)
Kurdish 1 2 (100.00%) Bambara 1 2 (100.00%)
Kyrgyz 1 2 (100.00%) Nepali 1 2 (100.00%)
Ladin 1 2 (100.00%) Tatar 1 2 (100.00%)
Ladino 2 8 (100.00%) Tamil 1 2 (100.00%)
Lakota 1 2 (100.00%) Urdu 1 2 (100.00%)
Lao 1 2 (100.00%) Gujarati 1 2 (100.00%)
Latin 5 236 (100.00%) Alemannic 1 2 (100.00%)
Latvian 1 2 (100.00%) Yiddish 1 2 (100.00%)
Ligurian 1 2 (100.00%) Samoan 1 2 (100.00%)
Limburgish 1 2 (100.00%) Kinyarwanda 1 2 (100.00%)
Lingala 1 2 (100.00%) Somali 1 2 (100.00%)
Lithuanian 1 2 (100.00%) Cantonese 1 2 (100.00%)
Livonian 1 2 (100.00%) Minnan 1 2 (100.00%)
Lojban 1 2 (100.00%) Romansch 1 2 (100.00%)
Lombard 2 10 (100.00%) Pali 1 2 (100.00%)
Low German 1 2 (100.00%) Bangla 1 2 (100.00%)
Lower Sorbian 1 2 (100.00%) Zhuang 1 2 (100.00%)
Luxembourgish 1 2 (100.00%) Walloon 1 2 (100.00%)
Lydian 1 4 (100.00%) Tajik 1 2 (100.00%)
Macedonian 1 2 (100.00%) Tahitian 1 2 (100.00%)
Malagasy 1 2 (100.00%) Chuvash 1 2 (100.00%)
Malay 1 2 (100.00%) Rundi 1 2 (100.00%)
Malayalam 1 2 (100.00%) Kashmiri 1 2 (100.00%)
Maltese 1 2 (100.00%) Malayalam 1 2 (100.00%)
Manchu 1 2 (100.00%) Ossetian 1 2 (100.00%)
Manx 1 2 (100.00%) Dhivehi 1 2 (100.00%)
Maori 1 2 (100.00%) Biblical Hebrew 1 2 (100.00%)
Mapuche 1 2 (100.00%) Yucatec Maya 1 2 (100.00%)
Marathi 1 2 (100.00%) Turkmen 1 2 (100.00%)
Mari 1 2 (100.00%) Elamite 1 4 (100.00%)
Medieval Greek 2 6 (98.83%) Bemba 1 2 (100.00%)
Middle Dutch 1 2 (100.00%) Swati 1 2 (100.00%)
Middle English 1 2 (100.00%) Northern Sotho 1 2 (100.00%)
Minnan 1 2 (100.00%) Uyghur 1 2 (100.00%)
Mirandese 1 2 (100.00%) Punjabi 1 2 (100.00%)
Moksha 1 2 (100.00%) Romani 1 2 (100.00%)
Moldovan 1 2 (100.00%) Maori 1 2 (100.00%)
Mongolian 1 2 (100.00%) Mongolian 1 2 (100.00%)
Mycenaean Greek 1 6 (100.00%) Kashubian 1 2 (100.00%)
Nahuatl 1 2 (100.00%) Tok Pisin 1 2 (100.00%)
Narom 1 2 (100.00%) Lingala 1 2 (100.00%)
Nauru 1 2 (100.00%) Chamorro 1 2 (100.00%)
Navajo 1 2 (100.00%) Haitian Creole 1 2 (100.00%)
Neapolitan 1 2 (100.00%) Inuktitut 1 2 (100.00%)
Nepali 1 2 (100.00%) Tongan 1 2 (100.00%)
Newar 1 2 (100.00%) Shona 1 2 (100.00%)
Northern Sami 1 2 (100.00%) Veps 1 2 (100.00%)
Northern Sotho 1 2 (100.00%) Marathi 1 2 (100.00%)
Norwegian 2 4 (99.98%) Kyrgyz 1 2 (100.00%)
Norwegian Bokmål 1 2 (100.00%) Northern Sami 1 2 (100.00%)
Norwegian Nynorsk 1 2 (100.00%) Western Maninkakan 1 2 (100.00%)
Novial 1 2 (100.00%) Yoruba 1 2 (100.00%)
Occitan 1 2 (100.00%) Wolof 1 2 (100.00%)
Odia 1 2 (100.00%) Sanskrit 1 2 (100.00%)
Ojibwa 1 2 (100.00%) Tswana 1 2 (100.00%)
Old Armenian 1 2 (100.00%) Samogitian 1 2 (100.00%)
Old English 1 2 (100.00%) Pennsylvania German 1 2 (100.00%)
Old French 5 22 (93.29%) Pashto 1 2 (100.00%)
Old High German 1 2 (100.00%) Lojban 1 2 (100.00%)
Old Irish 1 2 (100.00%)
Leave be; Old Irish tables can wait until they've sorted themselves out.
Kannada 1 2 (100.00%)
Old Norse 1 2 (100.00%) Malagasy 1 2 (100.00%)
Old Occitan 1 2 (100.00%) Tibetan 1 2 (100.00%)
Old Persian 1 2 (100.00%) Venda 1 2 (100.00%)
Old Turkic 1 4 (100.00%) Xhosa 1 2 (100.00%)
Ossetian 1 2 (100.00%) Sotho 1 2 (100.00%)
Ottoman Turkish 2 4 (98.55%) Tsonga 1 2 (100.00%)
Palatine German 1 2 (100.00%) Aymara 1 2 (100.00%)
Pali 1 2 (100.00%) Telugu 1 2 (100.00%)
Papiamento 1 2 (100.00%) Moksha 1 2 (100.00%)
Pashto 1 2 (100.00%) Middle Dutch 1 2 (100.00%)
Pennsylvania German 1 2 (100.00%) Hawaiian 1 2 (100.00%)
Persian 1 2 (100.00%) Rapa Nui 1 2 (100.00%)
Piedmontese 1 2 (100.00%) Guarani 1 2 (100.00%)
Polish 2 60 (100.00%) Javanese 1 2 (100.00%)
Pontic 1 4 (100.00%) Sundanese 1 2 (100.00%)
Portuguese 5 24 (99.89%) Livonian 1 2 (100.00%)
Proto-Albanian 1 2 (100.00%) Aramaic 1 6 (100.00%)
Proto-Balto-Slavic 1 2 (100.00%) Kapampangan 1 2 (100.00%)
Proto-Germanic 1 2 (100.00%) Abkhaz 1 2 (100.00%)
Proto-Hellenic 1 2 (100.00%) Adyghe 1 2 (100.00%)
Proto-Indo-European 1 2 (100.00%) Amharic 1 2 (100.00%)
Proto-Italic 1 2 (100.00%) Old Occitan 1 2 (100.00%)
Proto-Slavic 1 2 (100.00%) Dutch Low Saxon 1 2 (100.00%)
Punjabi 1 2 (100.00%) Dimli 1 2 (100.00%)
Quechua 1 2 (100.00%) Navajo 1 2 (100.00%)
Rapa Nui 1 2 (100.00%) Kongo 1 2 (100.00%)
Romani 1 2 (100.00%) Mapuche 1 2 (100.00%)
Romanian 4 70 (99.85%) Bislama 1 2 (100.00%)
Romansch 1 2 (100.00%) Colognian 1 2 (100.00%)
Rundi 1 2 (100.00%) Arpitan 1 2 (100.00%)
Russian 3 60 (99.80%) Tetum 1 2 (100.00%)
Samoan 1 2 (100.00%) Cherokee 1 2 (100.00%)
Samogitian 1 2 (100.00%) Cheyenne 1 2 (100.00%)
Sanskrit 1 2 (100.00%) Bashkir 1 4 (100.00%)
Sardinian 1 2 (100.00%) Extremaduran 1 2 (100.00%)
Saterland Frisian 1 2 (100.00%) Newar 1 2 (100.00%)
Scots 1 2 (100.00%) Gothic 1 4 (100.00%)
Scottish Gaelic 1 2 (100.00%) Interlingue 1 2 (100.00%)
Seneca 1 2 (100.00%) Seneca 1 2 (100.00%)
Serbian 2 4 (99.94%) Norwegian Bokmål 1 2 (100.00%)
Serbo-Croatian 2 32 (100.00%) Algonquin 1 2 (100.00%)
Shona 1 2 (100.00%) Afar 1 2 (100.00%)
Sicilian 1 2 (100.00%) Fiji Hindi 1 2 (100.00%)
Sinhala 1 2 (100.00%) Inari Sami 1 2 (100.00%)
Slovak 4 58 (99.88%) Greenlandic 1 2 (100.00%)
Slovene 1 2 (100.00%) Upper Sorbian 1 2 (100.00%)
Somali 1 2 (100.00%) Galoli 1 2 (100.00%)
Sotho 1 2 (100.00%) Lao 1 2 (100.00%)
Spanish 5 274 (99.97%) Hausa 1 2 (100.00%)
Sranan Tongo 1 2 (100.00%) Middle English 1 2 (100.00%)
Sundanese 1 2 (100.00%) Karachay-Balkar 1 2 (100.00%)
Swahili 1 2 (100.00%)
Because subject class concord and object class concord uses the same form (for example "c5"), we don't have yet a way to distinguish them. The meaning is human-parsable because we can look at the alignment of the headers (horizontal=subject concord, vertical=object concord), but the parser does not (yet) have this kind of memory or spatial awareness. /////// Sep 9th 2022: I've recently implemented a way to transform row or column tags into other tags based on language, but it doesn't solve an underlying issue with Swahili tables that I'd missed or forgotten about: The horizontal column headers for class and person don't get inherited through the whole template because it is chopped up into several subtables that don't directly inherit from above. It's an issue with 'bleed' again. /////// Later (comment in Dec 2022): forgot to put this here, but Swahili tables have now gotten their own big systems that make them work, including a "save to register" feature that can keep column headers in memory to be used in a later subtable within a template.
Burmese 1 2 (100.00%)
Swati 1 2 (100.00%) Sinhala 1 2 (100.00%)
Swedish 2 4 (100.00%) Central Kurdish 1 2 (100.00%)
Tagalog 1 2 (100.00%) Egyptian Arabic 1 2 (100.00%)
Tahitian 1 2 (100.00%) Bikol Central 1 2 (100.00%)
Tajik 1 2 (100.00%) Khmer 1 2 (100.00%)
Tamil 1 2 (100.00%) Kabardian 1 4 (100.00%)
Tarantino 1 2 (100.00%) Mari 1 2 (100.00%)
Tatar 1 2 (100.00%) Nahuatl 1 2 (100.00%)
Telugu 1 2 (100.00%) Palatine German 1 2 (100.00%)
Tetum 1 2 (100.00%) Western Punjabi 1 2 (100.00%)
Thai 1 2 (100.00%) Carpathian Rusyn 1 2 (100.00%)
Tibetan 1 2 (100.00%) Yakut 1 2 (100.00%)
Tok Pisin 1 2 (100.00%) Saterland Frisian 1 2 (100.00%)
Tongan 1 2 (100.00%) Waray 1 2 (100.00%)
Translingual 6 36 (99.19%) Gan 1 2 (100.00%)
Tsakonian 1 2 (100.00%) Anglo-Norman 1 2 (100.00%)
Tsonga 1 2 (100.00%) Italiot Greek 1 6 (100.00%)
Tswana 1 2 (100.00%) Wu 1 2 (100.00%)
Turkish 6 2746 (99.98%) Manchu 1 2 (100.00%)
Turkmen 1 2 (100.00%) Mycenaean Greek 1 6 (100.00%)
Udmurt 1 4 (100.00%) Proto-Indo-European 1 2 (100.00%)
Ukrainian 1 2 (100.00%) Ghotuo 1 2 (100.00%)
Upper Sorbian 1 2 (100.00%) Old Armenian 1 2 (100.00%)
Urdu 1 2 (100.00%) Nauru 1 2 (100.00%)
Uyghur 1 2 (100.00%) Chechen 1 2 (100.00%)
Uzbek 2 4 (96.97%) Hakka 1 2 (100.00%)
Venda 1 2 (100.00%) Odia 1 2 (100.00%)
Venetan 1 2 (100.00%) Old Persian 1 2 (100.00%)
Veps 1 2 (100.00%) Old Turkic 1 4 (100.00%)
Vietnamese 1 2 (100.00%) Assamese 1 2 (100.00%)
Volapük 1 2 (100.00%) Mirandese 1 2 (100.00%)
Walloon 1 2 (100.00%) Lydian 1 4 (100.00%)
Waray 1 2 (100.00%) Old Norse 1 2 (100.00%)
Welsh 1 2 (100.00%) Udmurt 1 4 (100.00%)
West Flemish 1 2 (100.00%) Lakota 1 2 (100.00%)
West Frisian 1 2 (100.00%) Ojibwa 1 2 (100.00%)
Western Maninkakan 1 2 (100.00%) Proto-Hellenic 1 2 (100.00%)
Western Punjabi 1 2 (100.00%) Dzongkha 1 4 (100.00%)
Wolof 1 2 (100.00%) Old Irish 1 2 (100.00%)
Leave be; Old Irish tables can wait until they've sorted themselves out.
Wu 1 2 (100.00%) Church Slavic 1 2 (100.00%)
Xhosa 1 2 (100.00%) Hittite 1 4 (100.00%)
Yakut 1 2 (100.00%) Coptic 1 2 (100.00%)
Yiddish 1 2 (100.00%) Cypriot Arabic 1 4 (100.00%)
Yoruba 1 2 (100.00%) Proto-Slavic 1 2 (100.00%)
Yucatec Maya 1 2 (100.00%) Proto-Balto-Slavic 1 2 (100.00%)
Zealandic 1 2 (100.00%) Proto-Italic 1 2 (100.00%)
Zhuang 1 2 (100.00%) Proto-Albanian 1 2 (100.00%)
Zulu 1 2 (100.00%) Proto-Germanic 1 2 (100.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-08-28 from the elwiktionary dump dated 2025-08-21 using wiktextract (ffdbfc3 and b9346a0). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.