Wiktionary data extraction errors and warnings

Language ⏶	Table forms	Errors (% affected words)		Language	Table forms ⏷	Errors (% affected words)
Abkhaz	1	4 (100.00%)		Greek	1903	2452 (35.47%)
Adyghe	1	2 (100.00%)		Ancient Greek	565	876 (14.76%)
Afar	1	2 (100.00%)		Latin	68	80 (70.92%)
Afrikaans	1	2 (100.00%)		Medieval Greek	66	50 (36.63%)
Albanian	5	4 (8.72%)		English	40	30 (64.69%)
Alemannic	3	2 (19.05%)		German	25	24 (8.15%)	[Show ▼] [Hide ▲] Issues with parsing articles inside parentheses.
Algonquin	1	2 (100.00%)		Cypriot Greek	21	14 (38.91%)
Amharic	1	2 (100.00%)		French	19	26 (62.30%)
Ancient Egyptian	3	2 (86.21%)		Spanish	18	52 (93.68%)
Ancient Greek	565	876 (14.76%)		Cretan Greek	17	8 (30.00%)
Ancient Macedonian	2	0 (0.00%)		Turkish	15	20 (9.50%)
Anglo-Norman	1	2 (100.00%)		Multiple languages	14	6 (79.27%)
Arabic	2	2 (99.81%)		Esperanto	13	16 (85.13%)
Aragonese	1	2 (100.00%)		Serbo-Croatian	13	10 (84.89%)
Aramaic	4	2 (57.14%)		Romanian	12	26 (89.40%)
Armenian	11	18 (11.85%)		Polish	11	8 (55.59%)
Aromanian	6	4 (81.94%)		Old French	11	8 (56.80%)
Arpitan	1	2 (100.00%)		Slovak	11	40 (92.76%)
Assamese	1	2 (100.00%)		Italiot Greek	11	12 (85.33%)
Asturian	4	6 (91.18%)		Armenian	11	18 (11.85%)
Aymara	1	2 (100.00%)		Uni	10	16 (94.79%)
Azerbaijani	5	6 (62.86%)		Italian	10	10 (4.01%)
Bambara	1	2 (100.00%)		Russian	10	10 (97.27%)
Bangla	1	2 (100.00%)		Pontic	9	4 (20.73%)
Bashkir	1	4 (100.00%)		Dutch	7	4 (23.20%)
Basque	1	2 (100.00%)		Portuguese	7	6 (73.92%)
Bavarian	2	2 (69.57%)		Tsakonian	7	6 (76.76%)
Belarusian	1	2 (100.00%)		Mycenaean Greek	7	16 (95.43%)
Bemba	1	2 (100.00%)		Swedish	6	2 (3.61%)
Biblical Hebrew	4	2 (70.27%)		Aromanian	6	4 (81.94%)
Bislama	1	2 (100.00%)		Serbian	6	2 (33.41%)
Bosnian	1	2 (100.00%)		Bulgarian	6	4 (99.27%)
Breton	2	2 (99.42%)		Ladino	6	2 (22.73%)
Bulgarian	6	4 (99.27%)		Azerbaijani	5	6 (62.86%)
Burmese	1	2 (100.00%)		Albanian	5	4 (8.72%)
Cantonese	1	2 (100.00%)		Icelandic	5	4 (23.83%)
Cappadocian Greek	5	2 (45.45%)		Czech	5	2 (97.11%)
Carpathian Rusyn	2	2 (50.00%)		Croatian	5	6 (98.39%)
Catalan	3	2 (99.59%)		Slovene	5	2 (1.32%)
Cebuano	2	2 (90.91%)		Cappadocian Greek	5	2 (45.45%)
Central	2	2 (75.00%)		Macedonian	5	2 (98.12%)
Central Bikol	2	2 (66.67%)		Proto-Hellenic	5	6 (33.33%)
Central Nahuatl	1	2 (100.00%)		Finnish	4	2 (8.83%)
Chamorro	1	2 (100.00%)		Danish	4	2 (4.35%)
Chechen	1	2 (100.00%)		Norwegian	4	2 (12.37%)
Cherokee	1	2 (100.00%)		Japanese	4	20 (100.00%)
Cheyenne	1	2 (100.00%)		West Flemish	4	2 (0.03%)
Chinese	1	2 (100.00%)		Lithuanian	4	2 (99.64%)
Church Slavic	2	0 (0.00%)		Asturian	4	6 (91.18%)
Chuvash	1	2 (100.00%)		Georgian	4	4 (97.89%)
Classical Nahuatl	1	2 (100.00%)		Kabyle	4	2 (77.78%)
Colognian	1	2 (100.00%)		Biblical Hebrew	4	2 (70.27%)
Coptic	2	2 (70.97%)		Aramaic	4	2 (57.14%)
Cornish	1	2 (100.00%)		Proto-Indo-European	4	2 (80.00%)
Corsican	1	2 (100.00%)		Latvian	3	2 (99.33%)
Cretan Greek	17	8 (30.00%)		Catalan	3	2 (99.59%)
Crimean Tatar	1	2 (100.00%)		Hungarian	3	2 (99.72%)
Croatian	5	6 (98.39%)		Narom	3	2 (88.24%)
Cypriot Arabic	1	6 (100.00%)		Venetan	3	4 (95.60%)
Cypriot Greek	21	14 (38.91%)		Neapolitan	3	2 (96.05%)
Czech	5	2 (97.11%)		Lombard	3	4 (88.24%)
Danish	4	2 (4.35%)		Ancient Egyptian	3	2 (86.21%)
Dhivehi	1	2 (100.00%)		Maltese	3	2 (96.83%)
Dimli	1	2 (100.00%)		Ukrainian	3	2 (99.64%)
Dutch	7	4 (23.20%)		Hebrew	3	2 (99.68%)
Dzongkha	1	4 (100.00%)		Friulian	3	2 (90.00%)
East Circassian	1	4 (100.00%)		Alemannic	3	2 (19.05%)
Egyptian Arabic	1	2 (100.00%)		Ottoman Turkish	3	4 (93.59%)
Elamite	1	4 (100.00%)		Jamaican Creole	3	6 (62.50%)
English	40	30 (64.69%)		Old Irish	3	2 (21.43%)	[Show ▼] [Hide ▲] Leave be; Old Irish tables can wait until they've sorted themselves out.
Esperanto	13	16 (85.13%)		Gothic	3	4 (54.55%)
Estonian	2	2 (96.88%)		Proto-Slavic	3	0 (0.00%)
Ewe	1	2 (100.00%)		Proto-Italic	3	2 (25.00%)
Extremaduran	2	2 (50.00%)		Irish	2	2 (99.65%)
Faroese	1	2 (100.00%)		Indonesian	2	2 (99.63%)
Fiji Hindi	1	2 (100.00%)		Ido	2	4 (99.99%)
Fijian	1	2 (100.00%)		Breton	2	2 (99.42%)
Finnish	4	2 (8.83%)		Estonian	2	2 (96.88%)
French	19	26 (62.30%)		Sardinian	2	2 (96.61%)
Friulian	3	2 (90.00%)		Scots	2	2 (95.00%)
Galician	1	2 (100.00%)		Bavarian	2	2 (69.57%)
Galoli	1	2 (100.00%)		Igbo	2	2 (92.31%)
Gan	2	2 (50.00%)		Cebuano	2	2 (90.91%)
Georgian	4	4 (97.89%)		Zealandic	2	2 (42.31%)
German	25	24 (8.15%)	[Show ▼] [Hide ▲] Issues with parsing articles inside parentheses.	Arabic	2	2 (99.81%)
Ghotuo	1	2 (100.00%)		Minnan	2	2 (96.00%)
Gothic	3	4 (54.55%)		Romansh	2	2 (94.44%)
Greek	1903	2452 (35.47%)		Pali	2	2 (98.51%)
Greenlandic	1	2 (100.00%)		Uzbek	2	2 (71.43%)
Guarani	1	2 (100.00%)		Ossetian	2	2 (95.24%)
Gujarati	1	2 (100.00%)		Turkmen	2	2 (64.00%)
Haitian Creole	1	2 (100.00%)		Romani	2	2 (85.71%)
Hakka	1	2 (100.00%)		Kazakh	2	4 (1.11%)
Hausa	1	2 (100.00%)		Samogitian	2	2 (75.00%)
Hawaiian	2	2 (95.24%)		Pennsylvania German	2	2 (66.67%)
Hebrew	3	2 (99.68%)		Tibetan	2	4 (85.71%)
Hindi	1	2 (100.00%)		Hawaiian	2	2 (95.24%)
Hittite	2	4 (50.00%)		Javanese	2	2 (91.67%)
Hungarian	3	2 (99.72%)		Old Occitan	2	2 (33.33%)
Icelandic	5	4 (23.83%)		Mariupol Greek	2	10 (93.33%)
Ido	2	4 (99.99%)		Extremaduran	2	2 (50.00%)
Igbo	2	2 (92.31%)		Karachay-Balkar	2	2 (66.67%)
Ilocano	1	2 (100.00%)		Central	2	2 (75.00%)
Inari Sami	1	2 (100.00%)		Central Bikol	2	2 (66.67%)
Indonesian	2	2 (99.63%)		Nahuatl	2	2 (50.00%)
Interlingua	1	2 (100.00%)		Carpathian Rusyn	2	2 (50.00%)
Interlingue	1	2 (100.00%)		Saterland Frisian	2	2 (50.00%)
Inuktitut	1	2 (100.00%)		Gan	2	2 (50.00%)
Irish	2	2 (99.65%)		Old Persian	2	2 (50.00%)
Italian	10	10 (4.01%)		Old Norse	2	2 (80.00%)
Italiot Greek	11	12 (85.33%)		Church Slavic	2	0 (0.00%)
Jamaican Creole	3	6 (62.50%)		Hittite	2	4 (50.00%)
Japanese	4	20 (100.00%)		Coptic	2	2 (70.97%)
Javanese	2	2 (91.67%)		Proto-Albanian	2	0 (0.00%)
Kabyle	4	2 (77.78%)		Proto-Germanic	2	2 (50.00%)
Kannada	1	2 (100.00%)		Old East Slavic	2	0 (0.00%)
Kapampangan	1	2 (100.00%)		Ancient Macedonian	2	0 (0.00%)
Karachay-Balkar	2	2 (66.67%)		Scottish Gaelic	1	2 (100.00%)
Kashmiri	1	2 (100.00%)		Afrikaans	1	2 (100.00%)
Kashubian	1	2 (100.00%)		Luxembourgish	1	2 (100.00%)
Kazakh	2	4 (1.11%)		Low German	1	2 (100.00%)
Khmer	1	2 (100.00%)		Interlingua	1	2 (100.00%)
Kinyarwanda	1	2 (100.00%)		Occitan	1	2 (100.00%)
Kongo	1	2 (100.00%)		Papiamento	1	2 (100.00%)
Korean	1	2 (100.00%)		Novial	1	2 (100.00%)
Kotava	1	2 (100.00%)		Basque	1	2 (100.00%)
Kurdish	1	2 (100.00%)		Ligurian	1	2 (100.00%)
Kyrgyz	1	2 (100.00%)		Piedmontese	1	2 (100.00%)
Ladin	1	2 (100.00%)		West Frisian	1	2 (100.00%)
Ladino	6	2 (22.73%)		Classical Nahuatl	1	2 (100.00%)
Lakota	1	2 (100.00%)		Norwegian Nynorsk	1	2 (100.00%)
Lao	1	2 (100.00%)		Manx	1	2 (100.00%)
Latin	68	80 (70.92%)		Welsh	1	2 (100.00%)
Latvian	3	2 (99.33%)		Bosnian	1	2 (100.00%)
Ligurian	1	2 (100.00%)		Kurdish	1	2 (100.00%)
Limburgish	1	2 (100.00%)		Limburgish	1	2 (100.00%)
Lingala	1	2 (100.00%)		Galician	1	2 (100.00%)
Lithuanian	4	2 (99.64%)		Corsican	1	2 (100.00%)
Livonian	1	2 (100.00%)		Sicilian	1	2 (100.00%)
Lojban	1	2 (100.00%)		Old English	1	2 (100.00%)
Lombard	3	4 (88.24%)		Sranan Tongo	1	2 (100.00%)
Low German	1	2 (100.00%)		Tarantino	1	2 (100.00%)
Low Saxon	1	2 (100.00%)		Vietnamese	1	2 (100.00%)
Lower Sorbian	1	2 (100.00%)		Malay	1	2 (100.00%)
Luxembourgish	1	2 (100.00%)		Faroese	1	2 (100.00%)
Lydian	1	4 (100.00%)		Cornish	1	2 (100.00%)
Macedonian	5	2 (98.12%)		Ladin	1	2 (100.00%)
Malagasy	1	2 (100.00%)		Moldovan	1	2 (100.00%)
Malay	1	2 (100.00%)		Aragonese	1	2 (100.00%)
Malayalam	1	2 (100.00%)		Tagalog	1	2 (100.00%)
Maltese	3	2 (96.83%)		Volapük	1	2 (100.00%)
Manchu	1	2 (100.00%)		Lower Sorbian	1	2 (100.00%)
Manx	1	2 (100.00%)		Ewe	1	2 (100.00%)
Mapuche	1	2 (100.00%)		Korean	1	2 (100.00%)
Marathi	1	2 (100.00%)		Chinese	1	2 (100.00%)
Mari	1	2 (100.00%)		Quechua	1	2 (100.00%)
Mariupol Greek	2	10 (93.33%)		Swahili	1	2 (100.00%)	[Show ▼] [Hide ▲] Because subject class concord and object class concord uses the same form (for example "c5"), we don't have yet a way to distinguish them. The meaning is human-parsable because we can look at the alignment of the headers (horizontal=subject concord, vertical=object concord), but the parser does not (yet) have this kind of memory or spatial awareness. /////// Sep 9th 2022: I've recently implemented a way to transform row or column tags into other tags based on language, but it doesn't solve an underlying issue with Swahili tables that I'd missed or forgotten about: The horizontal column headers for class and person don't get inherited through the whole template because it is chopped up into several subtables that don't directly inherit from above. It's an issue with 'bleed' again. /////// Later (comment in Dec 2022): forgot to put this here, but Swahili tables have now gotten their own big systems that make them work, including a "save to register" feature that can keep column headers in memory to be used in a later subtable within a template.
Medieval Greek	66	50 (36.63%)		Ilocano	1	2 (100.00%)
Middle Dutch	1	0 (0.00%)		Belarusian	1	2 (100.00%)
Middle English	1	2 (100.00%)		Thai	1	2 (100.00%)
Middle French	1	0 (0.00%)		Old High German	1	2 (100.00%)
Middle High German	1	0 (0.00%)		Persian	1	2 (100.00%)
Minnan	2	2 (96.00%)		Hindi	1	2 (100.00%)
Mirandese	1	2 (100.00%)		Zulu	1	2 (100.00%)
Moksha	1	4 (100.00%)		Middle French	1	0 (0.00%)
Moldovan	1	2 (100.00%)		Kotava	1	2 (100.00%)
Mongolian	1	2 (100.00%)		Crimean Tatar	1	2 (100.00%)
Multiple languages	14	6 (79.27%)		Fijian	1	2 (100.00%)
Mycenaean Greek	7	16 (95.43%)		Bambara	1	2 (100.00%)
Māori	1	2 (100.00%)		Nepali	1	2 (100.00%)
Nahuatl	2	2 (50.00%)		Tatar	1	2 (100.00%)
Narom	3	2 (88.24%)		Tamil	1	2 (100.00%)
Nauru	1	2 (100.00%)		Urdu	1	2 (100.00%)
Navajo	1	2 (100.00%)		Gujarati	1	2 (100.00%)
Neapolitan	3	2 (96.05%)		Yiddish	1	2 (100.00%)
Nepali	1	2 (100.00%)		Samoan	1	2 (100.00%)
Newar	1	2 (100.00%)		Kinyarwanda	1	2 (100.00%)
Northern Sami	1	2 (100.00%)		Somali	1	2 (100.00%)
Northern Sotho	1	2 (100.00%)		Cantonese	1	2 (100.00%)
Norwegian	4	2 (12.37%)		Bangla	1	2 (100.00%)
Norwegian Bokmål	1	2 (100.00%)		Zhuang	1	2 (100.00%)
Norwegian Nynorsk	1	2 (100.00%)		Walloon	1	2 (100.00%)
Novial	1	2 (100.00%)		Tajik	1	2 (100.00%)
Occitan	1	2 (100.00%)		Tahitian	1	2 (100.00%)
Odia	1	2 (100.00%)		Chuvash	1	2 (100.00%)
Ojibwa	1	2 (100.00%)		Rundi	1	2 (100.00%)
Old Armenian	1	4 (100.00%)		Kashmiri	1	2 (100.00%)
Old East Slavic	2	0 (0.00%)		Malayalam	1	2 (100.00%)
Old English	1	2 (100.00%)		Dhivehi	1	2 (100.00%)
Old French	11	8 (56.80%)		Yucatec Maya	1	2 (100.00%)
Old High German	1	2 (100.00%)		Elamite	1	4 (100.00%)
Old Irish	3	2 (21.43%)	[Show ▼] [Hide ▲] Leave be; Old Irish tables can wait until they've sorted themselves out.	Bemba	1	2 (100.00%)
Old Norse	2	2 (80.00%)		Swati	1	2 (100.00%)
Old Occitan	2	2 (33.33%)		Northern Sotho	1	2 (100.00%)
Old Persian	2	2 (50.00%)		Uyghur	1	2 (100.00%)
Old Spanish	1	0 (0.00%)		Punjabi	1	2 (100.00%)
Old Turkic	1	4 (100.00%)		Māori	1	2 (100.00%)
Ossetian	2	2 (95.24%)		Mongolian	1	2 (100.00%)
Ottoman Turkish	3	4 (93.59%)		Kashubian	1	2 (100.00%)
Palatine German	1	2 (100.00%)		Tok Pisin	1	2 (100.00%)
Pali	2	2 (98.51%)		Lingala	1	2 (100.00%)
Papiamento	1	2 (100.00%)		Chamorro	1	2 (100.00%)
Pashto	1	2 (100.00%)		Haitian Creole	1	2 (100.00%)
Pennsylvania German	2	2 (66.67%)		Inuktitut	1	2 (100.00%)
Persian	1	2 (100.00%)		Tongan	1	2 (100.00%)
Piedmontese	1	2 (100.00%)		Shona	1	2 (100.00%)
Polish	11	8 (55.59%)		Veps	1	2 (100.00%)
Pontic	9	4 (20.73%)		Marathi	1	2 (100.00%)
Portuguese	7	6 (73.92%)		Kyrgyz	1	2 (100.00%)
Proto-Albanian	2	0 (0.00%)		Northern Sami	1	2 (100.00%)
Proto-Balto-Slavic	1	0 (0.00%)		Western Maninkakan	1	2 (100.00%)
Proto-Germanic	2	2 (50.00%)		Yoruba	1	4 (100.00%)
Proto-Hellenic	5	6 (33.33%)		Wolof	1	2 (100.00%)
Proto-Indo-European	4	2 (80.00%)		Sanskrit	1	2 (100.00%)
Proto-Italic	3	2 (25.00%)		Tswana	1	2 (100.00%)
Proto-Slavic	3	0 (0.00%)		Pashto	1	2 (100.00%)
Proto-Turkic	1	2 (100.00%)		Lojban	1	2 (100.00%)
Punjabi	1	2 (100.00%)		Kannada	1	2 (100.00%)
Quechua	1	2 (100.00%)		Malagasy	1	2 (100.00%)
Rapa Nui	1	2 (100.00%)		Venda	1	2 (100.00%)
Romani	2	2 (85.71%)		Xhosa	1	2 (100.00%)
Romanian	12	26 (89.40%)		Sotho	1	2 (100.00%)
Romansh	2	2 (94.44%)		Tsonga	1	2 (100.00%)
Rundi	1	2 (100.00%)		Aymara	1	2 (100.00%)
Russian	10	10 (97.27%)		Telugu	1	2 (100.00%)
Samoan	1	2 (100.00%)		Moksha	1	4 (100.00%)
Samogitian	2	2 (75.00%)		Middle Dutch	1	0 (0.00%)
Sanskrit	1	2 (100.00%)		Rapa Nui	1	2 (100.00%)
Sardinian	2	2 (96.61%)		Guarani	1	2 (100.00%)
Saterland Frisian	2	2 (50.00%)		Sundanese	1	2 (100.00%)
Scots	2	2 (95.00%)		Livonian	1	2 (100.00%)
Scottish Gaelic	1	2 (100.00%)		Kapampangan	1	2 (100.00%)
Seneca	1	2 (100.00%)		Abkhaz	1	4 (100.00%)
Serbian	6	2 (33.41%)		Adyghe	1	2 (100.00%)
Serbo-Croatian	13	10 (84.89%)		Amharic	1	2 (100.00%)
Shona	1	2 (100.00%)		Low Saxon	1	2 (100.00%)
Sicilian	1	2 (100.00%)		Dimli	1	2 (100.00%)
Sinhala	1	2 (100.00%)		Navajo	1	2 (100.00%)
Slovak	11	40 (92.76%)		Kongo	1	2 (100.00%)
Slovene	5	2 (1.32%)		Mapuche	1	2 (100.00%)
Somali	1	2 (100.00%)		Bislama	1	2 (100.00%)
Sotho	1	2 (100.00%)		Colognian	1	2 (100.00%)
Spanish	18	52 (93.68%)		Arpitan	1	2 (100.00%)
Sranan Tongo	1	2 (100.00%)		Tetum	1	2 (100.00%)
Sundanese	1	2 (100.00%)		Cherokee	1	2 (100.00%)
Swahili	1	2 (100.00%)	[Show ▼] [Hide ▲] Because subject class concord and object class concord uses the same form (for example "c5"), we don't have yet a way to distinguish them. The meaning is human-parsable because we can look at the alignment of the headers (horizontal=subject concord, vertical=object concord), but the parser does not (yet) have this kind of memory or spatial awareness. /////// Sep 9th 2022: I've recently implemented a way to transform row or column tags into other tags based on language, but it doesn't solve an underlying issue with Swahili tables that I'd missed or forgotten about: The horizontal column headers for class and person don't get inherited through the whole template because it is chopped up into several subtables that don't directly inherit from above. It's an issue with 'bleed' again. /////// Later (comment in Dec 2022): forgot to put this here, but Swahili tables have now gotten their own big systems that make them work, including a "save to register" feature that can keep column headers in memory to be used in a later subtable within a template.	Cheyenne	1	2 (100.00%)
Swati	1	2 (100.00%)		Bashkir	1	4 (100.00%)
Swedish	6	2 (3.61%)		Newar	1	2 (100.00%)
Tagalog	1	2 (100.00%)		Interlingue	1	2 (100.00%)
Tahitian	1	2 (100.00%)		Seneca	1	2 (100.00%)
Tajik	1	2 (100.00%)		Norwegian Bokmål	1	2 (100.00%)
Tamil	1	2 (100.00%)		Algonquin	1	2 (100.00%)
Tarantino	1	2 (100.00%)		Afar	1	2 (100.00%)
Tatar	1	2 (100.00%)		Fiji Hindi	1	2 (100.00%)
Telugu	1	2 (100.00%)		Inari Sami	1	2 (100.00%)
Tetum	1	2 (100.00%)		Greenlandic	1	2 (100.00%)
Thai	1	2 (100.00%)		Upper Sorbian	1	2 (100.00%)
Tibetan	2	4 (85.71%)		Galoli	1	2 (100.00%)
Tok Pisin	1	2 (100.00%)		Lao	1	2 (100.00%)
Tongan	1	2 (100.00%)		Central Nahuatl	1	2 (100.00%)
Tsakonian	7	6 (76.76%)		Hausa	1	2 (100.00%)
Tsonga	1	2 (100.00%)		Middle English	1	2 (100.00%)
Tswana	1	2 (100.00%)		Burmese	1	2 (100.00%)
Turkish	15	20 (9.50%)		Sinhala	1	2 (100.00%)
Turkmen	2	2 (64.00%)		Egyptian Arabic	1	2 (100.00%)
Udmurt	1	4 (100.00%)		Khmer	1	2 (100.00%)
Ukrainian	3	2 (99.64%)		East Circassian	1	4 (100.00%)
Uni	10	16 (94.79%)		Mari	1	2 (100.00%)
Upper Sorbian	1	2 (100.00%)		Palatine German	1	2 (100.00%)
Urdu	1	2 (100.00%)		Western Punjabi	1	2 (100.00%)
Uyghur	1	2 (100.00%)		Yakut	1	2 (100.00%)
Uzbek	2	2 (71.43%)		Waray	1	2 (100.00%)
Venda	1	2 (100.00%)		Anglo-Norman	1	2 (100.00%)
Venetan	3	4 (95.60%)		Wu	1	0 (0.00%)
Veps	1	2 (100.00%)		Manchu	1	2 (100.00%)
Vietnamese	1	2 (100.00%)		Ghotuo	1	2 (100.00%)
Volapük	1	2 (100.00%)		Old Armenian	1	4 (100.00%)
Walloon	1	2 (100.00%)		Nauru	1	2 (100.00%)
Waray	1	2 (100.00%)		Chechen	1	2 (100.00%)
Welsh	1	2 (100.00%)		Hakka	1	2 (100.00%)
West Flemish	4	2 (0.03%)		Odia	1	2 (100.00%)
West Frisian	1	2 (100.00%)		Old Turkic	1	4 (100.00%)
Western Maninkakan	1	2 (100.00%)		Assamese	1	2 (100.00%)
Western Punjabi	1	2 (100.00%)		Mirandese	1	2 (100.00%)
Wolof	1	2 (100.00%)		Lydian	1	4 (100.00%)
Wu	1	0 (0.00%)		Udmurt	1	4 (100.00%)
Xhosa	1	2 (100.00%)		Lakota	1	2 (100.00%)
Yakut	1	2 (100.00%)		Ojibwa	1	2 (100.00%)
Yiddish	1	2 (100.00%)		Dzongkha	1	4 (100.00%)
Yoruba	1	4 (100.00%)		Cypriot Arabic	1	6 (100.00%)
Yucatec Maya	1	2 (100.00%)		Proto-Balto-Slavic	1	0 (0.00%)
Zealandic	2	2 (42.31%)		Old Spanish	1	0 (0.00%)
Zhuang	1	2 (100.00%)		Middle High German	1	0 (0.00%)
Zulu	1	2 (100.00%)		Proto-Turkic	1	2 (100.00%)

Abkhaz

1

4 (100.00%)

Greek

1903

2452 (35.47%)

Adyghe

1

2 (100.00%)

Ancient Greek

565

876 (14.76%)

Afar

1

2 (100.00%)

Latin

68

80 (70.92%)

Afrikaans

1

2 (100.00%)

Medieval Greek

66

50 (36.63%)

Albanian

5

4 (8.72%)

English

40

30 (64.69%)

Alemannic

3

2 (19.05%)

German

25

24 (8.15%)

[Show ▼] [Hide ▲]

Issues with parsing articles inside parentheses.

Algonquin

1

2 (100.00%)

Cypriot Greek

21

14 (38.91%)

Amharic

1

2 (100.00%)

French

19

26 (62.30%)

Ancient Egyptian

3

2 (86.21%)

Spanish

18

52 (93.68%)

Ancient Greek

565

876 (14.76%)

Cretan Greek

17

8 (30.00%)

Ancient Macedonian

2

0 (0.00%)

Turkish

15

20 (9.50%)

Anglo-Norman

1

2 (100.00%)

Multiple languages

14

6 (79.27%)

Arabic

2

2 (99.81%)

Esperanto

13

16 (85.13%)

Aragonese

1

2 (100.00%)

Serbo-Croatian

13

10 (84.89%)

Aramaic

4

2 (57.14%)

Romanian

12

26 (89.40%)

Armenian

11

18 (11.85%)

Polish

11

8 (55.59%)

Aromanian

6

4 (81.94%)

Old French

11

8 (56.80%)

Arpitan

1

2 (100.00%)

Slovak

11

40 (92.76%)

Assamese

1

2 (100.00%)

Italiot Greek

11

12 (85.33%)

Asturian

4

6 (91.18%)

Armenian

11

18 (11.85%)

Aymara

1

2 (100.00%)

Uni

10

16 (94.79%)

Azerbaijani

5

6 (62.86%)

Italian

10

10 (4.01%)

Bambara

1

2 (100.00%)

Russian

10

10 (97.27%)

Bangla

1

2 (100.00%)

Pontic

9

4 (20.73%)

Bashkir

1

4 (100.00%)

Dutch

7

4 (23.20%)

Basque

1

2 (100.00%)

Portuguese

7

6 (73.92%)

Bavarian

2

2 (69.57%)

Tsakonian

7

6 (76.76%)

Belarusian

1

2 (100.00%)

Mycenaean Greek

7

16 (95.43%)

Bemba

1

2 (100.00%)

Swedish

6

2 (3.61%)

Biblical Hebrew

4

2 (70.27%)

Aromanian

6

4 (81.94%)

Bislama

1

2 (100.00%)

Serbian

6

2 (33.41%)

Bosnian

1

2 (100.00%)

Bulgarian

6

4 (99.27%)

Breton

2

2 (99.42%)

Ladino

6

2 (22.73%)

Bulgarian

6

4 (99.27%)

Azerbaijani

5

6 (62.86%)

Burmese

1

2 (100.00%)

Albanian

5

4 (8.72%)

Cantonese

1

2 (100.00%)

Icelandic

5

4 (23.83%)

Cappadocian Greek

5

2 (45.45%)

Czech

5

2 (97.11%)

Carpathian Rusyn

2

2 (50.00%)

Croatian

5

6 (98.39%)

Catalan

3

2 (99.59%)

Slovene

5

2 (1.32%)

Cebuano

2

2 (90.91%)

Cappadocian Greek

5

2 (45.45%)

Central

2

2 (75.00%)

Macedonian

5

2 (98.12%)

Central Bikol

2

2 (66.67%)

Proto-Hellenic

5

6 (33.33%)

Central Nahuatl

1

2 (100.00%)

Finnish

4

2 (8.83%)

Chamorro

1

2 (100.00%)

Danish

4

2 (4.35%)

Chechen

1

2 (100.00%)

Norwegian

4

2 (12.37%)

Cherokee

1

2 (100.00%)

Japanese

4

20 (100.00%)

Cheyenne

1

2 (100.00%)

West Flemish

4

2 (0.03%)

Chinese

1

2 (100.00%)

Lithuanian

4

2 (99.64%)

Church Slavic

2

0 (0.00%)

Asturian

4

6 (91.18%)

Chuvash

1

2 (100.00%)

Georgian

4

4 (97.89%)

Classical Nahuatl

1

2 (100.00%)

Kabyle

4

2 (77.78%)

Colognian

1

2 (100.00%)

Biblical Hebrew

4

2 (70.27%)

Coptic

2

2 (70.97%)

Aramaic

4

2 (57.14%)

Cornish

1

2 (100.00%)

Proto-Indo-European

4

2 (80.00%)

Corsican

1

2 (100.00%)

Latvian

3

2 (99.33%)

Cretan Greek

17

8 (30.00%)

Catalan

3

2 (99.59%)

Crimean Tatar

1

2 (100.00%)

Hungarian

3

2 (99.72%)

Croatian

5

6 (98.39%)

Narom

3

2 (88.24%)

Cypriot Arabic

1

6 (100.00%)

Venetan

3

4 (95.60%)

Cypriot Greek

21

14 (38.91%)

Neapolitan

3

2 (96.05%)

Czech

5

2 (97.11%)

Lombard

3

4 (88.24%)

Danish

4

2 (4.35%)

Ancient Egyptian

3

2 (86.21%)

Dhivehi

1

2 (100.00%)

Maltese

3

2 (96.83%)

Dimli

1

2 (100.00%)

Ukrainian

3

2 (99.64%)

Dutch

7

4 (23.20%)

Hebrew

3

2 (99.68%)

Dzongkha

1

4 (100.00%)

Friulian

3

2 (90.00%)

East Circassian

1

4 (100.00%)

Alemannic

3

2 (19.05%)

Egyptian Arabic

1

2 (100.00%)

Ottoman Turkish

3

4 (93.59%)

Elamite

1

4 (100.00%)

Jamaican Creole

3

6 (62.50%)

English

40

30 (64.69%)

Old Irish

3

2 (21.43%)

[Show ▼] [Hide ▲]

Leave be; Old Irish tables can wait until they've sorted themselves out.

Esperanto

13

16 (85.13%)

Gothic

3

4 (54.55%)

Estonian

2

2 (96.88%)

Proto-Slavic

3

0 (0.00%)

Ewe

1

2 (100.00%)

Proto-Italic

3

2 (25.00%)

Extremaduran

2

2 (50.00%)

Irish

2

2 (99.65%)

Faroese

1

2 (100.00%)

Indonesian

2

2 (99.63%)

Fiji Hindi

1

2 (100.00%)

Ido

2

4 (99.99%)

Fijian

1

2 (100.00%)

Breton

2

2 (99.42%)

Finnish

4

2 (8.83%)

Estonian

2

2 (96.88%)

French

19

26 (62.30%)

Sardinian

2

2 (96.61%)

Friulian

3

2 (90.00%)

Scots

2

2 (95.00%)

Galician

1

2 (100.00%)

Bavarian

2

2 (69.57%)

Galoli

1

2 (100.00%)

Igbo

2

2 (92.31%)

Gan

2

2 (50.00%)

Cebuano

2

2 (90.91%)

Georgian

4

4 (97.89%)

Zealandic

2

2 (42.31%)

German

25

24 (8.15%)

[Show ▼] [Hide ▲]

Issues with parsing articles inside parentheses.

Arabic

2

2 (99.81%)

Ghotuo

1

2 (100.00%)

Minnan

2

2 (96.00%)

Gothic

3

4 (54.55%)

Romansh

2

2 (94.44%)

Greek

1903

2452 (35.47%)

Pali

2

2 (98.51%)

Greenlandic

1

2 (100.00%)

Uzbek

2

2 (71.43%)

Guarani

1

2 (100.00%)

Ossetian

2

2 (95.24%)

Gujarati

1

2 (100.00%)

Turkmen

2

2 (64.00%)

Haitian Creole

1

2 (100.00%)

Romani

2

2 (85.71%)

Hakka

1

2 (100.00%)

Kazakh

2

4 (1.11%)

Hausa

1

2 (100.00%)

Samogitian

2

2 (75.00%)

Hawaiian

2

2 (95.24%)

Pennsylvania German

2

2 (66.67%)

Hebrew

3

2 (99.68%)

Tibetan

2

4 (85.71%)

Hindi

1

2 (100.00%)

Hawaiian

2

2 (95.24%)

Hittite

2

4 (50.00%)

Javanese

2

2 (91.67%)

Hungarian

3

2 (99.72%)

Old Occitan

2

2 (33.33%)

Icelandic

5

4 (23.83%)

Mariupol Greek

2

10 (93.33%)

Ido

2

4 (99.99%)

Extremaduran

2

2 (50.00%)

Igbo

2

2 (92.31%)

Karachay-Balkar

2

2 (66.67%)

Ilocano

1

2 (100.00%)

Central

2

2 (75.00%)

Inari Sami

1

2 (100.00%)

Central Bikol

2

2 (66.67%)

Indonesian

2

2 (99.63%)

Nahuatl

2

2 (50.00%)

Interlingua

1

2 (100.00%)

Carpathian Rusyn

2

2 (50.00%)

Interlingue

1

2 (100.00%)

Saterland Frisian

2

2 (50.00%)

Inuktitut

1

2 (100.00%)

Gan

2

2 (50.00%)

Irish

2

2 (99.65%)

Old Persian

2

2 (50.00%)

Italian

10

10 (4.01%)

Old Norse

2

2 (80.00%)

Italiot Greek

11

12 (85.33%)

Church Slavic

2

0 (0.00%)

Jamaican Creole

3

6 (62.50%)

Hittite

2

4 (50.00%)

Japanese

4

20 (100.00%)

Coptic

2

2 (70.97%)

Javanese

2

2 (91.67%)

Proto-Albanian

2

0 (0.00%)

Kabyle

4

2 (77.78%)

Proto-Germanic

2

2 (50.00%)

Kannada

1

2 (100.00%)

Old East Slavic

2

0 (0.00%)

Kapampangan

1

2 (100.00%)

Ancient Macedonian

2

0 (0.00%)

Karachay-Balkar

2

2 (66.67%)

Scottish Gaelic

1

2 (100.00%)

Kashmiri

1

2 (100.00%)

Afrikaans

1

2 (100.00%)

Kashubian

1

2 (100.00%)

Luxembourgish

1

2 (100.00%)

Kazakh

2

4 (1.11%)

Low German

1

2 (100.00%)

Khmer

1

2 (100.00%)

Interlingua

1

2 (100.00%)

Kinyarwanda

1

2 (100.00%)

Occitan

1

2 (100.00%)

Kongo

1

2 (100.00%)

Papiamento

1

2 (100.00%)

Korean

1

2 (100.00%)

Novial

1

2 (100.00%)

Kotava

1

2 (100.00%)

Basque

1

2 (100.00%)

Kurdish

1

2 (100.00%)

Ligurian

1

2 (100.00%)

Kyrgyz

1

2 (100.00%)

Piedmontese

1

2 (100.00%)

Ladin

1

2 (100.00%)

West Frisian

1

2 (100.00%)

Ladino

6

2 (22.73%)

Classical Nahuatl

1

2 (100.00%)

Lakota

1

2 (100.00%)

Norwegian Nynorsk

1

2 (100.00%)

Lao

1

2 (100.00%)

Manx

1

2 (100.00%)

Latin

68

80 (70.92%)

Welsh

1

2 (100.00%)

Latvian

3

2 (99.33%)

Bosnian

1

2 (100.00%)

Ligurian

1

2 (100.00%)

Kurdish

1

2 (100.00%)

Limburgish

1

2 (100.00%)

Limburgish

1

2 (100.00%)

Lingala

1

2 (100.00%)

Galician

1

2 (100.00%)

Lithuanian

4

2 (99.64%)

Corsican

1

2 (100.00%)

Livonian

1

2 (100.00%)

Sicilian

1

2 (100.00%)

Lojban

1

2 (100.00%)

Old English

1

2 (100.00%)

Lombard

3

4 (88.24%)

Sranan Tongo

1

2 (100.00%)

Low German

1

2 (100.00%)

Tarantino

1

2 (100.00%)

Low Saxon

1

2 (100.00%)

Vietnamese

1

2 (100.00%)

Lower Sorbian

1

2 (100.00%)

Malay

1

2 (100.00%)

Luxembourgish

1

2 (100.00%)

Faroese

1

2 (100.00%)

Lydian

1

4 (100.00%)

Cornish

1

2 (100.00%)

Macedonian

5

2 (98.12%)

Ladin

1

2 (100.00%)

Malagasy

1

2 (100.00%)

Moldovan

1

2 (100.00%)

Malay

1

2 (100.00%)

Aragonese

1

2 (100.00%)

Malayalam

1

2 (100.00%)

Tagalog

1

2 (100.00%)

Maltese

3

2 (96.83%)

Volapük

1

2 (100.00%)

Manchu

1

2 (100.00%)

Lower Sorbian

1

2 (100.00%)

Manx

1

2 (100.00%)

Ewe

1

2 (100.00%)

Mapuche

1

2 (100.00%)

Korean

1

2 (100.00%)

Marathi

1

2 (100.00%)

Chinese

1

2 (100.00%)

Mari

1

2 (100.00%)

Quechua

1

2 (100.00%)

Mariupol Greek

2

10 (93.33%)

Swahili

1

2 (100.00%)

[Show ▼] [Hide ▲]

Because subject class concord and object class concord uses the same form (for example "c5"), we don't have yet a way to distinguish them. The meaning is human-parsable because we can look at the alignment of the headers (horizontal=subject concord, vertical=object concord), but the parser does not (yet) have this kind of memory or spatial awareness. /////// Sep 9th 2022: I've recently implemented a way to transform row or column tags into other tags based on language, but it doesn't solve an underlying issue with Swahili tables that I'd missed or forgotten about: The horizontal column headers for class and person don't get inherited through the whole template because it is chopped up into several subtables that don't directly inherit from above. It's an issue with 'bleed' again. /////// Later (comment in Dec 2022): forgot to put this here, but Swahili tables have now gotten their own big systems that make them work, including a "save to register" feature that can keep column headers in memory to be used in a later subtable within a template.

Medieval Greek

66

50 (36.63%)

Ilocano

1

2 (100.00%)

Middle Dutch

1

0 (0.00%)

Belarusian

1

2 (100.00%)

Middle English

1

2 (100.00%)

Thai

1

2 (100.00%)

Middle French

1

0 (0.00%)

Old High German

1

2 (100.00%)

Middle High German

1

0 (0.00%)

Persian

1

2 (100.00%)

Minnan

2

2 (96.00%)

Hindi

1

2 (100.00%)

Mirandese

1

2 (100.00%)

Zulu

1

2 (100.00%)

Moksha

1

4 (100.00%)

Middle French

1

0 (0.00%)

Moldovan

1

2 (100.00%)

Kotava

1

2 (100.00%)

Mongolian

1

2 (100.00%)

Crimean Tatar

1

2 (100.00%)

Multiple languages

14

6 (79.27%)

Fijian

1

2 (100.00%)

Mycenaean Greek

7

16 (95.43%)

Bambara

1

2 (100.00%)

Māori

1

2 (100.00%)

Nepali

1

2 (100.00%)

Nahuatl

2

2 (50.00%)

Tatar

1

2 (100.00%)

Narom

3

2 (88.24%)

Tamil

1

2 (100.00%)

Nauru

1

2 (100.00%)

Urdu

1

2 (100.00%)

Navajo

1

2 (100.00%)

Gujarati

1

2 (100.00%)

Neapolitan

3

2 (96.05%)

Yiddish

1

2 (100.00%)

Nepali

1

2 (100.00%)

Samoan

1

2 (100.00%)

Newar

1

2 (100.00%)

Kinyarwanda

1

2 (100.00%)

Northern Sami

1

2 (100.00%)

Somali

1

2 (100.00%)

Northern Sotho

1

2 (100.00%)

Cantonese

1

2 (100.00%)

Norwegian

4

2 (12.37%)

Bangla

1

2 (100.00%)

Norwegian Bokmål

1

2 (100.00%)

Zhuang

1

2 (100.00%)

Norwegian Nynorsk

1

2 (100.00%)

Walloon

1

2 (100.00%)

Novial

1

2 (100.00%)

Tajik

1

2 (100.00%)

Occitan

1

2 (100.00%)

Tahitian

1

2 (100.00%)

Odia

1

2 (100.00%)

Chuvash

1

2 (100.00%)

Ojibwa

1

2 (100.00%)

Rundi

1

2 (100.00%)

Old Armenian

1

4 (100.00%)

Kashmiri

1

2 (100.00%)

Old East Slavic

2

0 (0.00%)

Malayalam

1

2 (100.00%)

Old English

1

2 (100.00%)

Dhivehi

1

2 (100.00%)

Old French

11

8 (56.80%)

Yucatec Maya

1

2 (100.00%)

Old High German

1

2 (100.00%)

Elamite

1

4 (100.00%)

Old Irish

3

2 (21.43%)

[Show ▼] [Hide ▲]

Leave be; Old Irish tables can wait until they've sorted themselves out.

Bemba

1

2 (100.00%)

Old Norse

2

2 (80.00%)

Swati

1

2 (100.00%)

Old Occitan

2

2 (33.33%)

Northern Sotho

1

2 (100.00%)

Old Persian

2

2 (50.00%)

Uyghur

1

2 (100.00%)

Old Spanish

1

0 (0.00%)

Punjabi

1

2 (100.00%)

Old Turkic

1

4 (100.00%)

Māori

1

2 (100.00%)

Ossetian

2

2 (95.24%)

Mongolian

1

2 (100.00%)

Ottoman Turkish

3

4 (93.59%)

Kashubian

1

2 (100.00%)

Palatine German

1

2 (100.00%)

Tok Pisin

1

2 (100.00%)

Pali

2

2 (98.51%)

Lingala

1

2 (100.00%)

Papiamento

1

2 (100.00%)

Chamorro

1

2 (100.00%)

Pashto

1

2 (100.00%)

Haitian Creole

1

2 (100.00%)

Pennsylvania German

2

2 (66.67%)

Inuktitut

1

2 (100.00%)

Persian

1

2 (100.00%)

Tongan

1

2 (100.00%)

Piedmontese

1

2 (100.00%)

Shona

1

2 (100.00%)

Polish

11

8 (55.59%)

Veps

1

2 (100.00%)

Pontic

9

4 (20.73%)

Marathi

1

2 (100.00%)

Portuguese

7

6 (73.92%)

Kyrgyz

1

2 (100.00%)

Proto-Albanian

2

0 (0.00%)

Northern Sami

1

2 (100.00%)

Proto-Balto-Slavic

1

0 (0.00%)

Western Maninkakan

1

2 (100.00%)

Proto-Germanic

2

2 (50.00%)

Yoruba

1

4 (100.00%)

Proto-Hellenic

5

6 (33.33%)

Wolof

1

2 (100.00%)

Proto-Indo-European

4

2 (80.00%)

Sanskrit

1

2 (100.00%)

Proto-Italic

3

2 (25.00%)

Tswana

1

2 (100.00%)

Proto-Slavic

3

0 (0.00%)

Pashto

1

2 (100.00%)

Proto-Turkic

1

2 (100.00%)

Lojban

1

2 (100.00%)

Punjabi

1

2 (100.00%)

Kannada

1

2 (100.00%)

Quechua

1

2 (100.00%)

Malagasy

1

2 (100.00%)

Rapa Nui

1

2 (100.00%)

Venda

1

2 (100.00%)

Romani

2

2 (85.71%)

Xhosa

1

2 (100.00%)

Romanian

12

26 (89.40%)

Sotho

1

2 (100.00%)

Romansh

2

2 (94.44%)

Tsonga

1

2 (100.00%)

Rundi

1

2 (100.00%)

Aymara

1

2 (100.00%)

Russian

10

10 (97.27%)

Telugu

1

2 (100.00%)

Samoan

1

2 (100.00%)

Moksha

1

4 (100.00%)

Samogitian

2

2 (75.00%)

Middle Dutch

1

0 (0.00%)

Sanskrit

1

2 (100.00%)

Rapa Nui

1

2 (100.00%)

Sardinian

2

2 (96.61%)

Guarani

1

2 (100.00%)

Saterland Frisian

2

2 (50.00%)

Sundanese

1

2 (100.00%)

Scots

2

2 (95.00%)

Livonian

1

2 (100.00%)

Scottish Gaelic

1

2 (100.00%)

Kapampangan

1

2 (100.00%)

Seneca

1

2 (100.00%)

Abkhaz

1

4 (100.00%)

Serbian

6

2 (33.41%)

Adyghe

1

2 (100.00%)

Serbo-Croatian

13

10 (84.89%)

Amharic

1

2 (100.00%)

Shona

1

2 (100.00%)

Low Saxon

1

2 (100.00%)

Sicilian

1

2 (100.00%)

Dimli

1

2 (100.00%)

Sinhala

1

2 (100.00%)

Navajo

1

2 (100.00%)

Slovak

11

40 (92.76%)

Kongo

1

2 (100.00%)

Slovene

5

2 (1.32%)

Mapuche

1

2 (100.00%)

Somali

1

2 (100.00%)

Bislama

1

2 (100.00%)

Sotho

1

2 (100.00%)

Colognian

1

2 (100.00%)

Spanish

18

52 (93.68%)

Arpitan

1

2 (100.00%)

Sranan Tongo

1

2 (100.00%)

Tetum

1

2 (100.00%)

Sundanese

1

2 (100.00%)

Cherokee

1

2 (100.00%)

Swahili

1

2 (100.00%)

[Show ▼] [Hide ▲]

Because subject class concord and object class concord uses the same form (for example "c5"), we don't have yet a way to distinguish them. The meaning is human-parsable because we can look at the alignment of the headers (horizontal=subject concord, vertical=object concord), but the parser does not (yet) have this kind of memory or spatial awareness. /////// Sep 9th 2022: I've recently implemented a way to transform row or column tags into other tags based on language, but it doesn't solve an underlying issue with Swahili tables that I'd missed or forgotten about: The horizontal column headers for class and person don't get inherited through the whole template because it is chopped up into several subtables that don't directly inherit from above. It's an issue with 'bleed' again. /////// Later (comment in Dec 2022): forgot to put this here, but Swahili tables have now gotten their own big systems that make them work, including a "save to register" feature that can keep column headers in memory to be used in a later subtable within a template.

Cheyenne

1

2 (100.00%)

Swati

1

2 (100.00%)

Bashkir

1

4 (100.00%)

Swedish

6

2 (3.61%)