Wiktionary data extraction errors and warnings

Inflection check

This page lists the different kinds of inflection tables found in the data. When wiktextract parses word heads and inflection tables, it assigns tags describing grammatical or contextual information to the forms it encounters. The forms and tags from each head section or table are first kept separate from those of other heads and tables; later, heads and tables that contain the same number of word forms with the same tags for those forms are merged into a single table type.
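The grouping idea described above can be illustrated with a minimal sketch. This is not wiktextract's actual implementation; the functions `table_signature` and `group_by_table_type` are hypothetical, and the entry layout (a `forms` list whose items carry `form` and `tags`) is assumed to resemble wiktextract's JSON output. The sample words are made up for illustration.

```python
from collections import defaultdict


def table_signature(forms):
    """Return a hashable signature: the sorted tag tuples of all forms.

    Two tables with the same number of forms and the same tags per form
    produce the same signature (hypothetical helper, for illustration).
    """
    return tuple(sorted(tuple(sorted(f.get("tags", []))) for f in forms))


def group_by_table_type(entries):
    """Map each (language, signature) pair to the words that share it."""
    groups = defaultdict(list)
    for entry in entries:
        sig = table_signature(entry.get("forms", []))
        groups[(entry["lang"], sig)].append(entry["word"])
    return dict(groups)


# Made-up sample entries; the field names are an assumption.
entries = [
    {"word": "makan", "lang": "bahasa Indonesia",
     "forms": [{"form": "memakan", "tags": ["active"]},
               {"form": "dimakan", "tags": ["passive"]}]},
    {"word": "minum", "lang": "bahasa Indonesia",
     "forms": [{"form": "diminum", "tags": ["passive"]},
               {"form": "meminum", "tags": ["active"]}]},
    {"word": "buku", "lang": "bahasa Indonesia",
     "forms": [{"form": "buku-buku", "tags": ["plural"]}]},
]

groups = group_by_table_type(entries)
for (lang, sig), words in groups.items():
    print(lang, sig, words)
```

Sorting the tags inside each form and then sorting the forms makes the signature independent of the order in which forms appear, so "makan" and "minum" land in the same table type here.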

The information presented here is mostly for debugging, but it can also be used to find interesting word paradigms and to hunt down mistakes, typos, and badly formatted Wiktionary entries. A table type with only a few unique instances is quite likely to reflect some kind of minor error in the original data.
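As a rough sketch of how rare table types might be flagged for review, the snippet below picks out types that have only a few instances. The function `rare_table_types`, the threshold `max_instances`, and the sample data are all hypothetical and not part of wiktextract or kaikki.org.

```python
def rare_table_types(instances_by_type, max_instances=2):
    """Return (table type, instance count) pairs with few instances, rarest first."""
    rare = [
        (name, len(words))
        for name, words in instances_by_type.items()
        if len(words) <= max_instances
    ]
    # Sort by count, then by name, so the output order is deterministic.
    return sorted(rare, key=lambda pair: (pair[1], pair[0]))


# Made-up data: table type name -> words whose tables have that type.
instances_by_type = {
    "type-A": ["makan", "minum", "baca", "tulis"],
    "type-B": ["buku"],
    "type-C": ["rumah", "meja"],
}

suspects = rare_table_types(instances_by_type)
for name, count in suspects:
    print(f"{name}: {count} instance(s), worth checking in the source entry")
```

The threshold is a judgment call; a low count only suggests that an entry deserves a look, not that it is definitely wrong.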

Language | Table forms | Errors (% affected words)
:Templat:Dayak Benuaq | 1 | 2 (100.00%)
:Templat:Dayak Maanyan | 1 | 2 (100.00%)
:Templat:batak toba | 1 | 2 (100.00%)
:Templat:dayak kenyah | 1 | 2 (100.00%)
:Templat:dusun balangan | 1 | 2 (100.00%)
:Templat:jawa ngapak | 1 | 2 (100.00%)
:Templat:jawa ngoko | 1 | 2 (100.00%)
:Templat:kutai | 1 | 2 (100.00%)
:Templat:lamin adat | 1 | 2 (100.00%)
:Templat:maanyan siong | 1 | 2 (100.00%)
:Templat:maluku | 1 | 2 (100.00%)
:Templat:palembang ogan | 1 | 2 (100.00%)
:Templat:samihim | 1 | 2 (100.00%)
:Templat:semarang | 1 | 2 (100.00%)
:Templat:semendo | 1 | 2 (100.00%)
:Templat:sulawesi | 1 | 2 (100.00%)
Bahasa Belanda | 1 | 4 (100.00%)
Bahasa Indonesia | 7 | 2 (8.46%)
Bahasa Indonesia Peranakan | 4 | 2 (4.55%)
Bahasa Jawa | 3 | 2 (80.00%)
Bahasa Jepang | 1 | 0 (0.00%)
Bahasa Lampung Api | 1 | 2 (100.00%)
Bahasa Melayu | 1 | 0 (0.00%)
Bahasa Minangkabau | 1 | 2 (100.00%)
Bahasa Sunda | 1 | 2 (100.00%)
Bahasa Tidung Nunukan | 1 | 2 (100.00%)
Bahasa Using | 1 | 2 (100.00%)
Bahasa Vietnam | 1 | 26 (100.00%)
Bahasa Yami | 1 | 0 (0.00%)
GHWOSMXbahasa Indonesia | 1 | 0 (0.00%)
Lintas bahasa | 1 | 0 (0.00%)
bahasa Aceh | 1 | 2 (100.00%)
bahasa Armenia | 1 | 0 (0.00%)
bahasa Bali | 1 | 2 (100.00%)
bahasa Banjar | 3 | 2 (57.14%)
bahasa Batak Simalungun | 3 | 4 (40.00%)
bahasa Belanda | 2 | 4 (100.00%)
bahasa Betawi | 1 | 2 (100.00%)
bahasa Bintauna | 1 | 0 (0.00%)
bahasa Brunei | 1 | 2 (100.00%)
bahasa Galisia | 1 | 0 (0.00%)
bahasa Gorontalo | 1 | 2 (100.00%)
bahasa Gorontalo ( dalam Bahasa Belanda ) | 1 | 2 (100.00%)
bahasa Hakka | 1 | 6 (100.00%)
bahasa Indonesia | 15 | 16 (3.22%)
bahasa Indonesia Peranakan | 10 | 4 (1.55%)
bahasa Inggris | 2 | 2 (85.71%)
bahasa Italia | 1 | 2 (100.00%)
bahasa Jawa | 2 | 2 (42.86%)
bahasa Jepang | 3 | 20 (94.32%)
bahasa Jepang Kuno | 1 | 2 (100.00%)
bahasa Jepang lama | 1 | 2 (100.00%)
bahasa Kawi | 1 | 4 (100.00%)
bahasa Kerinci | 1 | 2 (100.00%)
bahasa Kimaragang | 1 | 2 (100.00%)
bahasa Komering | 1 | 2 (100.00%)
bahasa Korea | 1 | 4 (100.00%)
bahasa Lampung Api | 1 | 2 (100.00%)
bahasa Madura | 1 | 2 (100.00%)
bahasa Malagasi | 1 | 4 (100.00%)
bahasa Melayu | 4 | 2 (23.08%)
bahasa Melayu Manado | 1 | 2 (100.00%)
bahasa Melayu Tengah | 1 | 2 (100.00%)
bahasa Minangkabau | 1 | 2 (100.00%)
bahasa Palembang | 1 | 2 (100.00%)
bahasa Rusia | 1 | 2 (100.00%)
bahasa Sunda | 1 | 4 (100.00%)
bahasa Sunda kuno | 1 | 2 (100.00%)
bahasa Tamiang | 1 | 0 (0.00%)
bahasa Tionghoa | 1 | 0 (0.00%)
bahasa Turkmen | 1 | 0 (0.00%)
bahasa Vietnam | 2 | 4 (90.62%)
bahasa indonesia | 1 | 0 (0.00%)

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-01-09 from the idwiktionary dump dated 2026-01-02 using wiktextract (96027d6 and 9905b1f). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.