Wiktionary data extraction errors and warnings

ba/Vietnamese/adj


ba (Vietnamese adj) ba/Vietnamese/adj: invalid uppercase tag Hà-Nội not in or uppercase_tags: {"categories": ["Pages with 73 entries", "Pages with entries", "Sino-Vietnamese words", "Vietnamese adjectives", "Vietnamese determiners", "Vietnamese entries with incorrect language header", "Vietnamese lemmas", "Vietnamese nouns", "Vietnamese numerals", "Vietnamese terms derived from Proto-Austroasiatic", "Vietnamese terms derived from Proto-Vietic", "Vietnamese terms inherited from Proto-Austroasiatic", "Vietnamese terms inherited from Proto-Vietic", "Vietnamese terms with IPA pronunciation", "Vietnamese terms with redundant script codes", "vi:Male", "vi:Parents", "vi:Three"], "derived": [{"word": "ba phải"}, {"word": "ba que"}, {"word": "ba que xỏ lá"}, {"word": "Ba Tàu"}, {"word": "ba xạo"}, {"word": "dăm ba"}, {"word": "tháng ba"}, {"word": "thứ ba"}], "etymology_number": 2, "etymology_templates": [{"args": {"1": "vi", "2": "2", "3": "3", "4": "4", "5": "hai", "6": "bốn", "ord": "thứ ba"}, "expansion": "", "name": "cardinalbox"}, {"args": {"1": "vi", "2": "mkh-vie-pro", "3": "*paː"}, "expansion": "Proto-Vietic *paː", "name": "inh"}, {"args": {"1": "vi", "2": "aav-pro", "3": "*peːʔ", "4": "", "5": "three"}, "expansion": "Proto-Austroasiatic *peːʔ (“three”)", "name": "inh"}, {"args": {"1": "mtq", "2": "pa"}, "expansion": "Muong pa", "name": "cog"}, {"args": {"1": "km", "2": "បី"}, "expansion": "Khmer បី (bəy)", "name": "cog"}, {"args": {"1": "hal", "2": "pe"}, "expansion": "Halang pe", "name": "cog"}, {"args": {"1": "pac", "2": "pe"}, "expansion": "Pacoh pe", "name": "cog"}, {"args": {"1": "mnw", "2": "ပိ"}, "expansion": "Mon ပိ", "name": "cog"}, {"args": {"1": "sat", "2": "ᱯᱮ"}, "expansion": "Santali ᱯᱮ (pe)", "name": "cog"}], "etymology_text": "From Proto-Vietic *paː, from Proto-Austroasiatic *peːʔ (“three”). 
Cognate with Muong pa, Khmer បី (bəy), Halang pe, Pacoh pe, Mon ပိ, Santali ᱯᱮ (pe).", "forms": [{"form": "𠀧", "tags": ["CJK"]}, {"form": "巴", "tags": ["CJK"]}], "head_templates": [{"args": {"1": "vi", "2": "adjective", "3": "", "4": "", "5": "", "6": "", "7": "", "8": "", "head": "", "tr": "𠀧, 巴"}, "expansion": "ba • (𠀧, 巴)", "name": "head"}, {"args": {"1": "𠀧, 巴"}, "expansion": "ba • (𠀧, 巴)", "name": "vi-adj"}], "lang": "Vietnamese", "lang_code": "vi", "pos": "adj", "senses": [{"categories": ["Southern Vietnamese", "Vietnamese ordinal numbers", "Vietnamese terms with usage examples"], "examples": [{"english": "second eldest brother/sister", "text": "anh/chị ba", "type": "example"}, {"english": "second eldest brother/sister of one's parent", "text": "bác ba", "type": "example"}, {"english": "secondborn younger brother of one's father", "text": "chú ba", "type": "example"}], "glosses": ["secondborn"], "links": [["secondborn", "secondborn"]], "raw_glosses": ["(Southern Vietnam, of a sibling) secondborn"], "raw_tags": ["of a sibling"], "tags": ["Southern", "Vietnam"]}], "sounds": [{"ipa": "[ʔɓaː˧˧]", "tags": ["Hà-Nội"]}, {"ipa": "[ʔɓaː˧˧]", "tags": ["Huế"]}, {"ipa": "[ʔɓaː˧˧]", "note": "Saigon"}, {"audio": "LL-Q9199 (vie)-Penn Zero MSSJ-ba.wav", "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/8/83/LL-Q9199_%28vie%29-Penn_Zero_MSSJ-ba.wav/LL-Q9199_%28vie%29-Penn_Zero_MSSJ-ba.wav.mp3", "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/8/83/LL-Q9199_%28vie%29-Penn_Zero_MSSJ-ba.wav/LL-Q9199_%28vie%29-Penn_Zero_MSSJ-ba.wav.ogg"}, {"audio": "LL-Q9199 (vie)-Jessica Nguyen (Pamputt)-ba.wav", "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/f/fd/LL-Q9199_%28vie%29-Jessica_Nguyen_%28Pamputt%29-ba.wav/LL-Q9199_%28vie%29-Jessica_Nguyen_%28Pamputt%29-ba.wav.mp3", "ogg_url": 
"https://upload.wikimedia.org/wikipedia/commons/transcoded/f/fd/LL-Q9199_%28vie%29-Jessica_Nguyen_%28Pamputt%29-ba.wav/LL-Q9199_%28vie%29-Jessica_Nguyen_%28Pamputt%29-ba.wav.ogg"}], "word": "ba"}

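The message above reports that the tag "Hà-Nội", found under "sounds" in the extracted entry, starts with an uppercase letter but is not in the set of accepted uppercase tags. The following Python sketch illustrates that kind of check on a shortened copy of the entry. It is not wiktextract's actual validation code: the function name `find_unlisted_uppercase_tags` and the small `ALLOWED_UPPERCASE_TAGS` set are assumptions made for illustration, and the real allowlist is considerably larger.

```python
# Hypothetical allowlist for illustration only; the real wiktextract
# uppercase_tags set is much larger and may differ from this one.
ALLOWED_UPPERCASE_TAGS = {"CJK", "Southern", "Vietnam", "Huế"}


def find_unlisted_uppercase_tags(entry, allowed=ALLOWED_UPPERCASE_TAGS):
    """Return (path, tag) pairs for every tag that starts with an
    uppercase letter and is missing from the allowlist."""
    hits = []

    def walk(node, path):
        if isinstance(node, dict):
            for key, value in node.items():
                if key == "tags" and isinstance(value, list):
                    for tag in value:
                        if (isinstance(tag, str) and tag[:1].isupper()
                                and tag not in allowed):
                            hits.append((f"{path}.{key}", tag))
                else:
                    walk(value, f"{path}.{key}")
        elif isinstance(node, list):
            for index, item in enumerate(node):
                walk(item, f"{path}[{index}]")

    walk(entry, "entry")
    return hits


# Shortened copy of the entry shown above (only the fields that carry tags).
entry = {
    "word": "ba",
    "forms": [{"form": "𠀧", "tags": ["CJK"]}],
    "senses": [{"glosses": ["secondborn"], "tags": ["Southern", "Vietnam"]}],
    "sounds": [
        {"ipa": "[ʔɓaː˧˧]", "tags": ["Hà-Nội"]},
        {"ipa": "[ʔɓaː˧˧]", "tags": ["Huế"]},
    ],
}

for path, tag in find_unlisted_uppercase_tags(entry):
    print(f"{path}: unlisted uppercase tag {tag!r}")
```

With this assumed allowlist, only the "Hà-Nội" tag in the first "sounds" item is reported, which matches the single tag named in the debug message.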

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-03-13 from the enwiktionary dump dated 2025-03-02 using wiktextract (f074e77 and 633533e). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.