Wiktionary data extraction errors and warnings

bi/Vietnamese/noun


bi (Vietnamese noun) bi/Vietnamese/noun: invalid uppercase tag Hà-Nội not in or uppercase_tags: {"categories": ["Pages with 64 entries", "Pages with entries", "Vietnamese entries with incorrect language header", "Vietnamese lemmas", "Vietnamese non-lemma forms", "Vietnamese nouns", "Vietnamese nouns classified by cái", "Vietnamese romanizations", "Vietnamese terms borrowed from French", "Vietnamese terms derived from French", "Vietnamese terms with IPA pronunciation", "vi:Toys"], "etymology_number": 2, "etymology_templates": [{"args": {"1": "vi", "2": "fr", "3": "bille", "4": "", "5": "tree log, trunk"}, "expansion": "French bille (“tree log, trunk”)", "name": "bor"}], "etymology_text": "Probably from French bille (“tree log, trunk”).", "forms": [{"form": "cái", "tags": ["classifier"]}], "head_templates": [{"args": {"1": "vi", "2": "noun", "3": "", "4": "", "head": "", "tr": ""}, "expansion": "bi", "name": "head"}, {"args": {"cls": "cái"}, "expansion": "(classifier cái) bi", "name": "vi-noun"}], "lang": "Vietnamese", "lang_code": "vi", "pos": "noun", "senses": [{"glosses": ["big concrete sewer"], "links": [["concrete", "concrete"], ["sewer", "sewer"]]}, {"glosses": ["a type of water tank made from concrete"], "links": [["water tank", "water tank"], ["concrete", "concrete"]], "raw_glosses": ["(by extension) a type of water tank made from concrete"], "tags": ["broadly"]}], "sounds": [{"ipa": "[ʔɓi˧˧]", "tags": ["Hà-Nội"]}, {"ipa": "[ʔɓɪj˧˧]", "tags": ["Huế"]}, {"ipa": "[ʔɓɪj˧˧]", "note": "Saigon"}], "word": "bi"}

bi (Vietnamese noun) bi/Vietnamese/noun: invalid uppercase tag Hà-Nội not in or uppercase_tags: {"categories": ["Pages with 64 entries", "Pages with entries", "Vietnamese entries with incorrect language header", "Vietnamese lemmas", "Vietnamese non-lemma forms", "Vietnamese nouns", "Vietnamese nouns classified by cục", "Vietnamese nouns classified by hòn", "Vietnamese nouns classified by viên", "Vietnamese romanizations", "Vietnamese terms borrowed from French", "Vietnamese terms derived from French", "Vietnamese terms with IPA pronunciation", "vi:Toys"], "etymology_number": 1, "etymology_templates": [{"args": {"1": "vi", "2": "fr", "3": "bille", "4": "", "5": "marble, ball"}, "expansion": "French bille (“marble, ball”)", "name": "bor"}], "etymology_text": "Borrowed from French bille (“marble, ball”).", "forms": [{"form": "cục", "tags": ["classifier"]}, {"form": "hòn", "tags": ["classifier"]}, {"form": "viên", "tags": ["classifier"]}], "head_templates": [{"args": {"1": "vi", "2": "noun", "3": "", "4": "", "head": "", "tr": ""}, "expansion": "bi", "name": "head"}, {"args": {"cls": "cục, hòn, viên"}, "expansion": "(classifier cục, hòn, viên) bi", "name": "vi-noun"}], "lang": "Vietnamese", "lang_code": "vi", "pos": "noun", "related": [{"word": "bi cái"}, {"word": "bút bi"}], "senses": [{"categories": ["Vietnamese terms with usage examples"], "examples": [{"english": "to shoot marbles", "text": "bắn bi", "type": "example"}], "glosses": ["a marble (spherical ball)"], "links": [["marble", "marble"]]}, {"categories": ["Vietnamese terms with usage examples", "vi:Billiards", "vi:Snooker"], "examples": [{"english": "a cue ball", "text": "bi cái", "type": "example"}], "glosses": ["a ball"], "links": [["billiards", "billiards"], ["snooker", "snooker#Noun"], ["ball", "ball"]], "raw_glosses": ["(billiards, snooker) a ball"], "topics": ["ball-games", "billiards", "games", "hobbies", "lifestyle", "snooker", "sports"]}, {"categories": ["Vietnamese slang", "Vietnamese terms with usage examples"], "examples": [{"text": "Á! Dập bi tao rồi!\nOw! My bawlls popped!", "type": "example"}], "glosses": ["a ball (testicle)"], "links": [["ball", "ball"]], "raw_glosses": ["(slang) a ball (testicle)"], "tags": ["slang"]}], "sounds": [{"ipa": "[ʔɓi˧˧]", "tags": ["Hà-Nội"]}, {"ipa": "[ʔɓɪj˧˧]", "tags": ["Huế"]}, {"ipa": "[ʔɓɪj˧˧]", "note": "Saigon"}], "word": "bi"}
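
In both records above, the flagged tag appears in the "sounds" list, where Hà-Nội is stored under "tags" while Saigon is stored under "note". The following is a minimal sketch of how a consumer of this data might locate such tags in a record. It is not wiktextract's own validation code: the function name find_invalid_uppercase_tags and the allowlist ASSUMED_UPPERCASE_TAGS are hypothetical, and the record is trimmed to the fields relevant here.

```python
import json

# Hypothetical allowlist for illustration only; the real set of valid
# uppercase tags used by wiktextract is not reproduced here.
ASSUMED_UPPERCASE_TAGS = {"Huế", "Saigon"}


def find_invalid_uppercase_tags(entry, allowed=ASSUMED_UPPERCASE_TAGS):
    """Return (field, tag) pairs for capitalized tags missing from `allowed`."""
    problems = []
    for field in ("sounds", "senses", "forms"):
        for item in entry.get(field, []):
            # Only the "tags" key is inspected; free-text "note" values are skipped.
            for tag in item.get("tags", []):
                if tag[:1].isupper() and tag not in allowed:
                    problems.append((field, tag))
    return problems


# Trimmed copy of the record above, keeping only the fields checked here.
record = json.loads('''
{"word": "bi", "lang": "Vietnamese", "pos": "noun",
 "forms": [{"form": "cái", "tags": ["classifier"]}],
 "sounds": [{"ipa": "[ʔɓi˧˧]", "tags": ["Hà-Nội"]},
            {"ipa": "[ʔɓɪj˧˧]", "tags": ["Huế"]},
            {"ipa": "[ʔɓɪj˧˧]", "note": "Saigon"}]}
''')

for field, tag in find_invalid_uppercase_tags(record):
    print(f"{record['word']}/{record['lang']}/{record['pos']}: "
          f"uppercase tag {tag} in '{field}' is not in the assumed allowlist")
```

Under these assumptions only the Hà-Nội entry is reported: lowercase tags such as classifier are ignored, Huế is in the assumed allowlist, and the Saigon value is never examined because it sits under "note" rather than "tags".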


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2024-12-21 from the enwiktionary dump dated 2024-12-04 using wiktextract (d8cb2f3 and 4e554ae). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.