Wiktionary data extraction errors and warnings

house/Chinese/noun

Return to 'Debug messages subpage 2567'

house (Chinese noun) house/Chinese/noun: invalid uppercase tag Hong-Kong not in or uppercase_tags: {"etymology_templates": [{"args": {"1": "yue", "2": "en", "3": "house"}, "expansion": "English house", "name": "bor"}], "etymology_text": "From English house.", "head_templates": [{"args": {"1": "zh", "2": "noun"}, "expansion": "house", "name": "head"}], "lang": "Chinese", "lang_code": "zh", "pos": "noun", "senses": [{"categories": ["Cantonese lemmas", "Cantonese nouns", "Cantonese terms borrowed from English", "Cantonese terms derived from English", "Chinese entries with incorrect language header", "Chinese lemmas", "Chinese nouns", "Chinese nouns classified by 間/间", "Chinese terms with IPA pronunciation", "Chinese terms written in foreign scripts", "Hong Kong Cantonese", "Pages with 15 entries", "Pages with entries"], "glosses": ["mansion; large house (Classifier: 間/间 c)"], "links": [["mansion", "mansion"], ["house", "#English"], ["間", "間#Chinese"], ["间", "间#Chinese"]], "raw_glosses": ["(Hong Kong Cantonese) mansion; large house (Classifier: 間/间 c)"], "tags": ["Cantonese", "Hong-Kong"]}], "sounds": [{"tags": ["Cantonese", "Jyutping"], "zh-pron": "hau¹ si²"}, {"tags": ["Cantonese", "Yale"], "zh-pron": "hāu sí"}, {"tags": ["Cantonese", "Pinyin"], "zh-pron": "hau¹ si²"}, {"tags": ["Cantonese", "Guangdong-Romanization"], "zh-pron": "heo¹ xi²"}, {"ipa": "/hɐu̯⁵⁵ siː³⁵/", "tags": ["Cantonese", "Sinological-IPA"]}, {"ipa": "/hɐu̯⁵⁵ siː³⁵/"}], "word": "house"}


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-03-13 from the enwiktionary dump dated 2025-03-02 using wiktextract (f074e77 and 633533e). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.