Wiktionary data extraction errors and warnings

sweet/Chinese/adj


sweet (Chinese adj) sweet/Chinese/adj: invalid uppercase tag Hong-Kong not in or uppercase_tags: {"derived": [{"word": "sweet sweet"}], "etymology_templates": [{"args": {"1": "yue", "2": "en", "3": "sweet"}, "expansion": "English sweet", "name": "bor"}], "etymology_text": "From English sweet.", "head_templates": [{"args": {"1": "zh", "2": "adjective"}, "expansion": "sweet", "name": "head"}], "lang": "Chinese", "lang_code": "zh", "pos": "adj", "senses": [{"categories": ["Cantonese adjectives", "Cantonese lemmas", "Cantonese terms borrowed from English", "Cantonese terms derived from English", "Chinese adjectives", "Chinese entries with incorrect language header", "Chinese lemmas", "Chinese terms with IPA pronunciation", "Chinese terms written in foreign scripts", "Hong Kong Cantonese", "Pages with 5 entries", "Pages with entries"], "glosses": ["romantic"], "links": [["romantic", "romantic"]], "raw_glosses": ["(Hong Kong Cantonese) romantic"], "tags": ["Cantonese", "Hong-Kong"]}], "sounds": [{"tags": ["Cantonese", "Jyutping"], "zh-pron": "si⁴ wit¹"}, {"tags": ["Cantonese", "Jyutping"], "zh-pron": "si⁶ wit¹"}, {"tags": ["Cantonese", "Yale"], "zh-pron": "sìh wīt"}, {"tags": ["Cantonese", "Yale"], "zh-pron": "sih wīt"}, {"tags": ["Cantonese", "Pinyin"], "zh-pron": "si⁴ wit⁷"}, {"tags": ["Cantonese", "Pinyin"], "zh-pron": "si⁶ wit⁷"}, {"tags": ["Cantonese", "Guangdong-Romanization"], "zh-pron": "xi⁴ wid¹"}, {"tags": ["Cantonese", "Guangdong-Romanization"], "zh-pron": "xi⁶ wid¹"}, {"ipa": "/siː²¹ wiːt̚⁵/", "tags": ["Cantonese", "Sinological-IPA"]}, {"ipa": "/siː²² wiːt̚⁵/", "tags": ["Cantonese", "Sinological-IPA"]}, {"ipa": "/siː²¹ wiːt̚⁵/"}, {"ipa": "/siː²² wiːt̚⁵/"}], "synonyms": [{"roman": "si⁴ wit¹", "word": "時weet/时weet"}, {"roman": "si4 wit1", "word": "時weet"}, {"roman": "si4 wit1", "word": "时weet"}, {"roman": "si6 wit1", "word": "是weet"}, {"roman": "si6 wit1", "word": "士weet"}], "word": "sweet"}
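
The message above reports that the sense-level tag "Hong-Kong" starts with an uppercase letter but is not in the extractor's set of accepted uppercase tags. As a rough illustration of that kind of check, here is a minimal Python sketch. It is not the wiktextract implementation: the function name find_invalid_uppercase_tags and the contents of ALLOWED_UPPERCASE_TAGS are assumptions made for this example, and the entry is an abbreviated copy of the record shown above.

```python
import json

# Hypothetical allow-list for illustration only; the real set used by
# wiktextract is not shown on this page.
ALLOWED_UPPERCASE_TAGS = {
    "Cantonese",
    "Jyutping",
    "Yale",
    "Pinyin",
    "Guangdong-Romanization",
    "Sinological-IPA",
}


def find_invalid_uppercase_tags(entry, allowed=ALLOWED_UPPERCASE_TAGS):
    """Return the sorted uppercase-initial tags in `entry` that are not in `allowed`."""
    invalid = set()

    def visit(node):
        # Walk the nested dict/list structure and inspect every "tags" list.
        if isinstance(node, dict):
            for key, value in node.items():
                if key == "tags" and isinstance(value, list):
                    for tag in value:
                        if (
                            isinstance(tag, str)
                            and tag[:1].isupper()
                            and tag not in allowed
                        ):
                            invalid.add(tag)
                else:
                    visit(value)
        elif isinstance(node, list):
            for item in node:
                visit(item)

    visit(entry)
    return sorted(invalid)


# Abbreviated copy of the entry from the debug message above.
entry = json.loads(
    '{"word": "sweet", "lang": "Chinese", "pos": "adj",'
    ' "senses": [{"glosses": ["romantic"], "tags": ["Cantonese", "Hong-Kong"]}],'
    ' "sounds": [{"tags": ["Cantonese", "Jyutping"]}]}'
)

print(find_invalid_uppercase_tags(entry))  # prints ['Hong-Kong']
```

With this assumed allow-list, only "Hong-Kong" is flagged; the pronunciation tags such as "Jyutping" pass because they are listed.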


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2024-11-06 from the enwiktionary dump dated 2024-10-02 using wiktextract (fbeafe8 and 7f03c9b). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.