Wiktionary data extraction errors and warnings

四/Korean/character

Return to 'Debug messages subpage 1957'

四 (Korean character) 四/Korean/character: invalid uppercase tag SK-Standard not in or uppercase_tags: {"derived": [{"alt": "四角", "roman": "sagak", "word": "사각"}, {"alt": "四季", "roman": "sagye", "word": "사계"}, {"alt": "四球", "roman": "sagu", "word": "사구"}, {"alt": "四方", "roman": "sabang", "word": "사방"}, {"alt": "四書", "roman": "saseo", "word": "사서"}, {"alt": "四時", "roman": "sasi", "word": "사시"}, {"alt": "四十", "roman": "sasip", "word": "사십"}, {"alt": "四月", "roman": "sawol", "word": "사월"}, {"alt": "四肢", "roman": "saji", "word": "사지"}, {"alt": "四川", "roman": "Sacheon", "word": "사천"}, {"alt": "四寸", "roman": "sachon", "word": "사촌"}, {"alt": "十四", "roman": "sipsa", "word": "십사"}, {"alt": "四角形", "roman": "sagakhyeong", "word": "사각형"}, {"alt": "四季節", "roman": "sagyejeol", "word": "사계절"}, {"alt": "四天王", "roman": "sacheonwang", "word": "사천왕"}, {"alt": "四不像", "roman": "sabulsang", "word": "사불상"}, {"alt": "正四品", "roman": "jeongsapum", "word": "정사품"}, {"alt": "四捨五入", "roman": "sasaoip", "word": "사사오입"}, {"alt": "正四角形", "roman": "jeongsagakhyeong", "word": "정사각형"}, {"alt": "正四面體", "roman": "jeongsamyeonche", "word": "정사면체"}, {"alt": "再三再四", "roman": "jaesamjaesa", "word": "재삼재사"}], "etymology_templates": [{"args": {"1": "ko", "2": "ltc", "3": "-", "sort": "사"}, "expansion": "Middle Chinese", "name": "der"}, {"args": {"1": "MC", "2": "Middle Chinese"}, "expansion": "MC", "name": "abbr"}], "etymology_text": "From Middle Chinese 四 (MC siɪᴴ).\nHistorical readings\n* Recorded as Middle Korean ᄉᆞᆼ〮 (Yale: só) in Dongguk Jeongun (東國正韻 / 동국정운), 1448.\n* Recorded as Middle Korean ᄉᆞ ( so)^訓 (Yale: so) in Hunmong Jahoe (訓蒙字會 / 훈몽자회), 1527.\n* Recorded as Middle Korean ᄉᆞ ( so)^訓 (Yale: so) in Gwangju Cheonjamun (光州千字文 / 광주천자문), 1575.\n* Recorded as Middle Korean ᄉᆞ ( so)^訓 (Yale: so) in Sinjeung Yuhap (新增類合 / 신증유합), 1576.", "forms": [{"form": "넉 사", "roman": "neok sa", "tags": ["eumhun"]}], "head_templates": [{"args": {"1": "넉", "2": "사"}, "expansion": "四 (eumhun 넉 사 (neok sa))", "name": "ko-hanja"}], "lang": "Korean", "lang_code": "ko", "pos": "character", "senses": [{"categories": ["Korean entries with incorrect language header", "Korean hanja", "Korean hanja forms", "Korean lemmas", "Korean links with redundant wikilinks", "Korean numeral symbols", "Korean terms derived from Middle Chinese", "Korean terms with long vowels in the first syllable", "Korean terms with non-redundant non-automated sortkeys", "Middle Chinese terms with non-redundant manual transliterations", "Middle Korean hanja", "Pages with 6 entries", "Pages with entries", "Pages with raw sortkeys", "ko:Four"], "form_of": [{"extra": "four", "word": "사"}], "glosses": ["hanja form of 사 (“four”)"], "links": [["hanja", "hanja#English"], ["사", "사#Korean"], ["four", "four"]], "raw_tags": ["Hanja"], "tags": ["form-of", "hanja"]}], "sounds": [{"ipa": "[sʰa̠(ː)]", "tags": ["SK-Standard", "Seoul"]}, {"hangeul": "사(ː)"}, {"other": "[사(ː)]"}], "word": "四"}


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2024-11-06 from the enwiktionary dump dated 2024-10-02 using wiktextract (fbeafe8 and 7f03c9b). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.