JSON data structure browser, subpage: senses

Download full JSON mapping

Fields and tags index

[
{
"alt_of":
[
{
"roman": roman seen in ฮ้อน [คำเมือง]; reached 2485 times,
"word": word seen in อุรา [ไทย]; reached 33414 times
}
]
"categories":
[categories seen in กากภาษา [ไทย]; reached 205931 times]
"classifiers":
[
{
"classifier": classifier seen in ใจ [ไทย]; reached 770 times,
"tags":
[tags seen in 容器 [จีนกลาง]; reached 639 times]
}
]
"examples":
[
{
"bold_literal_offsets":
[
[bold_literal_offsets seen in panna [ฟินแลนด์]; reached 16 times]
]
"bold_roman_offsets":
[
[bold_roman_offsets seen in rumis [ฟินแลนด์]; reached 9772 times]
]
"bold_text_offsets":
[
[bold_text_offsets seen in เครน [ไทย]; reached 26246 times]
]
"bold_translation_offsets":
[
[bold_translation_offsets seen in ไฟ [ไทย]; reached 8040 times]
]
"literal_meaning": literal_meaning seen in panna [ฟินแลนด์]; reached 22 times,
"raw_tags":
[raw_tags seen in ingewikkeld [ดัตช์]; reached 170 times]
"ref": ref seen in ศีรษะ [ไทย]; reached 2532 times,
"roman": roman seen in rumis [ฟินแลนด์]; reached 5067 times,
"ruby":
[
[ruby seen in 道理 [ญี่ปุ่น]; reached 2824 times]
]
"sounds":
[
{
"audio": audio seen in map [อังกฤษ]; reached 24 times,
"mp3_url": mp3_url seen in map [อังกฤษ]; reached 24 times,
"ogg_url": ogg_url seen in map [อังกฤษ]; reached 24 times,
"raw_tags":
[raw_tags seen in map [อังกฤษ]; reached 2 times]
"wav_url": wav_url seen in ကၟာဲ [มอญ]; reached 22 times
}
]
"tags":
[tags seen in 當然 [จีน]; reached 3880 times]
"text": text seen in กากภาษา [ไทย]; reached 25821 times,
"translation": translation seen in เงื่อน [ไทย]; reached 7918 times
}
]
"form_of":
[
{
"roman": roman seen in ᨠᩣ᩠ᩁᩁᩬᨯ [คำเมือง]; reached 580973 times,
"word": word seen in ความเตรียมตรม [ไทย]; reached 2262226 times
}
]
"glosses":
[glosses seen in กากภาษา [ไทย]; reached 2727644 times]
"raw_tags":
[raw_tags seen in อุรา [ไทย]; reached 7169 times]
"tags":
[tags seen in แอหนัง [ไทย]; reached 2499007 times]
"topics":
[topics seen in เปรต [ไทย]; reached 5461 times]
}
]

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-02-27 from the thwiktionary dump dated 2026-02-01 using wiktextract (c4ca749 and 59dc20b). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.