JSON data structure browser, subpage: senses

Download full JSON mapping

Fields and tags index

[
{
"alt_of":
[
{
"roman": roman seen in чернило [Tiếng Anh]; reached 260 times,
"word": word seen in hòa hợp [Tiếng Việt]; reached 1433 times
}
]
"categories":
[categories seen in giởi [Tiếng Việt]; reached 71251 times]
"classifiers":
[
{
"classifier": classifier seen in 有限責任公司 [Tiếng Trung Quốc]; reached 134 times,
"raw_tags":
[raw_tags seen in 蠓罩 [Tiếng Trung Quốc]; reached 15 times]
"tags":
[tags seen in 有限責任公司 [Tiếng Trung Quốc]; reached 201 times]
}
]
"examples":
[
{
"bold_literal_offsets":
[
[bold_literal_offsets seen in в [Tiếng Nga]; reached 26 times]
]
"bold_roman_offsets":
[
[bold_roman_offsets seen in lầu xanh [Tiếng Việt]; reached 22344 times]
]
"bold_text_offsets":
[
[bold_text_offsets seen in thói tục [Tiếng Việt]; reached 350146 times]
]
"bold_translation_offsets":
[
[bold_translation_offsets seen in [Tiếng Việt]; reached 5900 times]
]
"literal_meaning": literal_meaning seen in в [Tiếng Nga]; reached 17 times,
"raw_tags":
[raw_tags seen in đầu gối [Tiếng Việt]; reached 64 times]
"ref": ref seen in vàng xuộm [Tiếng Việt]; reached 1078 times,
"roman": roman seen in lầu xanh [Tiếng Việt]; reached 10538 times,
"ruby":
[
[ruby seen in [Tiếng Việt]; reached 952 times]
]
"tags":
[tags seen in gái [Tiếng Việt]; reached 504 times]
"text": text seen in thói tục [Tiếng Việt]; reached 173076 times,
"translation": translation seen in [Tiếng Việt]; reached 118172 times
}
]
"form_of":
[
{
"roman": roman seen in IT [Tiếng Anh]; reached 2094 times,
"word": word seen in aankondigingen [Tiếng Việt]; reached 40588 times
}
]
"glosses":
[glosses seen in thói tục [Tiếng Việt]; reached 505744 times]
"raw_tags":
[raw_tags seen in mịch [Tiếng Việt]; reached 3462 times]
"tags":
[tags seen in giởi [Tiếng Việt]; reached 52823 times]
"topics":
[topics seen in đầu thú [Tiếng Việt]; reached 1969 times]
}
]

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-03-02 from the viwiktionary dump dated 2026-02-01 using wiktextract (d146717 and 59dc20b). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.