JSON data structure browser, subpage: sounds

Download full JSON mapping

Fields and tags index

[
{
"audio": audio seen in synthesis [Bahasa Inggeris]; reached 4168 times,
"flac_url": flac_url seen in bahari [Bahasa Swahili]; reached 7 times,
"hangeul": hangeul seen in 외할아버지 [Bahasa Korea]; reached 149 times,
"ipa": ipa seen in synthesis [Bahasa Inggeris]; reached 15222 times,
"mp3_url": mp3_url seen in synthesis [Bahasa Inggeris]; reached 4168 times,
"oga_url": oga_url seen in Czechia [Bahasa Inggeris]; reached 43 times,
"ogg_url": ogg_url seen in synthesis [Bahasa Inggeris]; reached 4126 times,
"opus_url": opus_url seen in ethyl [Bahasa Inggeris]; reached 4 times,
"other": other seen in Google [Bahasa Inggeris]; reached 7025 times,
"raw_tags":
[raw_tags seen in reduction [Bahasa Inggeris]; reached 8317 times]
"rhymes": rhymes seen in Romania [Bahasa Inggeris]; reached 8229 times,
"roman": roman seen in 외할아버지 [Bahasa Korea]; reached 529 times,
"tags":
[tags seen in Romania [Bahasa Inggeris]; reached 2558 times]
"wav_url": wav_url seen in synthesis [Bahasa Inggeris]; reached 1152 times
}
]

This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2026-06-21 from the mswiktionary dump dated 2026-06-01 using wiktextract (ade7ec3 and 7f4db16). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.