"phoneme" meaning in English

See phoneme in All languages combined, or Wiktionary

Noun

IPA: /ˈfəʊ.niːm/ [Received-Pronunciation], /ˈfoʊ.nim/ [General-American] Audio: En-us-phoneme.ogg [US] Forms: phonemes [plural]
Rhymes: -əʊniːm Etymology: From Ancient Greek φώνημα (phṓnēma, “sound”), from φωνέω (phōnéō, “to sound”), from φωνή (phōnḗ, “sound”). By surface analysis, phone (“speech sound”) + -eme (“unit”). Etymology templates: {{root|en|ine-pro|*bʰeh₂-|id=speak}}, {{der|en|grc|φώνημα||sound}} Ancient Greek φώνημα (phṓnēma, “sound”), {{m|grc|φωνέω||to sound}} φωνέω (phōnéō, “to sound”), {{m|grc|φωνή||sound}} φωνή (phōnḗ, “sound”), {{surf|en|phone<t:speech sound>|-eme<t:unit>}} By surface analysis, phone (“speech sound”) + -eme (“unit”) Head templates: {{en-noun}} phoneme (plural phonemes)
  1. An indivisible unit of sound in a given language. A phoneme is an abstraction of the physical speech sounds (phones) and may encompass several different phones. Categories (topical): Phonology Derived forms: archiphoneme, diaphoneme, phonemic, phonemical, phonemically, phonemicist, phonemicity, phonemics, subphoneme Translations (indivisible unit of sound): fonemë [feminine] (Albanian), فُونِيم (fōnēm) [masculine] (Arabic), صَوْت لُغَوِيّ (ṣawt luḡawiyy) [masculine] (Arabic), հնչույթ (hnčʿuytʿ) (Armenian), fonema [masculine] (Asturian), фане́ма (fanjéma) [feminine] (Belarusian), фанэ́ма (fanéma) [feminine] (Belarusian), soniad (Breton), фоне́ма (fonéma) [feminine] (Bulgarian), fonema [masculine] (Catalan), ponema (Cebuano), 音素 (im-sò͘) [Hokkien] (Chinese), 音素 (english: jam¹ sou³) (Chinese Cantonese), 音位 (alt: jam¹ wai⁶⁻²) (Chinese Cantonese), 音素 (yīnsù) (Chinese Mandarin), 音位 (yīnwèi) (Chinese Mandarin), foném [masculine] (Czech), fonem [neuter] (Danish), foneem [neuter] (Dutch), fonemo (Esperanto), häälik (Estonian), foneem (Estonian), foneemi (Finnish), phonème [masculine] (French), fonema [masculine] (Galician), ბგერა (bgera) (Georgian), Phonem [neuter] (German), φώνημα (fónima) [neuter] (Greek), ધ્વનિઘટક (dhvanighṭak) (Gujarati), ध्वनिग्राम (dhvanigrām) [masculine] (Hindi), ध्वनि (dhvani) [feminine] (Hindi), वाच् (vāc) [feminine] (Hindi), व्योम (vyom) [masculine] (Hindi), fonéma (Hungarian), hljóðan (Icelandic), fonemo (Ido), phonema (Interlingua), fonema [masculine] (Italian), 音素 (onso) (alt: おんそ) (Japanese), ಧ್ವನಿಮಾ (dhvanimā) (Kannada), សទ្ទតា (sattĕəʼtaa) (Khmer), សទ្ទភូត (sattĕəʼ phuut) (Khmer), 음소 (eumso) (alt: 音素) (Korean), 표음 (pyoeum) (alt: 表音) (Korean), 낱소리 (natsori) (Korean), phōnēma [feminine] (Latin), fonēma [feminine] (Latvian), fonema [feminine] (Lithuanian), fonem [masculine] (Lower Sorbian), fonem (Malay), warna bunyi (Malay), myn-heean (Manx), ध्वनिघटक (dhvanighṭak) (Marathi), авиалбар (avialbar) [Cyrillic] (Mongolian), фонем (fonem) [Cyrillic] (Mongolian), ᠠᠪᠢᠶᠠᠯᠪᠤᠷᠢ (abiyalburi) [Mongolian] (Mongolian), ᠹᠣᠨᠧᠮ (fonēm) [Mongolian] (Mongolian), vac (Northern Kurdish), fonèma [masculine] (Occitan), واج (vâj) (Persian), fonem [masculine] (Polish), fonema [masculine] (Portuguese), fonem [neuter] (Romanian), фоне́ма (fonéma) [feminine] (Russian), фо̀не̄м [Cyrillic, masculine] (Serbo-Croatian), фоне́ма [Cyrillic, feminine] (Serbo-Croatian), fònēm [Roman, masculine] (Serbo-Croatian), fonéma [Roman, feminine] (Serbo-Croatian), funima [masculine] (Sicilian), وايوم (viome) [feminine] (Sindhi), fonéma [feminine] (Slovak), fonem [masculine] (Slovene), glásnik [masculine] (Slovene), fonema [masculine] (Spanish), fonimu (Swahili), fonem [neuter] (Swedish), multinig (Tagalog), ஒலியன் (oliyaṉ) (Tamil), หน่วยเสียง (nùai-sǐiang) (Thai), མ་སྒྲ (ma sgra) (Tibetan), sesbirim (Turkish), фоне́ма (fonéma) [feminine] (Ukrainian), fonem [masculine] (Upper Sorbian), âm vị (alt: 音位) (Vietnamese), oyon (Walloon), פֿאָנעם (fonem) [masculine] (Yiddish)

Inflected forms

Download JSON data for phoneme meaning in English (14.7kB)

{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "ine-pro",
        "3": "*bʰeh₂-",
        "id": "speak"
      },
      "expansion": "",
      "name": "root"
    },
    {
      "args": {
        "1": "en",
        "2": "grc",
        "3": "φώνημα",
        "4": "",
        "5": "sound"
      },
      "expansion": "Ancient Greek φώνημα (phṓnēma, “sound”)",
      "name": "der"
    },
    {
      "args": {
        "1": "grc",
        "2": "φωνέω",
        "3": "",
        "4": "to sound"
      },
      "expansion": "φωνέω (phōnéō, “to sound”)",
      "name": "m"
    },
    {
      "args": {
        "1": "grc",
        "2": "φωνή",
        "3": "",
        "4": "sound"
      },
      "expansion": "φωνή (phōnḗ, “sound”)",
      "name": "m"
    },
    {
      "args": {
        "1": "en",
        "2": "phone<t:speech sound>",
        "3": "-eme<t:unit>"
      },
      "expansion": "By surface analysis, phone (“speech sound”) + -eme (“unit”)",
      "name": "surf"
    }
  ],
  "etymology_text": "From Ancient Greek φώνημα (phṓnēma, “sound”), from φωνέω (phōnéō, “to sound”), from φωνή (phōnḗ, “sound”). By surface analysis, phone (“speech sound”) + -eme (“unit”).",
  "forms": [
    {
      "form": "phonemes",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "phoneme (plural phonemes)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Arabic terms with non-redundant manual transliterations",
          "parents": [
            "Terms with non-redundant manual transliterations",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Cantonese terms with redundant transliterations",
          "parents": [
            "Terms with redundant transliterations",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English entries with topic categories using raw markup",
          "parents": [
            "Entries with topic categories using raw markup",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English terms suffixed with -eme",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Mandarin terms with redundant transliterations",
          "parents": [
            "Terms with redundant transliterations",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Phonology",
          "orig": "en:Phonology",
          "parents": [
            "Linguistics",
            "Language",
            "Social sciences",
            "Communication",
            "Sciences",
            "Society",
            "All topics",
            "Fundamental"
          ],
          "source": "w"
        }
      ],
      "derived": [
        {
          "word": "archiphoneme"
        },
        {
          "word": "diaphoneme"
        },
        {
          "word": "phonemic"
        },
        {
          "word": "phonemical"
        },
        {
          "word": "phonemically"
        },
        {
          "word": "phonemicist"
        },
        {
          "word": "phonemicity"
        },
        {
          "word": "phonemics"
        },
        {
          "word": "subphoneme"
        }
      ],
      "examples": [
        {
          "ref": "1990, Jarmo Lainio, “Sweden Finnish — development or deterioration?”, in Durk Gorter, editor, Fourth International Conference on Minority Languages: Western and Eastern European papers, Multilingual Matters, page 31",
          "text": "It is crucial for the phoneme structure of Finnish — traditionally /d/ has not been included in the Finnish phonotax, but it fulfils the criteria of a phoneme (Karlsson, 1983: 66-7).",
          "type": "quotation"
        }
      ],
      "glosses": [
        "An indivisible unit of sound in a given language. A phoneme is an abstraction of the physical speech sounds (phones) and may encompass several different phones."
      ],
      "id": "en-phoneme-en-noun-8xumvNeT",
      "links": [
        [
          "unit",
          "unit"
        ],
        [
          "sound",
          "sound"
        ],
        [
          "phone",
          "phone"
        ]
      ],
      "related": [
        {
          "word": "allophone"
        },
        {
          "word": "allophonic"
        },
        {
          "word": "allophonical"
        },
        {
          "word": "allophonically"
        },
        {
          "word": "allophonics"
        },
        {
          "word": "diaphone"
        },
        {
          "word": "diaphonic"
        },
        {
          "word": "diaphonical"
        },
        {
          "word": "diaphonically"
        },
        {
          "word": "diaphonics"
        },
        {
          "word": "diaphonologic"
        },
        {
          "word": "diaphonological"
        },
        {
          "word": "diaphonologically"
        },
        {
          "word": "diaphonology"
        },
        {
          "word": "phone"
        },
        {
          "word": "phonetic"
        },
        {
          "word": "phonetical"
        },
        {
          "word": "phonetically"
        },
        {
          "word": "phonetics"
        },
        {
          "word": "phonic"
        },
        {
          "word": "phonical"
        },
        {
          "word": "phonically"
        },
        {
          "word": "phonics"
        },
        {
          "word": "phonologic"
        },
        {
          "word": "phonological"
        },
        {
          "word": "phonologically"
        },
        {
          "word": "phonologist"
        },
        {
          "word": "phonology"
        },
        {
          "word": "chereme"
        },
        {
          "word": "chroneme"
        },
        {
          "word": "grammeme"
        },
        {
          "word": "grapheme"
        },
        {
          "word": "lemma"
        },
        {
          "word": "lexeme"
        },
        {
          "word": "listeme"
        },
        {
          "word": "morpheme"
        },
        {
          "word": "sememe"
        },
        {
          "word": "toneme"
        }
      ],
      "translations": [
        {
          "code": "sq",
          "lang": "Albanian",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "fonemë"
        },
        {
          "code": "ar",
          "lang": "Arabic",
          "roman": "fōnēm",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "فُونِيم"
        },
        {
          "code": "ar",
          "lang": "Arabic",
          "roman": "ṣawt luḡawiyy",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "صَوْت لُغَوِيّ"
        },
        {
          "code": "hy",
          "lang": "Armenian",
          "roman": "hnčʿuytʿ",
          "sense": "indivisible unit of sound",
          "word": "հնչույթ"
        },
        {
          "code": "ast",
          "lang": "Asturian",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonema"
        },
        {
          "code": "be",
          "lang": "Belarusian",
          "roman": "fanjéma",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "фане́ма"
        },
        {
          "code": "be",
          "lang": "Belarusian",
          "roman": "fanéma",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "фанэ́ма"
        },
        {
          "code": "br",
          "lang": "Breton",
          "sense": "indivisible unit of sound",
          "word": "soniad"
        },
        {
          "code": "bg",
          "lang": "Bulgarian",
          "roman": "fonéma",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "фоне́ма"
        },
        {
          "code": "ca",
          "lang": "Catalan",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonema"
        },
        {
          "code": "ceb",
          "lang": "Cebuano",
          "sense": "indivisible unit of sound",
          "word": "ponema"
        },
        {
          "code": "yue",
          "english": "jam¹ sou³",
          "lang": "Chinese Cantonese",
          "sense": "indivisible unit of sound",
          "word": "音素"
        },
        {
          "alt": "jam¹ wai⁶⁻²",
          "code": "yue",
          "lang": "Chinese Cantonese",
          "sense": "indivisible unit of sound",
          "word": "音位"
        },
        {
          "code": "nan-hbl",
          "lang": "Chinese",
          "roman": "im-sò͘",
          "sense": "indivisible unit of sound",
          "tags": [
            "Hokkien"
          ],
          "word": "音素"
        },
        {
          "code": "cmn",
          "lang": "Chinese Mandarin",
          "roman": "yīnsù",
          "sense": "indivisible unit of sound",
          "word": "音素"
        },
        {
          "code": "cmn",
          "lang": "Chinese Mandarin",
          "roman": "yīnwèi",
          "sense": "indivisible unit of sound",
          "word": "音位"
        },
        {
          "code": "cs",
          "lang": "Czech",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "foném"
        },
        {
          "code": "da",
          "lang": "Danish",
          "sense": "indivisible unit of sound",
          "tags": [
            "neuter"
          ],
          "word": "fonem"
        },
        {
          "code": "nl",
          "lang": "Dutch",
          "sense": "indivisible unit of sound",
          "tags": [
            "neuter"
          ],
          "word": "foneem"
        },
        {
          "code": "eo",
          "lang": "Esperanto",
          "sense": "indivisible unit of sound",
          "word": "fonemo"
        },
        {
          "code": "et",
          "lang": "Estonian",
          "sense": "indivisible unit of sound",
          "word": "häälik"
        },
        {
          "code": "et",
          "lang": "Estonian",
          "sense": "indivisible unit of sound",
          "word": "foneem"
        },
        {
          "code": "fi",
          "lang": "Finnish",
          "sense": "indivisible unit of sound",
          "word": "foneemi"
        },
        {
          "code": "fr",
          "lang": "French",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "phonème"
        },
        {
          "code": "gl",
          "lang": "Galician",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonema"
        },
        {
          "code": "ka",
          "lang": "Georgian",
          "roman": "bgera",
          "sense": "indivisible unit of sound",
          "word": "ბგერა"
        },
        {
          "code": "de",
          "lang": "German",
          "sense": "indivisible unit of sound",
          "tags": [
            "neuter"
          ],
          "word": "Phonem"
        },
        {
          "code": "el",
          "lang": "Greek",
          "roman": "fónima",
          "sense": "indivisible unit of sound",
          "tags": [
            "neuter"
          ],
          "word": "φώνημα"
        },
        {
          "code": "gu",
          "lang": "Gujarati",
          "roman": "dhvanighṭak",
          "sense": "indivisible unit of sound",
          "word": "ધ્વનિઘટક"
        },
        {
          "code": "hi",
          "lang": "Hindi",
          "roman": "dhvanigrām",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "ध्वनिग्राम"
        },
        {
          "code": "hi",
          "lang": "Hindi",
          "roman": "dhvani",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "ध्वनि"
        },
        {
          "code": "hi",
          "lang": "Hindi",
          "roman": "vāc",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "वाच्"
        },
        {
          "code": "hi",
          "lang": "Hindi",
          "roman": "vyom",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "व्योम"
        },
        {
          "code": "hu",
          "lang": "Hungarian",
          "sense": "indivisible unit of sound",
          "word": "fonéma"
        },
        {
          "code": "is",
          "lang": "Icelandic",
          "sense": "indivisible unit of sound",
          "word": "hljóðan"
        },
        {
          "code": "io",
          "lang": "Ido",
          "sense": "indivisible unit of sound",
          "word": "fonemo"
        },
        {
          "code": "ia",
          "lang": "Interlingua",
          "sense": "indivisible unit of sound",
          "word": "phonema"
        },
        {
          "code": "it",
          "lang": "Italian",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonema"
        },
        {
          "alt": "おんそ",
          "code": "ja",
          "lang": "Japanese",
          "roman": "onso",
          "sense": "indivisible unit of sound",
          "word": "音素"
        },
        {
          "code": "kn",
          "lang": "Kannada",
          "roman": "dhvanimā",
          "sense": "indivisible unit of sound",
          "word": "ಧ್ವನಿಮಾ"
        },
        {
          "code": "km",
          "lang": "Khmer",
          "roman": "sattĕəʼtaa",
          "sense": "indivisible unit of sound",
          "word": "សទ្ទតា"
        },
        {
          "code": "km",
          "lang": "Khmer",
          "roman": "sattĕəʼ phuut",
          "sense": "indivisible unit of sound",
          "word": "សទ្ទភូត"
        },
        {
          "alt": "音素",
          "code": "ko",
          "lang": "Korean",
          "roman": "eumso",
          "sense": "indivisible unit of sound",
          "word": "음소"
        },
        {
          "alt": "表音",
          "code": "ko",
          "lang": "Korean",
          "roman": "pyoeum",
          "sense": "indivisible unit of sound",
          "word": "표음"
        },
        {
          "code": "ko",
          "lang": "Korean",
          "roman": "natsori",
          "sense": "indivisible unit of sound",
          "word": "낱소리"
        },
        {
          "code": "kmr",
          "lang": "Northern Kurdish",
          "sense": "indivisible unit of sound",
          "word": "vac"
        },
        {
          "code": "la",
          "lang": "Latin",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "phōnēma"
        },
        {
          "code": "lv",
          "lang": "Latvian",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "fonēma"
        },
        {
          "code": "lt",
          "lang": "Lithuanian",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "fonema"
        },
        {
          "code": "ms",
          "lang": "Malay",
          "sense": "indivisible unit of sound",
          "word": "fonem"
        },
        {
          "code": "ms",
          "lang": "Malay",
          "sense": "indivisible unit of sound",
          "word": "warna bunyi"
        },
        {
          "code": "gv",
          "lang": "Manx",
          "sense": "indivisible unit of sound",
          "word": "myn-heean"
        },
        {
          "code": "mr",
          "lang": "Marathi",
          "roman": "dhvanighṭak",
          "sense": "indivisible unit of sound",
          "word": "ध्वनिघटक"
        },
        {
          "code": "mn",
          "lang": "Mongolian",
          "roman": "avialbar",
          "sense": "indivisible unit of sound",
          "tags": [
            "Cyrillic"
          ],
          "word": "авиалбар"
        },
        {
          "code": "mn",
          "lang": "Mongolian",
          "roman": "fonem",
          "sense": "indivisible unit of sound",
          "tags": [
            "Cyrillic"
          ],
          "word": "фонем"
        },
        {
          "code": "mn",
          "lang": "Mongolian",
          "roman": "abiyalburi",
          "sense": "indivisible unit of sound",
          "tags": [
            "Mongolian"
          ],
          "word": "ᠠᠪᠢᠶᠠᠯᠪᠤᠷᠢ"
        },
        {
          "code": "mn",
          "lang": "Mongolian",
          "roman": "fonēm",
          "sense": "indivisible unit of sound",
          "tags": [
            "Mongolian"
          ],
          "word": "ᠹᠣᠨᠧᠮ"
        },
        {
          "code": "oc",
          "lang": "Occitan",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonèma"
        },
        {
          "code": "fa",
          "lang": "Persian",
          "roman": "vâj",
          "sense": "indivisible unit of sound",
          "word": "واج"
        },
        {
          "code": "pl",
          "lang": "Polish",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonem"
        },
        {
          "code": "pt",
          "lang": "Portuguese",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonema"
        },
        {
          "code": "ro",
          "lang": "Romanian",
          "sense": "indivisible unit of sound",
          "tags": [
            "neuter"
          ],
          "word": "fonem"
        },
        {
          "code": "ru",
          "lang": "Russian",
          "roman": "fonéma",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "фоне́ма"
        },
        {
          "code": "sh",
          "lang": "Serbo-Croatian",
          "sense": "indivisible unit of sound",
          "tags": [
            "Cyrillic",
            "masculine"
          ],
          "word": "фо̀не̄м"
        },
        {
          "code": "sh",
          "lang": "Serbo-Croatian",
          "sense": "indivisible unit of sound",
          "tags": [
            "Cyrillic",
            "feminine"
          ],
          "word": "фоне́ма"
        },
        {
          "code": "sh",
          "lang": "Serbo-Croatian",
          "sense": "indivisible unit of sound",
          "tags": [
            "Roman",
            "masculine"
          ],
          "word": "fònēm"
        },
        {
          "code": "sh",
          "lang": "Serbo-Croatian",
          "sense": "indivisible unit of sound",
          "tags": [
            "Roman",
            "feminine"
          ],
          "word": "fonéma"
        },
        {
          "code": "scn",
          "lang": "Sicilian",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "funima"
        },
        {
          "code": "sd",
          "lang": "Sindhi",
          "roman": "viome",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "وايوم"
        },
        {
          "code": "sk",
          "lang": "Slovak",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "fonéma"
        },
        {
          "code": "sl",
          "lang": "Slovene",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonem"
        },
        {
          "code": "sl",
          "lang": "Slovene",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "glásnik"
        },
        {
          "code": "dsb",
          "lang": "Lower Sorbian",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonem"
        },
        {
          "code": "hsb",
          "lang": "Upper Sorbian",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonem"
        },
        {
          "code": "es",
          "lang": "Spanish",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "fonema"
        },
        {
          "code": "sw",
          "lang": "Swahili",
          "sense": "indivisible unit of sound",
          "word": "fonimu"
        },
        {
          "code": "sv",
          "lang": "Swedish",
          "sense": "indivisible unit of sound",
          "tags": [
            "neuter"
          ],
          "word": "fonem"
        },
        {
          "code": "tl",
          "lang": "Tagalog",
          "sense": "indivisible unit of sound",
          "word": "multinig"
        },
        {
          "code": "ta",
          "lang": "Tamil",
          "roman": "oliyaṉ",
          "sense": "indivisible unit of sound",
          "word": "ஒலியன்"
        },
        {
          "code": "th",
          "lang": "Thai",
          "roman": "nùai-sǐiang",
          "sense": "indivisible unit of sound",
          "word": "หน่วยเสียง"
        },
        {
          "code": "bo",
          "lang": "Tibetan",
          "roman": "ma sgra",
          "sense": "indivisible unit of sound",
          "word": "མ་སྒྲ"
        },
        {
          "code": "tr",
          "lang": "Turkish",
          "sense": "indivisible unit of sound",
          "word": "sesbirim"
        },
        {
          "code": "uk",
          "lang": "Ukrainian",
          "roman": "fonéma",
          "sense": "indivisible unit of sound",
          "tags": [
            "feminine"
          ],
          "word": "фоне́ма"
        },
        {
          "alt": "音位",
          "code": "vi",
          "lang": "Vietnamese",
          "sense": "indivisible unit of sound",
          "word": "âm vị"
        },
        {
          "code": "wa",
          "lang": "Walloon",
          "sense": "indivisible unit of sound",
          "word": "oyon"
        },
        {
          "code": "yi",
          "lang": "Yiddish",
          "roman": "fonem",
          "sense": "indivisible unit of sound",
          "tags": [
            "masculine"
          ],
          "word": "פֿאָנעם"
        }
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/ˈfəʊ.niːm/",
      "tags": [
        "Received-Pronunciation"
      ]
    },
    {
      "ipa": "/ˈfoʊ.nim/",
      "tags": [
        "General-American"
      ]
    },
    {
      "rhymes": "-əʊniːm"
    },
    {
      "audio": "En-us-phoneme.ogg",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/4/47/En-us-phoneme.ogg/En-us-phoneme.ogg.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/4/47/En-us-phoneme.ogg",
      "tags": [
        "US"
      ],
      "text": "Audio (US)"
    }
  ],
  "word": "phoneme"
}
{
  "derived": [
    {
      "word": "archiphoneme"
    },
    {
      "word": "diaphoneme"
    },
    {
      "word": "phonemic"
    },
    {
      "word": "phonemical"
    },
    {
      "word": "phonemically"
    },
    {
      "word": "phonemicist"
    },
    {
      "word": "phonemicity"
    },
    {
      "word": "phonemics"
    },
    {
      "word": "subphoneme"
    }
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "ine-pro",
        "3": "*bʰeh₂-",
        "id": "speak"
      },
      "expansion": "",
      "name": "root"
    },
    {
      "args": {
        "1": "en",
        "2": "grc",
        "3": "φώνημα",
        "4": "",
        "5": "sound"
      },
      "expansion": "Ancient Greek φώνημα (phṓnēma, “sound”)",
      "name": "der"
    },
    {
      "args": {
        "1": "grc",
        "2": "φωνέω",
        "3": "",
        "4": "to sound"
      },
      "expansion": "φωνέω (phōnéō, “to sound”)",
      "name": "m"
    },
    {
      "args": {
        "1": "grc",
        "2": "φωνή",
        "3": "",
        "4": "sound"
      },
      "expansion": "φωνή (phōnḗ, “sound”)",
      "name": "m"
    },
    {
      "args": {
        "1": "en",
        "2": "phone<t:speech sound>",
        "3": "-eme<t:unit>"
      },
      "expansion": "By surface analysis, phone (“speech sound”) + -eme (“unit”)",
      "name": "surf"
    }
  ],
  "etymology_text": "From Ancient Greek φώνημα (phṓnēma, “sound”), from φωνέω (phōnéō, “to sound”), from φωνή (phōnḗ, “sound”). By surface analysis, phone (“speech sound”) + -eme (“unit”).",
  "forms": [
    {
      "form": "phonemes",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "phoneme (plural phonemes)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "related": [
    {
      "word": "allophone"
    },
    {
      "word": "allophonic"
    },
    {
      "word": "allophonical"
    },
    {
      "word": "allophonically"
    },
    {
      "word": "allophonics"
    },
    {
      "word": "diaphone"
    },
    {
      "word": "diaphonic"
    },
    {
      "word": "diaphonical"
    },
    {
      "word": "diaphonically"
    },
    {
      "word": "diaphonics"
    },
    {
      "word": "diaphonologic"
    },
    {
      "word": "diaphonological"
    },
    {
      "word": "diaphonologically"
    },
    {
      "word": "diaphonology"
    },
    {
      "word": "phone"
    },
    {
      "word": "phonetic"
    },
    {
      "word": "phonetical"
    },
    {
      "word": "phonetically"
    },
    {
      "word": "phonetics"
    },
    {
      "word": "phonic"
    },
    {
      "word": "phonical"
    },
    {
      "word": "phonically"
    },
    {
      "word": "phonics"
    },
    {
      "word": "phonologic"
    },
    {
      "word": "phonological"
    },
    {
      "word": "phonologically"
    },
    {
      "word": "phonologist"
    },
    {
      "word": "phonology"
    },
    {
      "word": "chereme"
    },
    {
      "word": "chroneme"
    },
    {
      "word": "grammeme"
    },
    {
      "word": "grapheme"
    },
    {
      "word": "lemma"
    },
    {
      "word": "lexeme"
    },
    {
      "word": "listeme"
    },
    {
      "word": "morpheme"
    },
    {
      "word": "sememe"
    },
    {
      "word": "toneme"
    }
  ],
  "senses": [
    {
      "categories": [
        "Arabic terms with non-redundant manual transliterations",
        "Cantonese terms with redundant transliterations",
        "English 2-syllable words",
        "English countable nouns",
        "English entries with incorrect language header",
        "English entries with topic categories using raw markup",
        "English lemmas",
        "English nouns",
        "English terms derived from Ancient Greek",
        "English terms derived from Proto-Indo-European",
        "English terms derived from the Proto-Indo-European root *bʰeh₂- (speak)",
        "English terms suffixed with -eme",
        "English terms with IPA pronunciation",
        "English terms with audio links",
        "English terms with quotations",
        "Mandarin terms with redundant transliterations",
        "Rhymes:English/əʊniːm",
        "Rhymes:English/əʊniːm/2 syllables",
        "en:Phonology"
      ],
      "examples": [
        {
          "ref": "1990, Jarmo Lainio, “Sweden Finnish — development or deterioration?”, in Durk Gorter, editor, Fourth International Conference on Minority Languages: Western and Eastern European papers, Multilingual Matters, page 31",
          "text": "It is crucial for the phoneme structure of Finnish — traditionally /d/ has not been included in the Finnish phonotax, but it fulfils the criteria of a phoneme (Karlsson, 1983: 66-7).",
          "type": "quotation"
        }
      ],
      "glosses": [
        "An indivisible unit of sound in a given language. A phoneme is an abstraction of the physical speech sounds (phones) and may encompass several different phones."
      ],
      "links": [
        [
          "unit",
          "unit"
        ],
        [
          "sound",
          "sound"
        ],
        [
          "phone",
          "phone"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/ˈfəʊ.niːm/",
      "tags": [
        "Received-Pronunciation"
      ]
    },
    {
      "ipa": "/ˈfoʊ.nim/",
      "tags": [
        "General-American"
      ]
    },
    {
      "rhymes": "-əʊniːm"
    },
    {
      "audio": "En-us-phoneme.ogg",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/4/47/En-us-phoneme.ogg/En-us-phoneme.ogg.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/4/47/En-us-phoneme.ogg",
      "tags": [
        "US"
      ],
      "text": "Audio (US)"
    }
  ],
  "translations": [
    {
      "code": "sq",
      "lang": "Albanian",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "fonemë"
    },
    {
      "code": "ar",
      "lang": "Arabic",
      "roman": "fōnēm",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "فُونِيم"
    },
    {
      "code": "ar",
      "lang": "Arabic",
      "roman": "ṣawt luḡawiyy",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "صَوْت لُغَوِيّ"
    },
    {
      "code": "hy",
      "lang": "Armenian",
      "roman": "hnčʿuytʿ",
      "sense": "indivisible unit of sound",
      "word": "հնչույթ"
    },
    {
      "code": "ast",
      "lang": "Asturian",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonema"
    },
    {
      "code": "be",
      "lang": "Belarusian",
      "roman": "fanjéma",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "фане́ма"
    },
    {
      "code": "be",
      "lang": "Belarusian",
      "roman": "fanéma",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "фанэ́ма"
    },
    {
      "code": "br",
      "lang": "Breton",
      "sense": "indivisible unit of sound",
      "word": "soniad"
    },
    {
      "code": "bg",
      "lang": "Bulgarian",
      "roman": "fonéma",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "фоне́ма"
    },
    {
      "code": "ca",
      "lang": "Catalan",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonema"
    },
    {
      "code": "ceb",
      "lang": "Cebuano",
      "sense": "indivisible unit of sound",
      "word": "ponema"
    },
    {
      "code": "yue",
      "english": "jam¹ sou³",
      "lang": "Chinese Cantonese",
      "sense": "indivisible unit of sound",
      "word": "音素"
    },
    {
      "alt": "jam¹ wai⁶⁻²",
      "code": "yue",
      "lang": "Chinese Cantonese",
      "sense": "indivisible unit of sound",
      "word": "音位"
    },
    {
      "code": "nan-hbl",
      "lang": "Chinese",
      "roman": "im-sò͘",
      "sense": "indivisible unit of sound",
      "tags": [
        "Hokkien"
      ],
      "word": "音素"
    },
    {
      "code": "cmn",
      "lang": "Chinese Mandarin",
      "roman": "yīnsù",
      "sense": "indivisible unit of sound",
      "word": "音素"
    },
    {
      "code": "cmn",
      "lang": "Chinese Mandarin",
      "roman": "yīnwèi",
      "sense": "indivisible unit of sound",
      "word": "音位"
    },
    {
      "code": "cs",
      "lang": "Czech",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "foném"
    },
    {
      "code": "da",
      "lang": "Danish",
      "sense": "indivisible unit of sound",
      "tags": [
        "neuter"
      ],
      "word": "fonem"
    },
    {
      "code": "nl",
      "lang": "Dutch",
      "sense": "indivisible unit of sound",
      "tags": [
        "neuter"
      ],
      "word": "foneem"
    },
    {
      "code": "eo",
      "lang": "Esperanto",
      "sense": "indivisible unit of sound",
      "word": "fonemo"
    },
    {
      "code": "et",
      "lang": "Estonian",
      "sense": "indivisible unit of sound",
      "word": "häälik"
    },
    {
      "code": "et",
      "lang": "Estonian",
      "sense": "indivisible unit of sound",
      "word": "foneem"
    },
    {
      "code": "fi",
      "lang": "Finnish",
      "sense": "indivisible unit of sound",
      "word": "foneemi"
    },
    {
      "code": "fr",
      "lang": "French",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "phonème"
    },
    {
      "code": "gl",
      "lang": "Galician",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonema"
    },
    {
      "code": "ka",
      "lang": "Georgian",
      "roman": "bgera",
      "sense": "indivisible unit of sound",
      "word": "ბგერა"
    },
    {
      "code": "de",
      "lang": "German",
      "sense": "indivisible unit of sound",
      "tags": [
        "neuter"
      ],
      "word": "Phonem"
    },
    {
      "code": "el",
      "lang": "Greek",
      "roman": "fónima",
      "sense": "indivisible unit of sound",
      "tags": [
        "neuter"
      ],
      "word": "φώνημα"
    },
    {
      "code": "gu",
      "lang": "Gujarati",
      "roman": "dhvanighṭak",
      "sense": "indivisible unit of sound",
      "word": "ધ્વનિઘટક"
    },
    {
      "code": "hi",
      "lang": "Hindi",
      "roman": "dhvanigrām",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "ध्वनिग्राम"
    },
    {
      "code": "hi",
      "lang": "Hindi",
      "roman": "dhvani",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "ध्वनि"
    },
    {
      "code": "hi",
      "lang": "Hindi",
      "roman": "vāc",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "वाच्"
    },
    {
      "code": "hi",
      "lang": "Hindi",
      "roman": "vyom",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "व्योम"
    },
    {
      "code": "hu",
      "lang": "Hungarian",
      "sense": "indivisible unit of sound",
      "word": "fonéma"
    },
    {
      "code": "is",
      "lang": "Icelandic",
      "sense": "indivisible unit of sound",
      "word": "hljóðan"
    },
    {
      "code": "io",
      "lang": "Ido",
      "sense": "indivisible unit of sound",
      "word": "fonemo"
    },
    {
      "code": "ia",
      "lang": "Interlingua",
      "sense": "indivisible unit of sound",
      "word": "phonema"
    },
    {
      "code": "it",
      "lang": "Italian",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonema"
    },
    {
      "alt": "おんそ",
      "code": "ja",
      "lang": "Japanese",
      "roman": "onso",
      "sense": "indivisible unit of sound",
      "word": "音素"
    },
    {
      "code": "kn",
      "lang": "Kannada",
      "roman": "dhvanimā",
      "sense": "indivisible unit of sound",
      "word": "ಧ್ವನಿಮಾ"
    },
    {
      "code": "km",
      "lang": "Khmer",
      "roman": "sattĕəʼtaa",
      "sense": "indivisible unit of sound",
      "word": "សទ្ទតា"
    },
    {
      "code": "km",
      "lang": "Khmer",
      "roman": "sattĕəʼ phuut",
      "sense": "indivisible unit of sound",
      "word": "សទ្ទភូត"
    },
    {
      "alt": "音素",
      "code": "ko",
      "lang": "Korean",
      "roman": "eumso",
      "sense": "indivisible unit of sound",
      "word": "음소"
    },
    {
      "alt": "表音",
      "code": "ko",
      "lang": "Korean",
      "roman": "pyoeum",
      "sense": "indivisible unit of sound",
      "word": "표음"
    },
    {
      "code": "ko",
      "lang": "Korean",
      "roman": "natsori",
      "sense": "indivisible unit of sound",
      "word": "낱소리"
    },
    {
      "code": "kmr",
      "lang": "Northern Kurdish",
      "sense": "indivisible unit of sound",
      "word": "vac"
    },
    {
      "code": "la",
      "lang": "Latin",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "phōnēma"
    },
    {
      "code": "lv",
      "lang": "Latvian",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "fonēma"
    },
    {
      "code": "lt",
      "lang": "Lithuanian",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "fonema"
    },
    {
      "code": "ms",
      "lang": "Malay",
      "sense": "indivisible unit of sound",
      "word": "fonem"
    },
    {
      "code": "ms",
      "lang": "Malay",
      "sense": "indivisible unit of sound",
      "word": "warna bunyi"
    },
    {
      "code": "gv",
      "lang": "Manx",
      "sense": "indivisible unit of sound",
      "word": "myn-heean"
    },
    {
      "code": "mr",
      "lang": "Marathi",
      "roman": "dhvanighṭak",
      "sense": "indivisible unit of sound",
      "word": "ध्वनिघटक"
    },
    {
      "code": "mn",
      "lang": "Mongolian",
      "roman": "avialbar",
      "sense": "indivisible unit of sound",
      "tags": [
        "Cyrillic"
      ],
      "word": "авиалбар"
    },
    {
      "code": "mn",
      "lang": "Mongolian",
      "roman": "fonem",
      "sense": "indivisible unit of sound",
      "tags": [
        "Cyrillic"
      ],
      "word": "фонем"
    },
    {
      "code": "mn",
      "lang": "Mongolian",
      "roman": "abiyalburi",
      "sense": "indivisible unit of sound",
      "tags": [
        "Mongolian"
      ],
      "word": "ᠠᠪᠢᠶᠠᠯᠪᠤᠷᠢ"
    },
    {
      "code": "mn",
      "lang": "Mongolian",
      "roman": "fonēm",
      "sense": "indivisible unit of sound",
      "tags": [
        "Mongolian"
      ],
      "word": "ᠹᠣᠨᠧᠮ"
    },
    {
      "code": "oc",
      "lang": "Occitan",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonèma"
    },
    {
      "code": "fa",
      "lang": "Persian",
      "roman": "vâj",
      "sense": "indivisible unit of sound",
      "word": "واج"
    },
    {
      "code": "pl",
      "lang": "Polish",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonem"
    },
    {
      "code": "pt",
      "lang": "Portuguese",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonema"
    },
    {
      "code": "ro",
      "lang": "Romanian",
      "sense": "indivisible unit of sound",
      "tags": [
        "neuter"
      ],
      "word": "fonem"
    },
    {
      "code": "ru",
      "lang": "Russian",
      "roman": "fonéma",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "фоне́ма"
    },
    {
      "code": "sh",
      "lang": "Serbo-Croatian",
      "sense": "indivisible unit of sound",
      "tags": [
        "Cyrillic",
        "masculine"
      ],
      "word": "фо̀не̄м"
    },
    {
      "code": "sh",
      "lang": "Serbo-Croatian",
      "sense": "indivisible unit of sound",
      "tags": [
        "Cyrillic",
        "feminine"
      ],
      "word": "фоне́ма"
    },
    {
      "code": "sh",
      "lang": "Serbo-Croatian",
      "sense": "indivisible unit of sound",
      "tags": [
        "Roman",
        "masculine"
      ],
      "word": "fònēm"
    },
    {
      "code": "sh",
      "lang": "Serbo-Croatian",
      "sense": "indivisible unit of sound",
      "tags": [
        "Roman",
        "feminine"
      ],
      "word": "fonéma"
    },
    {
      "code": "scn",
      "lang": "Sicilian",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "funima"
    },
    {
      "code": "sd",
      "lang": "Sindhi",
      "roman": "viome",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "وايوم"
    },
    {
      "code": "sk",
      "lang": "Slovak",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "fonéma"
    },
    {
      "code": "sl",
      "lang": "Slovene",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonem"
    },
    {
      "code": "sl",
      "lang": "Slovene",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "glásnik"
    },
    {
      "code": "dsb",
      "lang": "Lower Sorbian",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonem"
    },
    {
      "code": "hsb",
      "lang": "Upper Sorbian",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonem"
    },
    {
      "code": "es",
      "lang": "Spanish",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "fonema"
    },
    {
      "code": "sw",
      "lang": "Swahili",
      "sense": "indivisible unit of sound",
      "word": "fonimu"
    },
    {
      "code": "sv",
      "lang": "Swedish",
      "sense": "indivisible unit of sound",
      "tags": [
        "neuter"
      ],
      "word": "fonem"
    },
    {
      "code": "tl",
      "lang": "Tagalog",
      "sense": "indivisible unit of sound",
      "word": "multinig"
    },
    {
      "code": "ta",
      "lang": "Tamil",
      "roman": "oliyaṉ",
      "sense": "indivisible unit of sound",
      "word": "ஒலியன்"
    },
    {
      "code": "th",
      "lang": "Thai",
      "roman": "nùai-sǐiang",
      "sense": "indivisible unit of sound",
      "word": "หน่วยเสียง"
    },
    {
      "code": "bo",
      "lang": "Tibetan",
      "roman": "ma sgra",
      "sense": "indivisible unit of sound",
      "word": "མ་སྒྲ"
    },
    {
      "code": "tr",
      "lang": "Turkish",
      "sense": "indivisible unit of sound",
      "word": "sesbirim"
    },
    {
      "code": "uk",
      "lang": "Ukrainian",
      "roman": "fonéma",
      "sense": "indivisible unit of sound",
      "tags": [
        "feminine"
      ],
      "word": "фоне́ма"
    },
    {
      "alt": "音位",
      "code": "vi",
      "lang": "Vietnamese",
      "sense": "indivisible unit of sound",
      "word": "âm vị"
    },
    {
      "code": "wa",
      "lang": "Walloon",
      "sense": "indivisible unit of sound",
      "word": "oyon"
    },
    {
      "code": "yi",
      "lang": "Yiddish",
      "roman": "fonem",
      "sense": "indivisible unit of sound",
      "tags": [
        "masculine"
      ],
      "word": "פֿאָנעם"
    }
  ],
  "word": "phoneme"
}

This page is a part of the kaikki.org machine-readable English dictionary. This dictionary is based on structured data extracted on 2024-04-26 from the enwiktionary dump dated 2024-04-21 using wiktextract (93a6c53 and 21a9316). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.