"আজম" meaning in All languages combined

See আজম on Wiktionary

Proper name [Bengali]

IPA: /azom/ (note: Rarh), [ˈazom] (note: Rarh), /adʒom/ (note: Rarh), [ˈadʒom] (note: Rarh), /azom/ (note: Dhaka), [ˈazom] (note: Dhaka), /adʑom/ (note: Dhaka), [ˈadʑom] (note: Dhaka) Forms: azom [romanization]
Etymology: Borrowed from Arabic أعظم (ʔaʕẓam). Etymology templates: {{glossary|loanword|Borrowed}} Borrowed, {{bor|bn|ar|أعظم|||g=|g2=|g3=|id=|lit=|nocat=|pos=|sc=|sort=|tr=ʔaʕẓam|ts=}} Arabic أعظم (ʔaʕẓam), {{bor+|bn|ar|أعظم|tr=ʔaʕẓam}} Borrowed from Arabic أعظم (ʔaʕẓam), {{root|bn|ar|ع ظ م}} Head templates: {{head|bn|proper noun|tr=azom}} আজম • (azom)
  1. a male given name from Arabic Categories (topical): Bengali given names, Bengali male given names Derived forms: উজিরে আজম (uzire azôm), ইসমে আজম (isme azôm)
    Sense id: en-আজম-bn-name-i3XrTvdu Categories (other): Bengali entries with incorrect language header, Bengali terms with non-redundant manual transliterations Disambiguation of Bengali entries with incorrect language header: 38 25 16 22 Disambiguation of Bengali terms with non-redundant manual transliterations: 45 16 14 26
The following are not (yet) sense-disambiguated
Etymology number: 1

Noun [Bengali]

IPA: /azom/ (note: Rarh), [ˈazom] (note: Rarh), /adʒom/ (note: Rarh), [ˈadʒom] (note: Rarh), /azom/ (note: Dhaka), [ˈazom] (note: Dhaka), /adʑom/ (note: Dhaka), [ˈadʑom] (note: Dhaka) Forms: azom [romanization]
Etymology: Borrowed from Arabic أعظم (ʔaʕẓam). Etymology templates: {{glossary|loanword|Borrowed}} Borrowed, {{bor|bn|ar|أعظم|||g=|g2=|g3=|id=|lit=|nocat=|pos=|sc=|sort=|tr=ʔaʕẓam|ts=}} Arabic أعظم (ʔaʕẓam), {{bor+|bn|ar|أعظم|tr=ʔaʕẓam}} Borrowed from Arabic أعظم (ʔaʕẓam), {{root|bn|ar|ع ظ م}} Head templates: {{head|bn|noun|||||||||||||||||autotrinfl=1|head=|sort=|tr=azom|tr2=}} আজম • (azom), {{bn-noun|tr=azom}} আজম • (azom)
  1. greatest
    Sense id: en-আজম-bn-noun-NblZL1Mj
  2. alternative spelling of আযম (azôm)
    Sense id: en-আজম-bn-noun-q8dpDBCZ Categories (other): Bengali terms with non-redundant manual transliterations
The following are not (yet) sense-disambiguated
Etymology number: 1

Noun [Bengali]

IPA: /azom/ (note: Rarh), [ˈazom] (note: Rarh), /adʒom/ (note: Rarh), [ˈadʒom] (note: Rarh), /azom/ (note: Dhaka), [ˈazom] (note: Dhaka), /adʑom/ (note: Dhaka), [ˈadʑom] (note: Dhaka) Forms: ajôm [romanization]
Etymology: Borrowed from Arabic عجم (ʕajam). Etymology templates: {{glossary|loanword|Borrowed}} Borrowed, {{bor|bn|ar|عجم|||g=|g2=|g3=|id=|lit=|nocat=|pos=|sc=|sort=|tr=ʕajam|ts=}} Arabic عجم (ʕajam), {{bor+|bn|ar|عجم|tr=ʕajam}} Borrowed from Arabic عجم (ʕajam) Head templates: {{head|bn|noun|||||||||||||||||autotrinfl=1|head=|sort=|tr=ajôm|tr2=}} আজম • (ajôm), {{bn-noun|tr=ajôm}} আজম • (ajôm)
  1. non-Arab, non-Arabic, not relating to Arabia Derived forms: আজমী (ajmī)
    Sense id: en-আজম-bn-noun-XV0awWMF
The following are not (yet) sense-disambiguated
Etymology number: 2

JSON data for আজম meaning in All languages combined (5.9kB)

{
  "etymology_number": 1,
  "etymology_templates": [
    {
      "args": {
        "1": "loanword",
        "2": "Borrowed"
      },
      "expansion": "Borrowed",
      "name": "glossary"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "أعظم",
        "4": "",
        "5": "",
        "g": "",
        "g2": "",
        "g3": "",
        "id": "",
        "lit": "",
        "nocat": "",
        "pos": "",
        "sc": "",
        "sort": "",
        "tr": "ʔaʕẓam",
        "ts": ""
      },
      "expansion": "Arabic أعظم (ʔaʕẓam)",
      "name": "bor"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "أعظم",
        "tr": "ʔaʕẓam"
      },
      "expansion": "Borrowed from Arabic أعظم (ʔaʕẓam)",
      "name": "bor+"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "ع ظ م"
      },
      "expansion": "",
      "name": "root"
    }
  ],
  "etymology_text": "Borrowed from Arabic أعظم (ʔaʕẓam).",
  "forms": [
    {
      "form": "azom",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bn",
        "10": "",
        "11": "",
        "12": "",
        "13": "",
        "14": "",
        "15": "",
        "16": "",
        "17": "",
        "18": "",
        "2": "noun",
        "3": "",
        "4": "",
        "5": "",
        "6": "",
        "7": "",
        "8": "",
        "9": "",
        "autotrinfl": "1",
        "head": "",
        "sort": "",
        "tr": "azom",
        "tr2": ""
      },
      "expansion": "আজম • (azom)",
      "name": "head"
    },
    {
      "args": {
        "tr": "azom"
      },
      "expansion": "আজম • (azom)",
      "name": "bn-noun"
    }
  ],
  "lang": "Bengali",
  "lang_code": "bn",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "greatest"
      ],
      "id": "en-আজম-bn-noun-NblZL1Mj",
      "links": [
        [
          "greatest",
          "greatest"
        ]
      ]
    },
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bengali terms with non-redundant manual transliterations",
          "parents": [
            "Terms with non-redundant manual transliterations",
            "Entry maintenance"
          ],
          "source": "w"
        }
      ],
      "glosses": [
        "alternative spellings আযম (azôm)"
      ],
      "id": "en-আজম-bn-noun-q8dpDBCZ",
      "links": [
        [
          "আযম",
          "আযম#Bengali"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/azom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Rarh"
    },
    {
      "ipa": "/adʒom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈadʒom]",
      "note": "Rarh"
    },
    {
      "ipa": "/azom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Dhaka"
    },
    {
      "ipa": "/adʑom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈadʑom]",
      "note": "Dhaka"
    }
  ],
  "word": "আজম"
}

{
  "etymology_number": 1,
  "etymology_templates": [
    {
      "args": {
        "1": "loanword",
        "2": "Borrowed"
      },
      "expansion": "Borrowed",
      "name": "glossary"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "أعظم",
        "4": "",
        "5": "",
        "g": "",
        "g2": "",
        "g3": "",
        "id": "",
        "lit": "",
        "nocat": "",
        "pos": "",
        "sc": "",
        "sort": "",
        "tr": "ʔaʕẓam",
        "ts": ""
      },
      "expansion": "Arabic أعظم (ʔaʕẓam)",
      "name": "bor"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "أعظم",
        "tr": "ʔaʕẓam"
      },
      "expansion": "Borrowed from Arabic أعظم (ʔaʕẓam)",
      "name": "bor+"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "ع ظ م"
      },
      "expansion": "",
      "name": "root"
    }
  ],
  "etymology_text": "Borrowed from Arabic أعظم (ʔaʕẓam).",
  "forms": [
    {
      "form": "azom",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bn",
        "2": "proper noun",
        "tr": "azom"
      },
      "expansion": "আজম • (azom)",
      "name": "head"
    }
  ],
  "lang": "Bengali",
  "lang_code": "bn",
  "pos": "name",
  "senses": [
    {
      "categories": [
        {
          "kind": "topical",
          "name": "Bengali given names",
          "parents": [
            "Given names",
            "Names",
            "All topics",
            "Proper nouns",
            "Terms by semantic function",
            "Fundamental",
            "Nouns",
            "Lemmas"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "name": "Bengali male given names",
          "parents": [
            "Male given names",
            "Given names",
            "Names",
            "All topics",
            "Proper nouns",
            "Terms by semantic function",
            "Fundamental",
            "Nouns",
            "Lemmas"
          ],
          "source": "w"
        },
        {
          "_dis": "38 25 16 22",
          "kind": "other",
          "name": "Bengali entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "45 16 14 26",
          "kind": "other",
          "name": "Bengali terms with non-redundant manual transliterations",
          "parents": [
            "Terms with non-redundant manual transliterations",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        }
      ],
      "derived": [
        {
          "roman": "uzire azôm",
          "word": "উজিরে আজম"
        },
        {
          "roman": "isme azôm",
          "word": "ইসমে আজম"
        }
      ],
      "glosses": [
        "a male given name from Arabic"
      ],
      "id": "en-আজম-bn-name-i3XrTvdu",
      "links": [
        [
          "given name",
          "given name"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/azom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Rarh"
    },
    {
      "ipa": "/adʒom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈadʒom]",
      "note": "Rarh"
    },
    {
      "ipa": "/azom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Dhaka"
    },
    {
      "ipa": "/adʑom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈadʑom]",
      "note": "Dhaka"
    }
  ],
  "word": "আজম"
}

{
  "etymology_number": 2,
  "etymology_templates": [
    {
      "args": {
        "1": "loanword",
        "2": "Borrowed"
      },
      "expansion": "Borrowed",
      "name": "glossary"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "عجم",
        "4": "",
        "5": "",
        "g": "",
        "g2": "",
        "g3": "",
        "id": "",
        "lit": "",
        "nocat": "",
        "pos": "",
        "sc": "",
        "sort": "",
        "tr": "ʕajam",
        "ts": ""
      },
      "expansion": "Arabic عجم (ʕajam)",
      "name": "bor"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "عجم",
        "tr": "ʕajam"
      },
      "expansion": "Borrowed from Arabic عجم (ʕajam)",
      "name": "bor+"
    }
  ],
  "etymology_text": "Borrowed from Arabic عجم (ʕajam).",
  "forms": [
    {
      "form": "ajôm",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bn",
        "10": "",
        "11": "",
        "12": "",
        "13": "",
        "14": "",
        "15": "",
        "16": "",
        "17": "",
        "18": "",
        "2": "noun",
        "3": "",
        "4": "",
        "5": "",
        "6": "",
        "7": "",
        "8": "",
        "9": "",
        "autotrinfl": "1",
        "head": "",
        "sort": "",
        "tr": "ajôm",
        "tr2": ""
      },
      "expansion": "আজম • (ajôm)",
      "name": "head"
    },
    {
      "args": {
        "tr": "ajôm"
      },
      "expansion": "আজম • (ajôm)",
      "name": "bn-noun"
    }
  ],
  "lang": "Bengali",
  "lang_code": "bn",
  "pos": "noun",
  "senses": [
    {
      "derived": [
        {
          "roman": "ajmī",
          "word": "আজমী"
        }
      ],
      "glosses": [
        "non-Arab, non-Arabic, not relating to Arabia"
      ],
      "id": "en-আজম-bn-noun-XV0awWMF"
    }
  ],
  "sounds": [
    {
      "ipa": "/azom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Rarh"
    },
    {
      "ipa": "/adʒom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈadʒom]",
      "note": "Rarh"
    },
    {
      "ipa": "/azom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Dhaka"
    },
    {
      "ipa": "/adʑom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈadʑom]",
      "note": "Dhaka"
    }
  ],
  "word": "আজম"
}

{
  "categories": [
    "Bengali entries with incorrect language header",
    "Bengali lemmas",
    "Bengali nouns",
    "Bengali proper nouns",
    "Bengali terms borrowed from Arabic",
    "Bengali terms derived from Arabic",
    "Bengali terms derived from the Arabic root ع ظ م",
    "Bengali terms with IPA pronunciation",
    "Bengali terms with non-redundant manual transliterations"
  ],
  "etymology_number": 1,
  "etymology_templates": [
    {
      "args": {
        "1": "loanword",
        "2": "Borrowed"
      },
      "expansion": "Borrowed",
      "name": "glossary"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "أعظم",
        "4": "",
        "5": "",
        "g": "",
        "g2": "",
        "g3": "",
        "id": "",
        "lit": "",
        "nocat": "",
        "pos": "",
        "sc": "",
        "sort": "",
        "tr": "ʔaʕẓam",
        "ts": ""
      },
      "expansion": "Arabic أعظم (ʔaʕẓam)",
      "name": "bor"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "أعظم",
        "tr": "ʔaʕẓam"
      },
      "expansion": "Borrowed from Arabic أعظم (ʔaʕẓam)",
      "name": "bor+"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "ع ظ م"
      },
      "expansion": "",
      "name": "root"
    }
  ],
  "etymology_text": "Borrowed from Arabic أعظم (ʔaʕẓam).",
  "forms": [
    {
      "form": "azom",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bn",
        "10": "",
        "11": "",
        "12": "",
        "13": "",
        "14": "",
        "15": "",
        "16": "",
        "17": "",
        "18": "",
        "2": "noun",
        "3": "",
        "4": "",
        "5": "",
        "6": "",
        "7": "",
        "8": "",
        "9": "",
        "autotrinfl": "1",
        "head": "",
        "sort": "",
        "tr": "azom",
        "tr2": ""
      },
      "expansion": "আজম • (azom)",
      "name": "head"
    },
    {
      "args": {
        "tr": "azom"
      },
      "expansion": "আজম • (azom)",
      "name": "bn-noun"
    }
  ],
  "lang": "Bengali",
  "lang_code": "bn",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "greatest"
      ],
      "links": [
        [
          "greatest",
          "greatest"
        ]
      ]
    },
    {
      "categories": [
        "Bengali terms with non-redundant manual transliterations"
      ],
      "glosses": [
        "alternative spellings আযম (azôm)"
      ],
      "links": [
        [
          "আযম",
          "আযম#Bengali"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/azom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Rarh"
    },
    {
      "ipa": "/adʒom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈadʒom]",
      "note": "Rarh"
    },
    {
      "ipa": "/azom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Dhaka"
    },
    {
      "ipa": "/adʑom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈadʑom]",
      "note": "Dhaka"
    }
  ],
  "word": "আজম"
}

{
  "categories": [
    "Bengali entries with incorrect language header",
    "Bengali lemmas",
    "Bengali nouns",
    "Bengali proper nouns",
    "Bengali terms borrowed from Arabic",
    "Bengali terms derived from Arabic",
    "Bengali terms derived from the Arabic root ع ظ م",
    "Bengali terms with IPA pronunciation",
    "Bengali terms with non-redundant manual transliterations"
  ],
  "derived": [
    {
      "roman": "uzire azôm",
      "word": "উজিরে আজম"
    },
    {
      "roman": "isme azôm",
      "word": "ইসমে আজম"
    }
  ],
  "etymology_number": 1,
  "etymology_templates": [
    {
      "args": {
        "1": "loanword",
        "2": "Borrowed"
      },
      "expansion": "Borrowed",
      "name": "glossary"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "أعظم",
        "4": "",
        "5": "",
        "g": "",
        "g2": "",
        "g3": "",
        "id": "",
        "lit": "",
        "nocat": "",
        "pos": "",
        "sc": "",
        "sort": "",
        "tr": "ʔaʕẓam",
        "ts": ""
      },
      "expansion": "Arabic أعظم (ʔaʕẓam)",
      "name": "bor"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "أعظم",
        "tr": "ʔaʕẓam"
      },
      "expansion": "Borrowed from Arabic أعظم (ʔaʕẓam)",
      "name": "bor+"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "ع ظ م"
      },
      "expansion": "",
      "name": "root"
    }
  ],
  "etymology_text": "Borrowed from Arabic أعظم (ʔaʕẓam).",
  "forms": [
    {
      "form": "azom",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bn",
        "2": "proper noun",
        "tr": "azom"
      },
      "expansion": "আজম • (azom)",
      "name": "head"
    }
  ],
  "lang": "Bengali",
  "lang_code": "bn",
  "pos": "name",
  "senses": [
    {
      "categories": [
        "Bengali given names",
        "Bengali male given names",
        "Bengali male given names from Arabic"
      ],
      "glosses": [
        "a male given name from Arabic"
      ],
      "links": [
        [
          "given name",
          "given name"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/azom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Rarh"
    },
    {
      "ipa": "/adʒom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈadʒom]",
      "note": "Rarh"
    },
    {
      "ipa": "/azom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Dhaka"
    },
    {
      "ipa": "/adʑom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈadʑom]",
      "note": "Dhaka"
    }
  ],
  "word": "আজম"
}

{
  "categories": [
    "Bengali entries with incorrect language header",
    "Bengali lemmas",
    "Bengali nouns",
    "Bengali terms borrowed from Arabic",
    "Bengali terms derived from Arabic",
    "Bengali terms with IPA pronunciation",
    "Bengali terms with non-redundant manual transliterations"
  ],
  "derived": [
    {
      "roman": "ajmī",
      "word": "আজমী"
    }
  ],
  "etymology_number": 2,
  "etymology_templates": [
    {
      "args": {
        "1": "loanword",
        "2": "Borrowed"
      },
      "expansion": "Borrowed",
      "name": "glossary"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "عجم",
        "4": "",
        "5": "",
        "g": "",
        "g2": "",
        "g3": "",
        "id": "",
        "lit": "",
        "nocat": "",
        "pos": "",
        "sc": "",
        "sort": "",
        "tr": "ʕajam",
        "ts": ""
      },
      "expansion": "Arabic عجم (ʕajam)",
      "name": "bor"
    },
    {
      "args": {
        "1": "bn",
        "2": "ar",
        "3": "عجم",
        "tr": "ʕajam"
      },
      "expansion": "Borrowed from Arabic عجم (ʕajam)",
      "name": "bor+"
    }
  ],
  "etymology_text": "Borrowed from Arabic عجم (ʕajam).",
  "forms": [
    {
      "form": "ajôm",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bn",
        "10": "",
        "11": "",
        "12": "",
        "13": "",
        "14": "",
        "15": "",
        "16": "",
        "17": "",
        "18": "",
        "2": "noun",
        "3": "",
        "4": "",
        "5": "",
        "6": "",
        "7": "",
        "8": "",
        "9": "",
        "autotrinfl": "1",
        "head": "",
        "sort": "",
        "tr": "ajôm",
        "tr2": ""
      },
      "expansion": "আজম • (ajôm)",
      "name": "head"
    },
    {
      "args": {
        "tr": "ajôm"
      },
      "expansion": "আজম • (ajôm)",
      "name": "bn-noun"
    }
  ],
  "lang": "Bengali",
  "lang_code": "bn",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "non-Arab, non-Arabic, not relating to Arabia"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/azom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Rarh"
    },
    {
      "ipa": "/adʒom/",
      "note": "Rarh"
    },
    {
      "ipa": "[ˈadʒom]",
      "note": "Rarh"
    },
    {
      "ipa": "/azom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈazom]",
      "note": "Dhaka"
    },
    {
      "ipa": "/adʑom/",
      "note": "Dhaka"
    },
    {
      "ipa": "[ˈadʑom]",
      "note": "Dhaka"
    }
  ],
  "word": "আজম"
}
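
Each "sounds" array above pairs an IPA string with a dialect note (Rarh or Dhaka). The following is a minimal sketch of how such an array could be grouped by note. The `sounds` list is copied from the entries on this page; the helper name `ipa_by_note` is hypothetical and not part of wiktextract.

```python
# Copy of the "sounds" array shared by the entries on this page.
sounds = [
    {"ipa": "/azom/", "note": "Rarh"},
    {"ipa": "[ˈazom]", "note": "Rarh"},
    {"ipa": "/adʒom/", "note": "Rarh"},
    {"ipa": "[ˈadʒom]", "note": "Rarh"},
    {"ipa": "/azom/", "note": "Dhaka"},
    {"ipa": "[ˈazom]", "note": "Dhaka"},
    {"ipa": "/adʑom/", "note": "Dhaka"},
    {"ipa": "[ˈadʑom]", "note": "Dhaka"},
]


def ipa_by_note(sound_list):
    """Group IPA strings by their dialect note (hypothetical helper)."""
    grouped = {}
    for sound in sound_list:
        if "ipa" not in sound:
            continue  # skip sound records that carry no IPA string
        # Records without a note are collected under the empty string.
        grouped.setdefault(sound.get("note", ""), []).append(sound["ipa"])
    return grouped


by_dialect = ipa_by_note(sounds)

# Keep only the phonemic transcriptions, i.e. those written between slashes.
phonemic = {
    note: [ipa for ipa in items if ipa.startswith("/")]
    for note, items in by_dialect.items()
}

for note, items in phonemic.items():
    print(note, ", ".join(items))
```

Grouping by note keeps the two dialects apart, which matters here because the affricate variant differs between them (/adʒom/ for Rarh, /adʑom/ for Dhaka).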

This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2024-05-25 from the enwiktionary dump dated 2024-05-02 using wiktextract (bb24e0f and c7ea76d). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
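
As a rough illustration of working with records shaped like the JSON objects on this page, the sketch below groups glosses by etymology number. The `records` list is a hand-trimmed copy of the three entries shown here (only the fields used), and the helper name `glosses_by_etymology` is hypothetical rather than part of wiktextract.

```python
from collections import defaultdict

# Trimmed copies of the three entries on this page (only the fields used below).
records = [
    {"word": "আজম", "pos": "noun", "etymology_number": 1,
     "senses": [{"glosses": ["greatest"]},
                {"glosses": ["alternative spellings আযম (azôm)"]}]},
    {"word": "আজম", "pos": "name", "etymology_number": 1,
     "senses": [{"glosses": ["a male given name from Arabic"]}]},
    {"word": "আজম", "pos": "noun", "etymology_number": 2,
     "senses": [{"glosses": ["non-Arab, non-Arabic, not relating to Arabia"]}]},
]


def glosses_by_etymology(entries):
    """Map etymology number -> list of (pos, gloss) pairs (hypothetical helper)."""
    grouped = defaultdict(list)
    for entry in entries:
        # Assumption: entries lacking the field are filed under 0.
        number = entry.get("etymology_number", 0)
        for sense in entry.get("senses", []):
            for gloss in sense.get("glosses", []):
                grouped[number].append((entry["pos"], gloss))
    return dict(grouped)


result = glosses_by_etymology(records)
for number in sorted(result):
    for pos, gloss in result[number]:
        print(number, pos, gloss)
```

Keying on "etymology_number" separates the homographs: the senses borrowed from Arabic أعظم fall under 1, and the sense borrowed from Arabic عجم falls under 2.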

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.