"сарбаз" meaning in All languages combined

See сарбаз on Wiktionary

Noun [Kazakh]

IPA: /sɑrˈbɑz/
Etymology: From Persian سرباز (sarbâz). Etymology templates: {{bor|kk|fa|سرباز|tr=sarbâz}} Persian سرباز (sarbâz) Head templates: {{kk-noun|pl=сарбаздар}} сарбаз • (sarbaz) (nominative plural сарбаздар) Inflection templates: {{kk-decl-noun|сарбаз|сарбаздар|сарбаздың|сарбаздардың|сарбазға|сарбаздарға|сарбазды|сарбаздарды|сарбазда|сарбаздарда|сарбаздан|сарбаздардан|сарбазбен|сарбаздармен|pl=}} Forms: sarbaz [romanization], сарбаздар [nominative, plural], no-table-tags [table-tags], сарбаз [nominative, singular], сарбаздар [nominative, plural], сарбаздың [genitive, singular], сарбаздардың [genitive, plural], сарбазға [dative, singular], сарбаздарға [dative, plural], сарбазды [accusative, singular], сарбаздарды [accusative, plural], сарбазда [locative, singular], сарбаздарда [locative, plural], сарбаздан [ablative, singular], сарбаздардан [ablative, plural], сарбазбен [instrumental, singular], сарбаздармен [instrumental, plural]
  1. soldier, warrior Synonyms (soldier): жауынгер (jauyñer)
    Sense id: en-сарбаз-kk-noun-TEpeV~Fa Disambiguation of 'soldier': 100 0
  2. (chess) pawn Categories (topical): Chess, Chess Synonyms (pawn): пешка (peşka)
    Sense id: en-сарбаз-kk-noun-BQKdS8LE Disambiguation of Chess: 5 95 Categories (other): Kazakh entries with incorrect language header, Kazakh terms with redundant script codes, Pages with 2 entries, Pages with entries Disambiguation of Kazakh entries with incorrect language header: 29 71 Disambiguation of Kazakh terms with redundant script codes: 21 79 Disambiguation of Pages with 2 entries: 9 91 Disambiguation of Pages with entries: 7 93 Topics: board-games, chess, games Disambiguation of 'pawn': 0 100
The following are not (yet) sense-disambiguated
Related terms: патша (patşa), уәзір (uäzır), патшайым (patşaiym), тура (tura), піл (pıl), ат (english: at), пешка (peşka)

Noun [Russian]

IPA: [sɐrˈbas]
Head templates: {{ru-noun+|сарба́з}} сарба́з • (sarbáz) m inan (genitive сарба́за, nominative plural сарба́зы, genitive plural сарба́зов) Forms: сарба́з [canonical], sarbáz [romanization], сарба́за [genitive], сарба́зы [nominative, plural], сарба́зов [genitive, plural], no-table-tags [table-tags], сарба́з [nominative, singular], сарба́зы [nominative, plural], сарба́за [genitive, singular], сарба́зов [genitive, plural], сарба́зу [dative, singular], сарба́зам [dative, plural], сарба́з [accusative, singular], сарба́зы [accusative, plural], сарба́зом [instrumental, singular], сарба́зами [instrumental, plural], сарба́зе [prepositional, singular], сарба́зах [plural, prepositional]
  1. Tags: no-gloss
{
  "etymology_templates": [
    {
      "args": {
        "1": "kk",
        "2": "fa",
        "3": "سرباز",
        "tr": "sarbâz"
      },
      "expansion": "Persian سرباز (sarbâz)",
      "name": "bor"
    }
  ],
  "etymology_text": "From Persian سرباز (sarbâz).",
  "forms": [
    {
      "form": "sarbaz",
      "tags": [
        "romanization"
      ]
    },
    {
      "form": "сарбаздар",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "no-table-tags",
      "source": "declension",
      "tags": [
        "table-tags"
      ]
    },
    {
      "form": "kk-noun-c",
      "source": "declension",
      "tags": [
        "inflection-template"
      ]
    },
    {
      "form": "сарбаз",
      "roman": "sarbaz",
      "source": "declension",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "сарбаздар",
      "roman": "sarbazdar",
      "source": "declension",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "сарбаздың",
      "roman": "sarbazdyñ",
      "source": "declension",
      "tags": [
        "genitive",
        "singular"
      ]
    },
    {
      "form": "сарбаздардың",
      "roman": "sarbazdardyñ",
      "source": "declension",
      "tags": [
        "genitive",
        "plural"
      ]
    },
    {
      "form": "сарбазға",
      "roman": "sarbazğa",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "сарбаздарға",
      "roman": "sarbazdarğa",
      "source": "declension",
      "tags": [
        "dative",
        "plural"
      ]
    },
    {
      "form": "сарбазды",
      "roman": "sarbazdy",
      "source": "declension",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "сарбаздарды",
      "roman": "sarbazdardy",
      "source": "declension",
      "tags": [
        "accusative",
        "plural"
      ]
    },
    {
      "form": "сарбазда",
      "roman": "sarbazda",
      "source": "declension",
      "tags": [
        "locative",
        "singular"
      ]
    },
    {
      "form": "сарбаздарда",
      "roman": "sarbazdarda",
      "source": "declension",
      "tags": [
        "locative",
        "plural"
      ]
    },
    {
      "form": "сарбаздан",
      "roman": "sarbazdan",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "сарбаздардан",
      "roman": "sarbazdardan",
      "source": "declension",
      "tags": [
        "ablative",
        "plural"
      ]
    },
    {
      "form": "сарбазбен",
      "roman": "sarbazben",
      "source": "declension",
      "tags": [
        "instrumental",
        "singular"
      ]
    },
    {
      "form": "сарбаздармен",
      "roman": "sarbazdarmen",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "pl": "сарбаздар"
      },
      "expansion": "сарбаз • (sarbaz) (nominative plural сарбаздар)",
      "name": "kk-noun"
    }
  ],
  "inflection_templates": [
    {
      "args": {
        "1": "сарбаз",
        "10": "сарбаздарда",
        "11": "сарбаздан",
        "12": "сарбаздардан",
        "13": "сарбазбен",
        "14": "сарбаздармен",
        "2": "сарбаздар",
        "3": "сарбаздың",
        "4": "сарбаздардың",
        "5": "сарбазға",
        "6": "сарбаздарға",
        "7": "сарбазды",
        "8": "сарбаздарды",
        "9": "сарбазда",
        "pl": ""
      },
      "name": "kk-decl-noun"
    }
  ],
  "lang": "Kazakh",
  "lang_code": "kk",
  "pos": "noun",
  "related": [
    {
      "_dis1": "0 0",
      "roman": "patşa",
      "word": "патша"
    },
    {
      "_dis1": "0 0",
      "roman": "uäzır",
      "word": "уәзір"
    },
    {
      "_dis1": "0 0",
      "roman": "patşaiym",
      "word": "патшайым"
    },
    {
      "_dis1": "0 0",
      "roman": "tura",
      "word": "тура"
    },
    {
      "_dis1": "0 0",
      "roman": "pıl",
      "word": "піл"
    },
    {
      "_dis1": "0 0",
      "english": "at",
      "word": "ат"
    },
    {
      "_dis1": "0 0",
      "roman": "peşka",
      "word": "пешка"
    }
  ],
  "senses": [
    {
      "glosses": [
        "soldier, warrior"
      ],
      "id": "en-сарбаз-kk-noun-TEpeV~Fa",
      "links": [
        [
          "soldier",
          "soldier"
        ],
        [
          "warrior",
          "warrior"
        ]
      ],
      "synonyms": [
        {
          "_dis1": "100 0",
          "roman": "jauyñer",
          "sense": "soldier",
          "word": "жауынгер"
        }
      ]
    },
    {
      "categories": [
        {
          "kind": "topical",
          "langcode": "kk",
          "name": "Chess",
          "orig": "kk:Chess",
          "parents": [
            "Board games",
            "Tabletop games",
            "Games",
            "Recreation",
            "Human activity",
            "Human behaviour",
            "Human",
            "All topics",
            "Fundamental"
          ],
          "source": "w"
        },
        {
          "_dis": "29 71",
          "kind": "other",
          "name": "Kazakh entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "21 79",
          "kind": "other",
          "name": "Kazakh terms with redundant script codes",
          "parents": [
            "Terms with redundant script codes",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "9 91",
          "kind": "other",
          "name": "Pages with 2 entries",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "7 93",
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "5 95",
          "kind": "topical",
          "langcode": "kk",
          "name": "Chess",
          "orig": "kk:Chess",
          "parents": [
            "Board games",
            "Tabletop games",
            "Games",
            "Recreation",
            "Human activity",
            "Human behaviour",
            "Human",
            "All topics",
            "Fundamental"
          ],
          "source": "w+disamb"
        }
      ],
      "glosses": [
        "pawn"
      ],
      "id": "en-сарбаз-kk-noun-BQKdS8LE",
      "links": [
        [
          "chess",
          "chess"
        ],
        [
          "pawn",
          "pawn"
        ]
      ],
      "raw_glosses": [
        "(chess) pawn"
      ],
      "synonyms": [
        {
          "_dis1": "0 100",
          "roman": "peşka",
          "sense": "pawn",
          "word": "пешка"
        }
      ],
      "topics": [
        "board-games",
        "chess",
        "games"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/sɑrˈbɑz/"
    }
  ],
  "word": "сарбаз"
}

{
  "forms": [
    {
      "form": "сарба́з",
      "tags": [
        "canonical"
      ]
    },
    {
      "form": "sarbáz",
      "tags": [
        "romanization"
      ]
    },
    {
      "form": "сарба́за",
      "tags": [
        "genitive"
      ]
    },
    {
      "form": "сарба́зы",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "сарба́зов",
      "tags": [
        "genitive",
        "plural"
      ]
    },
    {
      "form": "no-table-tags",
      "source": "declension",
      "tags": [
        "table-tags"
      ]
    },
    {
      "form": "ru-noun-table",
      "source": "declension",
      "tags": [
        "inflection-template"
      ]
    },
    {
      "form": "hard-stem",
      "source": "declension",
      "tags": [
        "class"
      ]
    },
    {
      "form": "accent-a",
      "source": "declension",
      "tags": [
        "class"
      ]
    },
    {
      "form": "сарба́з",
      "roman": "sarbáz",
      "source": "declension",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "сарба́зы",
      "roman": "sarbázy",
      "source": "declension",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "сарба́за",
      "roman": "sarbáza",
      "source": "declension",
      "tags": [
        "genitive",
        "singular"
      ]
    },
    {
      "form": "сарба́зов",
      "roman": "sarbázov",
      "source": "declension",
      "tags": [
        "genitive",
        "plural"
      ]
    },
    {
      "form": "сарба́зу",
      "roman": "sarbázu",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "сарба́зам",
      "roman": "sarbázam",
      "source": "declension",
      "tags": [
        "dative",
        "plural"
      ]
    },
    {
      "form": "сарба́з",
      "roman": "sarbáz",
      "source": "declension",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "сарба́зы",
      "roman": "sarbázy",
      "source": "declension",
      "tags": [
        "accusative",
        "plural"
      ]
    },
    {
      "form": "сарба́зом",
      "roman": "sarbázom",
      "source": "declension",
      "tags": [
        "instrumental",
        "singular"
      ]
    },
    {
      "form": "сарба́зами",
      "roman": "sarbázami",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    },
    {
      "form": "сарба́зе",
      "roman": "sarbáze",
      "source": "declension",
      "tags": [
        "prepositional",
        "singular"
      ]
    },
    {
      "form": "сарба́зах",
      "roman": "sarbázax",
      "source": "declension",
      "tags": [
        "plural",
        "prepositional"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "сарба́з"
      },
      "expansion": "сарба́з • (sarbáz) m inan (genitive сарба́за, nominative plural сарба́зы, genitive plural сарба́зов)",
      "name": "ru-noun+"
    }
  ],
  "lang": "Russian",
  "lang_code": "ru",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Pages with 2 entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Russian entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Russian links with redundant wikilinks",
          "parents": [
            "Links with redundant wikilinks",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Russian nouns with accent pattern a",
          "parents": [],
          "source": "w"
        }
      ],
      "id": "en-сарбаз-ru-noun-47DEQpj8",
      "tags": [
        "no-gloss"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "[sɐrˈbas]"
    }
  ],
  "word": "сарбаз"
}
{
  "categories": [
    "Kazakh entries with incorrect language header",
    "Kazakh lemmas",
    "Kazakh nouns",
    "Kazakh terms borrowed from Persian",
    "Kazakh terms derived from Persian",
    "Kazakh terms with redundant script codes",
    "Pages with 2 entries",
    "Pages with entries",
    "kk:Chess"
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "kk",
        "2": "fa",
        "3": "سرباز",
        "tr": "sarbâz"
      },
      "expansion": "Persian سرباز (sarbâz)",
      "name": "bor"
    }
  ],
  "etymology_text": "From Persian سرباز (sarbâz).",
  "forms": [
    {
      "form": "sarbaz",
      "tags": [
        "romanization"
      ]
    },
    {
      "form": "сарбаздар",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "no-table-tags",
      "source": "declension",
      "tags": [
        "table-tags"
      ]
    },
    {
      "form": "kk-noun-c",
      "source": "declension",
      "tags": [
        "inflection-template"
      ]
    },
    {
      "form": "сарбаз",
      "roman": "sarbaz",
      "source": "declension",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "сарбаздар",
      "roman": "sarbazdar",
      "source": "declension",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "сарбаздың",
      "roman": "sarbazdyñ",
      "source": "declension",
      "tags": [
        "genitive",
        "singular"
      ]
    },
    {
      "form": "сарбаздардың",
      "roman": "sarbazdardyñ",
      "source": "declension",
      "tags": [
        "genitive",
        "plural"
      ]
    },
    {
      "form": "сарбазға",
      "roman": "sarbazğa",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "сарбаздарға",
      "roman": "sarbazdarğa",
      "source": "declension",
      "tags": [
        "dative",
        "plural"
      ]
    },
    {
      "form": "сарбазды",
      "roman": "sarbazdy",
      "source": "declension",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "сарбаздарды",
      "roman": "sarbazdardy",
      "source": "declension",
      "tags": [
        "accusative",
        "plural"
      ]
    },
    {
      "form": "сарбазда",
      "roman": "sarbazda",
      "source": "declension",
      "tags": [
        "locative",
        "singular"
      ]
    },
    {
      "form": "сарбаздарда",
      "roman": "sarbazdarda",
      "source": "declension",
      "tags": [
        "locative",
        "plural"
      ]
    },
    {
      "form": "сарбаздан",
      "roman": "sarbazdan",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "сарбаздардан",
      "roman": "sarbazdardan",
      "source": "declension",
      "tags": [
        "ablative",
        "plural"
      ]
    },
    {
      "form": "сарбазбен",
      "roman": "sarbazben",
      "source": "declension",
      "tags": [
        "instrumental",
        "singular"
      ]
    },
    {
      "form": "сарбаздармен",
      "roman": "sarbazdarmen",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "pl": "сарбаздар"
      },
      "expansion": "сарбаз • (sarbaz) (nominative plural сарбаздар)",
      "name": "kk-noun"
    }
  ],
  "inflection_templates": [
    {
      "args": {
        "1": "сарбаз",
        "10": "сарбаздарда",
        "11": "сарбаздан",
        "12": "сарбаздардан",
        "13": "сарбазбен",
        "14": "сарбаздармен",
        "2": "сарбаздар",
        "3": "сарбаздың",
        "4": "сарбаздардың",
        "5": "сарбазға",
        "6": "сарбаздарға",
        "7": "сарбазды",
        "8": "сарбаздарды",
        "9": "сарбазда",
        "pl": ""
      },
      "name": "kk-decl-noun"
    }
  ],
  "lang": "Kazakh",
  "lang_code": "kk",
  "pos": "noun",
  "related": [
    {
      "roman": "patşa",
      "word": "патша"
    },
    {
      "roman": "uäzır",
      "word": "уәзір"
    },
    {
      "roman": "patşaiym",
      "word": "патшайым"
    },
    {
      "roman": "tura",
      "word": "тура"
    },
    {
      "roman": "pıl",
      "word": "піл"
    },
    {
      "english": "at",
      "word": "ат"
    },
    {
      "roman": "peşka",
      "word": "пешка"
    }
  ],
  "senses": [
    {
      "glosses": [
        "soldier, warrior"
      ],
      "links": [
        [
          "soldier",
          "soldier"
        ],
        [
          "warrior",
          "warrior"
        ]
      ]
    },
    {
      "categories": [
        "kk:Chess"
      ],
      "glosses": [
        "pawn"
      ],
      "links": [
        [
          "chess",
          "chess"
        ],
        [
          "pawn",
          "pawn"
        ]
      ],
      "raw_glosses": [
        "(chess) pawn"
      ],
      "topics": [
        "board-games",
        "chess",
        "games"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/sɑrˈbɑz/"
    }
  ],
  "synonyms": [
    {
      "roman": "jauyñer",
      "sense": "soldier",
      "word": "жауынгер"
    },
    {
      "roman": "peşka",
      "sense": "pawn",
      "word": "пешка"
    }
  ],
  "word": "сарбаз"
}

{
  "forms": [
    {
      "form": "сарба́з",
      "tags": [
        "canonical"
      ]
    },
    {
      "form": "sarbáz",
      "tags": [
        "romanization"
      ]
    },
    {
      "form": "сарба́за",
      "tags": [
        "genitive"
      ]
    },
    {
      "form": "сарба́зы",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "сарба́зов",
      "tags": [
        "genitive",
        "plural"
      ]
    },
    {
      "form": "no-table-tags",
      "source": "declension",
      "tags": [
        "table-tags"
      ]
    },
    {
      "form": "ru-noun-table",
      "source": "declension",
      "tags": [
        "inflection-template"
      ]
    },
    {
      "form": "hard-stem",
      "source": "declension",
      "tags": [
        "class"
      ]
    },
    {
      "form": "accent-a",
      "source": "declension",
      "tags": [
        "class"
      ]
    },
    {
      "form": "сарба́з",
      "roman": "sarbáz",
      "source": "declension",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "сарба́зы",
      "roman": "sarbázy",
      "source": "declension",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "сарба́за",
      "roman": "sarbáza",
      "source": "declension",
      "tags": [
        "genitive",
        "singular"
      ]
    },
    {
      "form": "сарба́зов",
      "roman": "sarbázov",
      "source": "declension",
      "tags": [
        "genitive",
        "plural"
      ]
    },
    {
      "form": "сарба́зу",
      "roman": "sarbázu",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "сарба́зам",
      "roman": "sarbázam",
      "source": "declension",
      "tags": [
        "dative",
        "plural"
      ]
    },
    {
      "form": "сарба́з",
      "roman": "sarbáz",
      "source": "declension",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "сарба́зы",
      "roman": "sarbázy",
      "source": "declension",
      "tags": [
        "accusative",
        "plural"
      ]
    },
    {
      "form": "сарба́зом",
      "roman": "sarbázom",
      "source": "declension",
      "tags": [
        "instrumental",
        "singular"
      ]
    },
    {
      "form": "сарба́зами",
      "roman": "sarbázami",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    },
    {
      "form": "сарба́зе",
      "roman": "sarbáze",
      "source": "declension",
      "tags": [
        "prepositional",
        "singular"
      ]
    },
    {
      "form": "сарба́зах",
      "roman": "sarbázax",
      "source": "declension",
      "tags": [
        "plural",
        "prepositional"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "сарба́з"
      },
      "expansion": "сарба́з • (sarbáz) m inan (genitive сарба́за, nominative plural сарба́зы, genitive plural сарба́зов)",
      "name": "ru-noun+"
    }
  ],
  "lang": "Russian",
  "lang_code": "ru",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "Pages with 2 entries",
        "Pages with entries",
        "Russian 2-syllable words",
        "Russian entries with incorrect language header",
        "Russian hard-stem masculine-form accent-a nouns",
        "Russian hard-stem masculine-form nouns",
        "Russian inanimate nouns",
        "Russian lemmas",
        "Russian links with redundant wikilinks",
        "Russian masculine nouns",
        "Russian nouns",
        "Russian nouns with accent pattern a",
        "Russian terms with IPA pronunciation"
      ],
      "tags": [
        "no-gloss"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "[sɐrˈbas]"
    }
  ],
  "word": "сарбаз"
}

Download raw JSONL data for сарбаз meaning in All languages combined (6.4kB)


This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2025-01-25 from the enwiktionary dump dated 2025-01-20 using wiktextract (c15a5ce and 5c11237). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.