"羜" meaning in All languages combined

See 羜 on Wiktionary

Character [Chinese]

IPA: /ʈ͡ʂu⁵¹/ [Mandarin, Sinological-IPA], /t͡sʰyː¹³/ [Cantonese, Sinological-IPA], /ʈ͡ʂu⁵¹/, /t͡sʰyː¹³/ Chinese transliterations: zhù [Mandarin, Pinyin], zhu⁴ [Mandarin, Pinyin], ㄓㄨˋ [Mandarin, bopomofo], cyu⁵ [Cantonese, Jyutping], zhù [Hanyu-Pinyin, Mandarin], jhù [Mandarin, Tongyong-Pinyin], chu⁴ [Mandarin, Wade-Giles], jù [Mandarin, Yale], juh [Gwoyeu-Romatsyh, Mandarin], чжу [Mandarin, Palladius], čžu [Mandarin, Palladius], chyúh [Cantonese, Yale], tsy⁵ [Cantonese, Pinyin], qu⁵ [Cantonese, Guangdong-Romanization], drjoX [Middle-Chinese], /*daʔ/ [Old-Chinese, Zhengzhang]
Etymology: Phono-semantic compound (形聲/形声, OC *daʔ) : semantic 羊 (“sheep”) + phonetic 宁 (OC *da, *daʔ) From Proto-Sino-Tibetan *ra (“goat”) (STEDT). Cognate with Tibetan ར (ra, “goat”), Pattani lá (“goat”), etc. Etymology templates: {{categorize|zh|Han phono-semantic compounds}}, {{liushu|psc|adj=|nocap=|pron=OC *daʔ}} Phono-semantic compound (形聲/形声, OC *daʔ), {{Han compound|羊|宁|c1=s|c2=p|ls=psc|t1=sheep}} Phono-semantic compound (形聲/形声, OC *daʔ) : semantic 羊 (“sheep”) + phonetic 宁 (OC *da, *daʔ), {{inh|zh|sit-pro|*ra||goat}} Proto-Sino-Tibetan *ra (“goat”), {{cog|bo|ར||goat}} Tibetan ར (ra, “goat”), {{cog|lae|lá||goat}} Pattani lá (“goat”) Head templates: {{head|zh|hanzi}} 羜
  1. (obsolete) five-month-old lamb Tags: obsolete

Character [Japanese]

  1. lamb Tags: Hyōgai, kanji, uncommon
    Sense id: en-羜-ja-character-nSM4S-P9 Categories (other): Uncommon kanji

Character [Korean]

Forms: jeo [romanization], [hangeul], jeo [revised], chŏ [McCune-Reischauer], ce [Yale]
Head templates: {{head|ko|Han characters|sc=Kore|sort=저|tr=jeo}} 羜 • (jeo), {{ko-hanja|eumhun=|hangeul=저|mr=chŏ|rv=jeo|y=ce}} 羜 • (jeo) (hangeul 저, revised jeo, McCune–Reischauer chŏ, Yale ce)
  1. 새끼 양: lamb, lambkin

Character [Translingual]

Forms: 123 [radical], 羊+5 [radical], 11 [strokes], 廿手十一弓 [cangjie-input], 8352₁ [four-corner], ⿰羊宁 [composition]
Head templates: {{Han char|as=05|canj=TQJMN|four=83521|ids=⿰羊宁|rad=羊|rn=123|sn=11}} 羜 (Kangxi radical 123, 羊+5, 11 strokes, cangjie input 廿手十一弓 (TQJMN), four-corner 8352₁, composition ⿰羊宁)
  1. lamb

Download JSON data for 羜 meaning in All languages combined (6.3kB)

{
  "forms": [
    {
      "form": "123",
      "tags": [
        "radical"
      ]
    },
    {
      "form": "羊+5",
      "tags": [
        "radical"
      ]
    },
    {
      "form": "11",
      "tags": [
        "strokes"
      ]
    },
    {
      "form": "廿手十一弓",
      "roman": "TQJMN",
      "tags": [
        "cangjie-input"
      ]
    },
    {
      "form": "8352₁",
      "tags": [
        "four-corner"
      ]
    },
    {
      "form": "⿰羊宁",
      "tags": [
        "composition"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "as": "05",
        "canj": "TQJMN",
        "four": "83521",
        "ids": "⿰羊宁",
        "rad": "羊",
        "rn": "123",
        "sn": "11"
      },
      "expansion": "羜 (Kangxi radical 123, 羊+5, 11 strokes, cangjie input 廿手十一弓 (TQJMN), four-corner 8352₁, composition ⿰羊宁)",
      "name": "Han char"
    }
  ],
  "lang": "Translingual",
  "lang_code": "mul",
  "pos": "character",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Translingual Han characters with definition lines",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Translingual entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Translingual terms with non-redundant non-automated sortkeys",
          "parents": [
            "Terms with non-redundant non-automated sortkeys",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Translingual terms with redundant script codes",
          "parents": [
            "Terms with redundant script codes",
            "Entry maintenance"
          ],
          "source": "w"
        }
      ],
      "glosses": [
        "lamb"
      ],
      "id": "en-羜-mul-character-nSM4S-P9",
      "links": [
        [
          "lamb",
          "lamb"
        ]
      ],
      "raw_tags": [
        "han"
      ]
    }
  ],
  "word": "羜"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "zh",
        "2": "Han phono-semantic compounds"
      },
      "expansion": "",
      "name": "categorize"
    },
    {
      "args": {
        "1": "psc",
        "adj": "",
        "nocap": "",
        "pron": "OC *daʔ"
      },
      "expansion": "Phono-semantic compound (形聲/形声, OC *daʔ)",
      "name": "liushu"
    },
    {
      "args": {
        "1": "羊",
        "2": "宁",
        "c1": "s",
        "c2": "p",
        "ls": "psc",
        "t1": "sheep"
      },
      "expansion": "Phono-semantic compound (形聲/形声, OC *daʔ) : semantic 羊 (“sheep”) + phonetic 宁 (OC *da, *daʔ)",
      "name": "Han compound"
    },
    {
      "args": {
        "1": "zh",
        "2": "sit-pro",
        "3": "*ra",
        "4": "",
        "5": "goat"
      },
      "expansion": "Proto-Sino-Tibetan *ra (“goat”)",
      "name": "inh"
    },
    {
      "args": {
        "1": "bo",
        "2": "ར",
        "3": "",
        "4": "goat"
      },
      "expansion": "Tibetan ར (ra, “goat”)",
      "name": "cog"
    },
    {
      "args": {
        "1": "lae",
        "2": "lá",
        "3": "",
        "4": "goat"
      },
      "expansion": "Pattani lá (“goat”)",
      "name": "cog"
    }
  ],
  "etymology_text": "Phono-semantic compound (形聲/形声, OC *daʔ) : semantic 羊 (“sheep”) + phonetic 宁 (OC *da, *daʔ)\nFrom Proto-Sino-Tibetan *ra (“goat”) (STEDT). Cognate with Tibetan ར (ra, “goat”), Pattani lá (“goat”), etc.",
  "head_templates": [
    {
      "args": {
        "1": "zh",
        "2": "hanzi"
      },
      "expansion": "羜",
      "name": "head"
    }
  ],
  "lang": "Chinese",
  "lang_code": "zh",
  "pos": "character",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Chinese entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Chinese terms with non-redundant manual transliterations",
          "parents": [
            "Terms with non-redundant manual transliterations",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Han phono-semantic compounds",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pattani terms in nonstandard scripts",
          "parents": [
            "Terms in nonstandard scripts",
            "Entry maintenance"
          ],
          "source": "w"
        }
      ],
      "glosses": [
        "five-month-old lamb"
      ],
      "id": "en-羜-zh-character-O5sZ7rNy",
      "links": [
        [
          "five",
          "five"
        ],
        [
          "month",
          "month"
        ],
        [
          "lamb",
          "lamb"
        ]
      ],
      "raw_glosses": [
        "(obsolete) five-month-old lamb"
      ],
      "tags": [
        "obsolete"
      ]
    }
  ],
  "sounds": [
    {
      "tags": [
        "Mandarin",
        "Pinyin"
      ],
      "zh-pron": "zhù"
    },
    {
      "tags": [
        "Mandarin",
        "Pinyin"
      ],
      "zh-pron": "zhu⁴"
    },
    {
      "tags": [
        "Mandarin",
        "bopomofo"
      ],
      "zh-pron": "ㄓㄨˋ"
    },
    {
      "tags": [
        "Cantonese",
        "Jyutping"
      ],
      "zh-pron": "cyu⁵"
    },
    {
      "tags": [
        "Hanyu-Pinyin",
        "Mandarin"
      ],
      "zh-pron": "zhù"
    },
    {
      "tags": [
        "Mandarin",
        "Tongyong-Pinyin"
      ],
      "zh-pron": "jhù"
    },
    {
      "tags": [
        "Mandarin",
        "Wade-Giles"
      ],
      "zh-pron": "chu⁴"
    },
    {
      "tags": [
        "Mandarin",
        "Yale"
      ],
      "zh-pron": "jù"
    },
    {
      "tags": [
        "Gwoyeu-Romatsyh",
        "Mandarin"
      ],
      "zh-pron": "juh"
    },
    {
      "tags": [
        "Mandarin",
        "Palladius"
      ],
      "zh-pron": "чжу"
    },
    {
      "tags": [
        "Mandarin",
        "Palladius"
      ],
      "zh-pron": "čžu"
    },
    {
      "ipa": "/ʈ͡ʂu⁵¹/",
      "tags": [
        "Mandarin",
        "Sinological-IPA"
      ]
    },
    {
      "tags": [
        "Cantonese",
        "Yale"
      ],
      "zh-pron": "chyúh"
    },
    {
      "tags": [
        "Cantonese",
        "Pinyin"
      ],
      "zh-pron": "tsy⁵"
    },
    {
      "tags": [
        "Cantonese",
        "Guangdong-Romanization"
      ],
      "zh-pron": "qu⁵"
    },
    {
      "ipa": "/t͡sʰyː¹³/",
      "tags": [
        "Cantonese",
        "Sinological-IPA"
      ]
    },
    {
      "tags": [
        "Middle-Chinese"
      ],
      "zh-pron": "drjoX"
    },
    {
      "tags": [
        "Old-Chinese",
        "Zhengzhang"
      ],
      "zh-pron": "/*daʔ/"
    },
    {
      "ipa": "/ʈ͡ʂu⁵¹/"
    },
    {
      "ipa": "/t͡sʰyː¹³/"
    },
    {
      "other": "/*daʔ/"
    }
  ],
  "word": "羜"
}

{
  "lang": "Japanese",
  "lang_code": "ja",
  "pos": "character",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Uncommon kanji",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "lamb"
      ],
      "id": "en-羜-ja-character-nSM4S-P9",
      "links": [
        [
          "lamb",
          "lamb"
        ]
      ],
      "tags": [
        "Hyōgai",
        "kanji",
        "uncommon"
      ]
    }
  ],
  "word": "羜"
}

{
  "forms": [
    {
      "form": "jeo",
      "tags": [
        "romanization"
      ]
    },
    {
      "form": "저",
      "tags": [
        "hangeul"
      ]
    },
    {
      "form": "jeo",
      "tags": [
        "revised"
      ]
    },
    {
      "form": "chŏ",
      "tags": [
        "McCune-Reischauer"
      ]
    },
    {
      "form": "ce",
      "tags": [
        "Yale"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "ko",
        "2": "Han characters",
        "sc": "Kore",
        "sort": "저",
        "tr": "jeo"
      },
      "expansion": "羜 • (jeo)",
      "name": "head"
    },
    {
      "args": {
        "eumhun": "",
        "hangeul": "저",
        "mr": "chŏ",
        "rv": "jeo",
        "y": "ce"
      },
      "expansion": "羜 • (jeo) (hangeul 저, revised jeo, McCune–Reischauer chŏ, Yale ce)",
      "name": "ko-hanja"
    }
  ],
  "lang": "Korean",
  "lang_code": "ko",
  "pos": "character",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Korean entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Korean terms with non-redundant non-automated sortkeys",
          "parents": [
            "Terms with non-redundant non-automated sortkeys",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Korean terms with redundant script codes",
          "parents": [
            "Terms with redundant script codes",
            "Entry maintenance"
          ],
          "source": "w"
        }
      ],
      "glosses": [
        "새끼 양: lamb, lambkin"
      ],
      "id": "en-羜-ko-character-94-7eVuF",
      "links": [
        [
          "새끼 양",
          "새끼 양"
        ],
        [
          "lamb",
          "lamb"
        ],
        [
          "lambkin",
          "lambkin"
        ]
      ],
      "raw_tags": [
        "Hanja"
      ]
    }
  ],
  "word": "羜"
}
{
  "etymology_templates": [
    {
      "args": {
        "1": "zh",
        "2": "Han phono-semantic compounds"
      },
      "expansion": "",
      "name": "categorize"
    },
    {
      "args": {
        "1": "psc",
        "adj": "",
        "nocap": "",
        "pron": "OC *daʔ"
      },
      "expansion": "Phono-semantic compound (形聲/形声, OC *daʔ)",
      "name": "liushu"
    },
    {
      "args": {
        "1": "羊",
        "2": "宁",
        "c1": "s",
        "c2": "p",
        "ls": "psc",
        "t1": "sheep"
      },
      "expansion": "Phono-semantic compound (形聲/形声, OC *daʔ) : semantic 羊 (“sheep”) + phonetic 宁 (OC *da, *daʔ)",
      "name": "Han compound"
    },
    {
      "args": {
        "1": "zh",
        "2": "sit-pro",
        "3": "*ra",
        "4": "",
        "5": "goat"
      },
      "expansion": "Proto-Sino-Tibetan *ra (“goat”)",
      "name": "inh"
    },
    {
      "args": {
        "1": "bo",
        "2": "ར",
        "3": "",
        "4": "goat"
      },
      "expansion": "Tibetan ར (ra, “goat”)",
      "name": "cog"
    },
    {
      "args": {
        "1": "lae",
        "2": "lá",
        "3": "",
        "4": "goat"
      },
      "expansion": "Pattani lá (“goat”)",
      "name": "cog"
    }
  ],
  "etymology_text": "Phono-semantic compound (形聲/形声, OC *daʔ) : semantic 羊 (“sheep”) + phonetic 宁 (OC *da, *daʔ)\nFrom Proto-Sino-Tibetan *ra (“goat”) (STEDT). Cognate with Tibetan ར (ra, “goat”), Pattani lá (“goat”), etc.",
  "head_templates": [
    {
      "args": {
        "1": "zh",
        "2": "hanzi"
      },
      "expansion": "羜",
      "name": "head"
    }
  ],
  "lang": "Chinese",
  "lang_code": "zh",
  "pos": "character",
  "senses": [
    {
      "categories": [
        "Cantonese lemmas",
        "Cantonese nouns",
        "Chinese Han characters",
        "Chinese entries with incorrect language header",
        "Chinese lemmas",
        "Chinese nouns",
        "Chinese terms derived from Proto-Sino-Tibetan",
        "Chinese terms inherited from Proto-Sino-Tibetan",
        "Chinese terms with IPA pronunciation",
        "Chinese terms with non-redundant manual transliterations",
        "Chinese terms with obsolete senses",
        "Han phono-semantic compounds",
        "Mandarin lemmas",
        "Mandarin nouns",
        "Middle Chinese lemmas",
        "Old Chinese lemmas",
        "Pattani terms in nonstandard scripts"
      ],
      "glosses": [
        "five-month-old lamb"
      ],
      "links": [
        [
          "five",
          "five"
        ],
        [
          "month",
          "month"
        ],
        [
          "lamb",
          "lamb"
        ]
      ],
      "raw_glosses": [
        "(obsolete) five-month-old lamb"
      ],
      "tags": [
        "obsolete"
      ]
    }
  ],
  "sounds": [
    {
      "tags": [
        "Mandarin",
        "Pinyin"
      ],
      "zh-pron": "zhù"
    },
    {
      "tags": [
        "Mandarin",
        "Pinyin"
      ],
      "zh-pron": "zhu⁴"
    },
    {
      "tags": [
        "Mandarin",
        "bopomofo"
      ],
      "zh-pron": "ㄓㄨˋ"
    },
    {
      "tags": [
        "Cantonese",
        "Jyutping"
      ],
      "zh-pron": "cyu⁵"
    },
    {
      "tags": [
        "Hanyu-Pinyin",
        "Mandarin"
      ],
      "zh-pron": "zhù"
    },
    {
      "tags": [
        "Mandarin",
        "Tongyong-Pinyin"
      ],
      "zh-pron": "jhù"
    },
    {
      "tags": [
        "Mandarin",
        "Wade-Giles"
      ],
      "zh-pron": "chu⁴"
    },
    {
      "tags": [
        "Mandarin",
        "Yale"
      ],
      "zh-pron": "jù"
    },
    {
      "tags": [
        "Gwoyeu-Romatsyh",
        "Mandarin"
      ],
      "zh-pron": "juh"
    },
    {
      "tags": [
        "Mandarin",
        "Palladius"
      ],
      "zh-pron": "чжу"
    },
    {
      "tags": [
        "Mandarin",
        "Palladius"
      ],
      "zh-pron": "čžu"
    },
    {
      "ipa": "/ʈ͡ʂu⁵¹/",
      "tags": [
        "Mandarin",
        "Sinological-IPA"
      ]
    },
    {
      "tags": [
        "Cantonese",
        "Yale"
      ],
      "zh-pron": "chyúh"
    },
    {
      "tags": [
        "Cantonese",
        "Pinyin"
      ],
      "zh-pron": "tsy⁵"
    },
    {
      "tags": [
        "Cantonese",
        "Guangdong-Romanization"
      ],
      "zh-pron": "qu⁵"
    },
    {
      "ipa": "/t͡sʰyː¹³/",
      "tags": [
        "Cantonese",
        "Sinological-IPA"
      ]
    },
    {
      "tags": [
        "Middle-Chinese"
      ],
      "zh-pron": "drjoX"
    },
    {
      "tags": [
        "Old-Chinese",
        "Zhengzhang"
      ],
      "zh-pron": "/*daʔ/"
    },
    {
      "ipa": "/ʈ͡ʂu⁵¹/"
    },
    {
      "ipa": "/t͡sʰyː¹³/"
    },
    {
      "other": "/*daʔ/"
    }
  ],
  "word": "羜"
}

{
  "lang": "Japanese",
  "lang_code": "ja",
  "pos": "character",
  "senses": [
    {
      "categories": [
        "Japanese Han characters",
        "Uncommon kanji"
      ],
      "glosses": [
        "lamb"
      ],
      "links": [
        [
          "lamb",
          "lamb"
        ]
      ],
      "tags": [
        "Hyōgai",
        "kanji",
        "uncommon"
      ]
    }
  ],
  "word": "羜"
}

{
  "forms": [
    {
      "form": "jeo",
      "tags": [
        "romanization"
      ]
    },
    {
      "form": "저",
      "tags": [
        "hangeul"
      ]
    },
    {
      "form": "jeo",
      "tags": [
        "revised"
      ]
    },
    {
      "form": "chŏ",
      "tags": [
        "McCune-Reischauer"
      ]
    },
    {
      "form": "ce",
      "tags": [
        "Yale"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "ko",
        "2": "Han characters",
        "sc": "Kore",
        "sort": "저",
        "tr": "jeo"
      },
      "expansion": "羜 • (jeo)",
      "name": "head"
    },
    {
      "args": {
        "eumhun": "",
        "hangeul": "저",
        "mr": "chŏ",
        "rv": "jeo",
        "y": "ce"
      },
      "expansion": "羜 • (jeo) (hangeul 저, revised jeo, McCune–Reischauer chŏ, Yale ce)",
      "name": "ko-hanja"
    }
  ],
  "lang": "Korean",
  "lang_code": "ko",
  "pos": "character",
  "senses": [
    {
      "categories": [
        "Korean Han characters",
        "Korean entries with incorrect language header",
        "Korean lemmas",
        "Korean terms with non-redundant non-automated sortkeys",
        "Korean terms with redundant script codes"
      ],
      "glosses": [
        "새끼 양: lamb, lambkin"
      ],
      "links": [
        [
          "새끼 양",
          "새끼 양"
        ],
        [
          "lamb",
          "lamb"
        ],
        [
          "lambkin",
          "lambkin"
        ]
      ],
      "raw_tags": [
        "Hanja"
      ]
    }
  ],
  "word": "羜"
}

{
  "forms": [
    {
      "form": "123",
      "tags": [
        "radical"
      ]
    },
    {
      "form": "羊+5",
      "tags": [
        "radical"
      ]
    },
    {
      "form": "11",
      "tags": [
        "strokes"
      ]
    },
    {
      "form": "廿手十一弓",
      "roman": "TQJMN",
      "tags": [
        "cangjie-input"
      ]
    },
    {
      "form": "8352₁",
      "tags": [
        "four-corner"
      ]
    },
    {
      "form": "⿰羊宁",
      "tags": [
        "composition"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "as": "05",
        "canj": "TQJMN",
        "four": "83521",
        "ids": "⿰羊宁",
        "rad": "羊",
        "rn": "123",
        "sn": "11"
      },
      "expansion": "羜 (Kangxi radical 123, 羊+5, 11 strokes, cangjie input 廿手十一弓 (TQJMN), four-corner 8352₁, composition ⿰羊宁)",
      "name": "Han char"
    }
  ],
  "lang": "Translingual",
  "lang_code": "mul",
  "pos": "character",
  "senses": [
    {
      "categories": [
        "Han script characters",
        "Translingual Han characters with definition lines",
        "Translingual entries with incorrect language header",
        "Translingual lemmas",
        "Translingual symbols",
        "Translingual terms with non-redundant non-automated sortkeys",
        "Translingual terms with redundant script codes"
      ],
      "glosses": [
        "lamb"
      ],
      "links": [
        [
          "lamb",
          "lamb"
        ]
      ],
      "raw_tags": [
        "han"
      ]
    }
  ],
  "word": "羜"
}
{
  "called_from": "pronunciations/296/20230324",
  "msg": "Zh-pron header not found in zh_pron_tags or tags: '(Standard Cantonese, Guangzhou–Hong Kong)⁺'",
  "path": [
    "羜"
  ],
  "section": "Chinese",
  "subsection": "",
  "title": "羜",
  "trace": ""
}

{
  "called_from": "pronunciations/296/20230324",
  "msg": "Zh-pron header not found in zh_pron_tags or tags: '(Standard Chinese)⁺'",
  "path": [
    "羜"
  ],
  "section": "Chinese",
  "subsection": "",
  "title": "羜",
  "trace": ""
}

This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2024-06-04 from the enwiktionary dump dated 2024-05-02 using wiktextract (e9e0a99 and db5a844). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.