"tausi" meaning in All languages combined

See tausi on Wiktionary

Noun [English]

Etymology: From Tagalog tausi, from Philippine Hokkien 豆豉 (tāu-sīⁿ). Doublet of douchi. Etymology templates: {{bor|en|tl|tausi}} Tagalog tausi, {{der|en|nan-hbl-PH|豆豉|tr=tāu-sīⁿ}} Philippine Hokkien 豆豉 (tāu-sīⁿ), {{doublet|en|douchi}} Doublet of douchi Head templates: {{en-noun|-}} tausi (uncountable)
  1. (Philippines) Salted black beans. Tags: Philippines, uncountable

Noun [Norwegian Nynorsk]

Head templates: {{head|nn|nounf|g=f}} tausi f
  1. (non-standard since 2012) definite singular of taus Tags: definite, feminine, form-of, nonstandard, singular Form of: taus

Noun [Samoan]

Head templates: {{head|sm|noun}} tausi
  1. wife of a talking chief
    Sense id: en-tausi-sm-noun-hw8VaZ-P

Verb [Samoan]

Head templates: {{head|sm|verb}} tausi
  1. care for
    Sense id: en-tausi-sm-verb-bBuoGzNc Categories (other): Pages with 5 entries, Pages with entries, Samoan entries with incorrect language header Disambiguation of Pages with 5 entries: 34 66 Disambiguation of Pages with entries: 34 66 Disambiguation of Samoan entries with incorrect language header: 26 74

Noun [Spanish]

IPA: /ˈtausi/, [ˈt̪au̯.si]
Rhymes: -ausi Etymology: Borrowed from Cantonese 豆豉 (dau⁶ si⁶). Etymology templates: {{bor+|es|yue|-}} Borrowed from Cantonese, {{sup|6}} ⁶, {{sup|6}} ⁶, {{zh-m|豆豉|tr=dau⁶ si⁶}} 豆豉 (dau⁶ si⁶) Head templates: {{es-noun|m|?}} tausi m
  1. (Peru) douchi Tags: Peru, masculine Categories (topical): Foods

Noun [Swahili]

Audio: Sw-ke-tausi.flac Forms: tausi class IX [canonical], tausi class X [plural]
Etymology: Borrowed from Arabic طَاوُوس (ṭāwūs). Etymology templates: {{bor+|sw|ar|طَاوُوس}} Borrowed from Arabic طَاوُوس (ṭāwūs) Head templates: {{sw-noun|n}} tausi class IX (plural tausi class X)
  1. peacock Categories (lifeform): Fowls
    Sense id: en-tausi-sw-noun-oJXHcT4z Categories (other): Pages with 5 entries, Pages with entries, Swahili entries with incorrect language header
{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "tl",
        "3": "tausi"
      },
      "expansion": "Tagalog tausi",
      "name": "bor"
    },
    {
      "args": {
        "1": "en",
        "2": "nan-hbl-PH",
        "3": "豆豉",
        "tr": "tāu-sīⁿ"
      },
      "expansion": "Philippine Hokkien 豆豉 (tāu-sīⁿ)",
      "name": "der"
    },
    {
      "args": {
        "1": "en",
        "2": "douchi"
      },
      "expansion": "Doublet of douchi",
      "name": "doublet"
    }
  ],
  "etymology_text": "From Tagalog tausi, from Philippine Hokkien 豆豉 (tāu-sīⁿ). Doublet of douchi.",
  "head_templates": [
    {
      "args": {
        "1": "-"
      },
      "expansion": "tausi (uncountable)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 5 entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Philippine English",
          "parents": [],
          "source": "w"
        }
      ],
      "examples": [
        {
          "bold_text_offsets": [
            [
              114,
              119
            ]
          ],
          "ref": "1979, Sugar News, volume 55, Manila, →ISSN, →OCLC, page 43, column 2:",
          "text": "Foods with high sodium content (not allowed or limited in high blood pressure): […] beans with added salt such as tausi, tahure, misu; […]",
          "type": "quote"
        },
        {
          "bold_text_offsets": [
            [
              45,
              50
            ]
          ],
          "ref": "2012, Amy Besa, Romy Dorotan, “Chilled Lobster Kinilaw”, in Memories of Philippine Kitchens: Stories and Recipes from Far and Near, 2nd edition, New York, N.Y.: Stewart, Tabori & Chang, →ISBN, chapter 2 (Food That Was Always Ours), page 51, column 3:",
          "text": "Other additions: salted duck eggs, tomatoes, tausi [salted black beans], and garlic",
          "type": "quote"
        },
        {
          "bold_text_offsets": [
            [
              85,
              90
            ]
          ],
          "ref": "2014 November, Claude Tayag, Mary Ann Quioc, “Dilis-cious rice”, in Linamnam: Eating One’s Way Around the Philippines, 2nd edition, Mandaluyong: Anvil Publishing, →ISBN, “Metro Manila” section, page 76:",
          "text": "At Fely J’s, the dilis or dried mini anchovies are crisp fried with a slight hint of tausi and served atop hot steaming jasmine rice.",
          "type": "quote"
        },
        {
          "bold_text_offsets": [
            [
              14,
              19
            ],
            [
              91,
              96
            ]
          ],
          "ref": "2022 March 7–13, “Tochong Bangus”, in Mindanao Examiner, Mindanao, →OCLC, page 9, columns 3–4:",
          "text": "2 Tablespoons tausi salted black beans […] Once the onion softens, add vinegar, tahure and tausi.",
          "type": "quote"
        }
      ],
      "glosses": [
        "Salted black beans."
      ],
      "id": "en-tausi-en-noun-z17UoEFO",
      "links": [
        [
          "Salted",
          "salted"
        ],
        [
          "black bean",
          "black bean"
        ]
      ],
      "raw_glosses": [
        "(Philippines) Salted black beans."
      ],
      "tags": [
        "Philippines",
        "uncountable"
      ]
    }
  ],
  "word": "tausi"
}

{
  "head_templates": [
    {
      "args": {
        "1": "nn",
        "2": "nounf",
        "g": "f"
      },
      "expansion": "tausi f",
      "name": "head"
    }
  ],
  "lang": "Norwegian Nynorsk",
  "lang_code": "nn",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Norwegian Nynorsk entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 5 entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        }
      ],
      "form_of": [
        {
          "word": "taus"
        }
      ],
      "glosses": [
        "definite singular of taus"
      ],
      "id": "en-tausi-nn-noun-A9OZaFl3",
      "links": [
        [
          "taus",
          "taus#Norwegian_Nynorsk"
        ]
      ],
      "raw_glosses": [
        "(non-standard since 2012) definite singular of taus"
      ],
      "tags": [
        "definite",
        "feminine",
        "form-of",
        "nonstandard",
        "singular"
      ]
    }
  ],
  "word": "tausi"
}

{
  "head_templates": [
    {
      "args": {
        "1": "sm",
        "2": "verb"
      },
      "expansion": "tausi",
      "name": "head"
    }
  ],
  "lang": "Samoan",
  "lang_code": "sm",
  "pos": "verb",
  "senses": [
    {
      "categories": [
        {
          "_dis": "34 66",
          "kind": "other",
          "name": "Pages with 5 entries",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "34 66",
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "26 74",
          "kind": "other",
          "name": "Samoan entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        }
      ],
      "glosses": [
        "care for"
      ],
      "id": "en-tausi-sm-verb-bBuoGzNc",
      "links": [
        [
          "care for",
          "care for"
        ]
      ]
    }
  ],
  "word": "tausi"
}

{
  "head_templates": [
    {
      "args": {
        "1": "sm",
        "2": "noun"
      },
      "expansion": "tausi",
      "name": "head"
    }
  ],
  "lang": "Samoan",
  "lang_code": "sm",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "wife of a talking chief"
      ],
      "id": "en-tausi-sm-noun-hw8VaZ-P",
      "links": [
        [
          "wife",
          "wife"
        ],
        [
          "chief",
          "chief"
        ]
      ]
    }
  ],
  "word": "tausi"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "es",
        "2": "yue",
        "3": "-"
      },
      "expansion": "Borrowed from Cantonese",
      "name": "bor+"
    },
    {
      "args": {
        "1": "6"
      },
      "expansion": "⁶",
      "name": "sup"
    },
    {
      "args": {
        "1": "6"
      },
      "expansion": "⁶",
      "name": "sup"
    },
    {
      "args": {
        "1": "豆豉",
        "tr": "dau⁶ si⁶"
      },
      "expansion": "豆豉 (dau⁶ si⁶)",
      "name": "zh-m"
    }
  ],
  "etymology_text": "Borrowed from Cantonese 豆豉 (dau⁶ si⁶).",
  "head_templates": [
    {
      "args": {
        "1": "m",
        "2": "?"
      },
      "expansion": "tausi m",
      "name": "es-noun"
    }
  ],
  "hyphenation": [
    "tau‧si"
  ],
  "lang": "Spanish",
  "lang_code": "es",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Pages with 5 entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Peruvian Spanish",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Spanish entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "es",
          "name": "Foods",
          "orig": "es:Foods",
          "parents": [
            "Eating",
            "Food and drink",
            "Human behaviour",
            "All topics",
            "Human",
            "Fundamental"
          ],
          "source": "w"
        }
      ],
      "examples": [
        {
          "text": "Meronym: chifa (“Peruvian Chinese cuisine”)"
        }
      ],
      "glosses": [
        "douchi"
      ],
      "id": "en-tausi-es-noun-d48YtZB-",
      "links": [
        [
          "douchi",
          "douchi"
        ]
      ],
      "raw_glosses": [
        "(Peru) douchi"
      ],
      "tags": [
        "Peru",
        "masculine"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/ˈtausi/"
    },
    {
      "ipa": "[ˈt̪au̯.si]"
    },
    {
      "rhymes": "-ausi"
    }
  ],
  "word": "tausi"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "sw",
        "2": "ar",
        "3": "طَاوُوس"
      },
      "expansion": "Borrowed from Arabic طَاوُوس (ṭāwūs)",
      "name": "bor+"
    }
  ],
  "etymology_text": "Borrowed from Arabic طَاوُوس (ṭāwūs).",
  "forms": [
    {
      "form": "tausi class IX",
      "tags": [
        "canonical"
      ]
    },
    {
      "form": "tausi class X",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "n"
      },
      "expansion": "tausi class IX (plural tausi class X)",
      "name": "sw-noun"
    }
  ],
  "lang": "Swahili",
  "lang_code": "sw",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Pages with 5 entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Swahili entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "lifeform",
          "langcode": "sw",
          "name": "Fowls",
          "orig": "sw:Fowls",
          "parents": [
            "Birds",
            "Vertebrates",
            "Chordates",
            "Animals",
            "Lifeforms",
            "All topics",
            "Life",
            "Fundamental",
            "Nature"
          ],
          "source": "w"
        }
      ],
      "glosses": [
        "peacock"
      ],
      "id": "en-tausi-sw-noun-oJXHcT4z",
      "links": [
        [
          "peacock",
          "peacock"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "audio": "Sw-ke-tausi.flac",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/7/74/Sw-ke-tausi.flac/Sw-ke-tausi.flac.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/7/74/Sw-ke-tausi.flac/Sw-ke-tausi.flac.ogg"
    }
  ],
  "word": "tausi"
}
{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "tl",
        "3": "tausi"
      },
      "expansion": "Tagalog tausi",
      "name": "bor"
    },
    {
      "args": {
        "1": "en",
        "2": "nan-hbl-PH",
        "3": "豆豉",
        "tr": "tāu-sīⁿ"
      },
      "expansion": "Philippine Hokkien 豆豉 (tāu-sīⁿ)",
      "name": "der"
    },
    {
      "args": {
        "1": "en",
        "2": "douchi"
      },
      "expansion": "Doublet of douchi",
      "name": "doublet"
    }
  ],
  "etymology_text": "From Tagalog tausi, from Philippine Hokkien 豆豉 (tāu-sīⁿ). Doublet of douchi.",
  "head_templates": [
    {
      "args": {
        "1": "-"
      },
      "expansion": "tausi (uncountable)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "English doublets",
        "English entries with incorrect language header",
        "English lemmas",
        "English nouns",
        "English terms borrowed from Tagalog",
        "English terms derived from Philippine Hokkien",
        "English terms derived from Tagalog",
        "English terms with quotations",
        "English uncountable nouns",
        "Pages with 5 entries",
        "Pages with entries",
        "Philippine English"
      ],
      "examples": [
        {
          "bold_text_offsets": [
            [
              114,
              119
            ]
          ],
          "ref": "1979, Sugar News, volume 55, Manila, →ISSN, →OCLC, page 43, column 2:",
          "text": "Foods with high sodium content (not allowed or limited in high blood pressure): […] beans with added salt such as tausi, tahure, misu; […]",
          "type": "quote"
        },
        {
          "bold_text_offsets": [
            [
              45,
              50
            ]
          ],
          "ref": "2012, Amy Besa, Romy Dorotan, “Chilled Lobster Kinilaw”, in Memories of Philippine Kitchens: Stories and Recipes from Far and Near, 2nd edition, New York, N.Y.: Stewart, Tabori & Chang, →ISBN, chapter 2 (Food That Was Always Ours), page 51, column 3:",
          "text": "Other additions: salted duck eggs, tomatoes, tausi [salted black beans], and garlic",
          "type": "quote"
        },
        {
          "bold_text_offsets": [
            [
              85,
              90
            ]
          ],
          "ref": "2014 November, Claude Tayag, Mary Ann Quioc, “Dilis-cious rice”, in Linamnam: Eating One’s Way Around the Philippines, 2nd edition, Mandaluyong: Anvil Publishing, →ISBN, “Metro Manila” section, page 76:",
          "text": "At Fely J’s, the dilis or dried mini anchovies are crisp fried with a slight hint of tausi and served atop hot steaming jasmine rice.",
          "type": "quote"
        },
        {
          "bold_text_offsets": [
            [
              14,
              19
            ],
            [
              91,
              96
            ]
          ],
          "ref": "2022 March 7–13, “Tochong Bangus”, in Mindanao Examiner, Mindanao, →OCLC, page 9, columns 3–4:",
          "text": "2 Tablespoons tausi salted black beans […] Once the onion softens, add vinegar, tahure and tausi.",
          "type": "quote"
        }
      ],
      "glosses": [
        "Salted black beans."
      ],
      "links": [
        [
          "Salted",
          "salted"
        ],
        [
          "black bean",
          "black bean"
        ]
      ],
      "raw_glosses": [
        "(Philippines) Salted black beans."
      ],
      "tags": [
        "Philippines",
        "uncountable"
      ]
    }
  ],
  "word": "tausi"
}

{
  "head_templates": [
    {
      "args": {
        "1": "nn",
        "2": "nounf",
        "g": "f"
      },
      "expansion": "tausi f",
      "name": "head"
    }
  ],
  "lang": "Norwegian Nynorsk",
  "lang_code": "nn",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "Norwegian Nynorsk entries with incorrect language header",
        "Norwegian Nynorsk non-lemma forms",
        "Norwegian Nynorsk noun forms",
        "Pages with 5 entries",
        "Pages with entries"
      ],
      "form_of": [
        {
          "word": "taus"
        }
      ],
      "glosses": [
        "definite singular of taus"
      ],
      "links": [
        [
          "taus",
          "taus#Norwegian_Nynorsk"
        ]
      ],
      "raw_glosses": [
        "(non-standard since 2012) definite singular of taus"
      ],
      "tags": [
        "definite",
        "feminine",
        "form-of",
        "nonstandard",
        "singular"
      ]
    }
  ],
  "word": "tausi"
}

{
  "categories": [
    "Pages with 5 entries",
    "Pages with entries",
    "Samoan entries with incorrect language header",
    "Samoan lemmas",
    "Samoan nouns",
    "Samoan verbs"
  ],
  "head_templates": [
    {
      "args": {
        "1": "sm",
        "2": "verb"
      },
      "expansion": "tausi",
      "name": "head"
    }
  ],
  "lang": "Samoan",
  "lang_code": "sm",
  "pos": "verb",
  "senses": [
    {
      "glosses": [
        "care for"
      ],
      "links": [
        [
          "care for",
          "care for"
        ]
      ]
    }
  ],
  "word": "tausi"
}

{
  "categories": [
    "Pages with 5 entries",
    "Pages with entries",
    "Samoan entries with incorrect language header",
    "Samoan lemmas",
    "Samoan nouns",
    "Samoan verbs"
  ],
  "head_templates": [
    {
      "args": {
        "1": "sm",
        "2": "noun"
      },
      "expansion": "tausi",
      "name": "head"
    }
  ],
  "lang": "Samoan",
  "lang_code": "sm",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "wife of a talking chief"
      ],
      "links": [
        [
          "wife",
          "wife"
        ],
        [
          "chief",
          "chief"
        ]
      ]
    }
  ],
  "word": "tausi"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "es",
        "2": "yue",
        "3": "-"
      },
      "expansion": "Borrowed from Cantonese",
      "name": "bor+"
    },
    {
      "args": {
        "1": "6"
      },
      "expansion": "⁶",
      "name": "sup"
    },
    {
      "args": {
        "1": "6"
      },
      "expansion": "⁶",
      "name": "sup"
    },
    {
      "args": {
        "1": "豆豉",
        "tr": "dau⁶ si⁶"
      },
      "expansion": "豆豉 (dau⁶ si⁶)",
      "name": "zh-m"
    }
  ],
  "etymology_text": "Borrowed from Cantonese 豆豉 (dau⁶ si⁶).",
  "head_templates": [
    {
      "args": {
        "1": "m",
        "2": "?"
      },
      "expansion": "tausi m",
      "name": "es-noun"
    }
  ],
  "hyphenation": [
    "tau‧si"
  ],
  "lang": "Spanish",
  "lang_code": "es",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "Pages with 5 entries",
        "Pages with entries",
        "Peruvian Spanish",
        "Rhymes:Spanish/ausi",
        "Rhymes:Spanish/ausi/2 syllables",
        "Spanish 2-syllable words",
        "Spanish entries with incorrect language header",
        "Spanish lemmas",
        "Spanish masculine nouns",
        "Spanish nouns",
        "Spanish nouns with unknown or uncertain plurals",
        "Spanish terms borrowed from Cantonese",
        "Spanish terms derived from Cantonese",
        "Spanish terms with IPA pronunciation",
        "es:Foods"
      ],
      "examples": [
        {
          "text": "Meronym: chifa (“Peruvian Chinese cuisine”)"
        }
      ],
      "glosses": [
        "douchi"
      ],
      "links": [
        [
          "douchi",
          "douchi"
        ]
      ],
      "raw_glosses": [
        "(Peru) douchi"
      ],
      "tags": [
        "Peru",
        "masculine"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/ˈtausi/"
    },
    {
      "ipa": "[ˈt̪au̯.si]"
    },
    {
      "rhymes": "-ausi"
    }
  ],
  "word": "tausi"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "sw",
        "2": "ar",
        "3": "طَاوُوس"
      },
      "expansion": "Borrowed from Arabic طَاوُوس (ṭāwūs)",
      "name": "bor+"
    }
  ],
  "etymology_text": "Borrowed from Arabic طَاوُوس (ṭāwūs).",
  "forms": [
    {
      "form": "tausi class IX",
      "tags": [
        "canonical"
      ]
    },
    {
      "form": "tausi class X",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "n"
      },
      "expansion": "tausi class IX (plural tausi class X)",
      "name": "sw-noun"
    }
  ],
  "lang": "Swahili",
  "lang_code": "sw",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "Pages with 5 entries",
        "Pages with entries",
        "Swahili class IX nouns",
        "Swahili entries with incorrect language header",
        "Swahili lemmas",
        "Swahili nouns",
        "Swahili terms borrowed from Arabic",
        "Swahili terms derived from Arabic",
        "sw:Fowls"
      ],
      "glosses": [
        "peacock"
      ],
      "links": [
        [
          "peacock",
          "peacock"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "audio": "Sw-ke-tausi.flac",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/7/74/Sw-ke-tausi.flac/Sw-ke-tausi.flac.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/7/74/Sw-ke-tausi.flac/Sw-ke-tausi.flac.ogg"
    }
  ],
  "word": "tausi"
}

Download raw JSONL data for tausi meaning in All languages combined (6.5kB)


This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2025-04-10 from the enwiktionary dump dated 2025-04-03 using wiktextract (74c5344 and fb63907). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.