"tarsier" meaning in English


Noun

IPA: /ˈtɑ(ɹ)si.ə(ɹ)/, /ˈtaɹʃi.əɹ/
Audio: LL-Q1860 (eng)-Vealhurl-tarsier.wav [Southern-England]
Forms: tarsiers [plural]
Etymology: From French tarsier, from Latin tarsus, from Ancient Greek ταρσός (tarsós, “wickerwork mat; broad, flat surface”).
Etymology templates: {{der|en|fr|tarsier}} French tarsier, {{der|en|la|tarsus|}} Latin tarsus, {{der|en|grc|ταρσός||wickerwork mat"; "broad, flat surface"}} Ancient Greek ταρσός (tarsós, “wickerwork mat; broad, flat surface”)
Head templates: {{en-noun}} tarsier (plural tarsiers)
  1. An insectivorous primate of the family Tarsiidae, having very large eyes and long feet, native mainly to several islands of Southeast Asia.
     Wikipedia link: tarsier
     Categories (lifeform): Prosimians
     Derived forms: Horsfield's tarsier
     Related terms: Tarsius
     Translations (insectivorous primate): تَارْسِير (tarsīr) [masculine] (Arabic), даўгапя́т (daŭhapját) [masculine] (Belarusian), дългопе́т (dǎlgopét) [masculine] (Bulgarian), tarser (Catalan), 眼鏡猴 (Chinese Mandarin), 眼镜猴 (yǎnjìnghóu) (Chinese Mandarin), nártoun (Czech), spøgelsesabe (Danish), spookdiertje [neuter] (Dutch), kummituseläin (Finnish), tarsier [masculine] (French), Koboldmaki [masculine] (German), קוֹפִיף (Hebrew), koboldmaki (Hungarian), tarsio (Italian), メガネザル (meganezaru) (Japanese), 眼鏡猿 (meganezaru) (alt: めがねざる) (Japanese), 안경원숭이 (an'gyeong'wonsung'i) (Korean), ilgakulnis [masculine] (Lithuanian), kera hantu (Malay), mágí binááʼtsohígíí (Navajo), tarsyi (Norman), spøkelsesaper (Norwegian), tarsièr (Occitan), wyrak [masculine] (Polish), tarsjusz [masculine] (Polish), társio [masculine] (Portuguese), долгопя́т (dolgopját) [masculine] (Russian), аветњаци [Cyrillic] (Serbo-Croatian), тарзијери [Cyrillic] (Serbo-Croatian), avetnjaci [Roman] (Serbo-Croatian), tarzijeri [Roman] (Serbo-Croatian), tarsero [masculine] (Spanish), tarsio [masculine] (Spanish), spökdjur (Swedish), vristdjur (Swedish), malmag (Tagalog), mamag (Tagalog), ทาร์เซียร์ (taa-siia) (Thai), довгоп'я́т (dovhopʺját) [masculine] (Ukrainian), phủ hầu (Vietnamese), magô (Waray-Waray)

Download JSON data for tarsier meaning in English (7.4kB)

{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "fr",
        "3": "tarsier"
      },
      "expansion": "French tarsier",
      "name": "der"
    },
    {
      "args": {
        "1": "en",
        "2": "la",
        "3": "tarsus",
        "4": ""
      },
      "expansion": "Latin tarsus",
      "name": "der"
    },
    {
      "args": {
        "1": "en",
        "2": "grc",
        "3": "ταρσός",
        "4": "",
        "5": "wickerwork mat\"; \"broad, flat surface\""
      },
      "expansion": "Ancient Greek ταρσός (tarsós, “wickerwork mat\"; \"broad, flat surface\"”)",
      "name": "der"
    }
  ],
  "etymology_text": "From French tarsier, from Latin tarsus, from Ancient Greek ταρσός (tarsós, “wickerwork mat\"; \"broad, flat surface\"”).",
  "forms": [
    {
      "form": "tarsiers",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "tarsier (plural tarsiers)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Arabic terms with non-redundant manual transliterations",
          "parents": [
            "Terms with non-redundant manual transliterations",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English entries with topic categories using raw markup",
          "parents": [
            "Entries with topic categories using raw markup",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Mandarin terms with redundant transliterations",
          "parents": [
            "Terms with redundant transliterations",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Serbo-Croatian terms with redundant script codes",
          "parents": [
            "Terms with redundant script codes",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "lifeform",
          "langcode": "en",
          "name": "Prosimians",
          "orig": "en:Prosimians",
          "parents": [
            "Primates",
            "Mammals",
            "Vertebrates",
            "Chordates",
            "Animals",
            "Lifeforms",
            "All topics",
            "Life",
            "Fundamental",
            "Nature"
          ],
          "source": "w"
        }
      ],
      "derived": [
        {
          "word": "Horsfield's tarsier"
        }
      ],
      "glosses": [
        "An insectivorous primate of the family Tarsiidae, having very large eyes and long feet, native mainly to several islands of Southeast Asia."
      ],
      "id": "en-tarsier-en-noun-OViNpNUC",
      "links": [
        [
          "insectivorous",
          "insectivorous"
        ],
        [
          "primate",
          "primate"
        ],
        [
          "Tarsiidae",
          "Tarsiidae#Translingual"
        ],
        [
          "Southeast Asia",
          "Southeast Asia"
        ]
      ],
      "related": [
        {
          "word": "Tarsius"
        }
      ],
      "translations": [
        {
          "code": "ar",
          "lang": "Arabic",
          "roman": "tarsīr",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "تَارْسِير"
        },
        {
          "code": "be",
          "lang": "Belarusian",
          "roman": "daŭhapját",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "даўгапя́т"
        },
        {
          "code": "bg",
          "lang": "Bulgarian",
          "roman": "dǎlgopét",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "дългопе́т"
        },
        {
          "code": "ca",
          "lang": "Catalan",
          "sense": "insectivorous primate",
          "word": "tarser"
        },
        {
          "code": "cmn",
          "lang": "Chinese Mandarin",
          "sense": "insectivorous primate",
          "word": "眼鏡猴"
        },
        {
          "code": "cmn",
          "lang": "Chinese Mandarin",
          "roman": "yǎnjìnghóu",
          "sense": "insectivorous primate",
          "word": "眼镜猴"
        },
        {
          "code": "cs",
          "lang": "Czech",
          "sense": "insectivorous primate",
          "word": "nártoun"
        },
        {
          "code": "da",
          "lang": "Danish",
          "sense": "insectivorous primate",
          "word": "spøgelsesabe"
        },
        {
          "code": "nl",
          "lang": "Dutch",
          "sense": "insectivorous primate",
          "tags": [
            "neuter"
          ],
          "word": "spookdiertje"
        },
        {
          "code": "fi",
          "lang": "Finnish",
          "sense": "insectivorous primate",
          "word": "kummituseläin"
        },
        {
          "code": "fr",
          "lang": "French",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "tarsier"
        },
        {
          "code": "de",
          "lang": "German",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "Koboldmaki"
        },
        {
          "code": "he",
          "lang": "Hebrew",
          "sense": "insectivorous primate",
          "word": "קוֹפִיף"
        },
        {
          "code": "hu",
          "lang": "Hungarian",
          "sense": "insectivorous primate",
          "word": "koboldmaki"
        },
        {
          "code": "it",
          "lang": "Italian",
          "sense": "insectivorous primate",
          "word": "tarsio"
        },
        {
          "code": "ja",
          "lang": "Japanese",
          "roman": "meganezaru",
          "sense": "insectivorous primate",
          "word": "メガネザル"
        },
        {
          "alt": "めがねざる",
          "code": "ja",
          "lang": "Japanese",
          "roman": "meganezaru",
          "sense": "insectivorous primate",
          "word": "眼鏡猿"
        },
        {
          "code": "ko",
          "lang": "Korean",
          "roman": "an'gyeong'wonsung'i",
          "sense": "insectivorous primate",
          "word": "안경원숭이"
        },
        {
          "code": "lt",
          "lang": "Lithuanian",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "ilgakulnis"
        },
        {
          "code": "ms",
          "lang": "Malay",
          "sense": "insectivorous primate",
          "word": "kera hantu"
        },
        {
          "code": "nv",
          "lang": "Navajo",
          "sense": "insectivorous primate",
          "word": "mágí binááʼtsohígíí"
        },
        {
          "code": "nrf",
          "lang": "Norman",
          "sense": "insectivorous primate",
          "word": "tarsyi"
        },
        {
          "code": "no",
          "lang": "Norwegian",
          "sense": "insectivorous primate",
          "word": "spøkelsesaper"
        },
        {
          "code": "oc",
          "lang": "Occitan",
          "sense": "insectivorous primate",
          "word": "tarsièr"
        },
        {
          "code": "pl",
          "lang": "Polish",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "wyrak"
        },
        {
          "code": "pl",
          "lang": "Polish",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "tarsjusz"
        },
        {
          "code": "pt",
          "lang": "Portuguese",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "társio"
        },
        {
          "code": "ru",
          "lang": "Russian",
          "roman": "dolgopját",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "долгопя́т"
        },
        {
          "code": "sh",
          "lang": "Serbo-Croatian",
          "sense": "insectivorous primate",
          "tags": [
            "Cyrillic"
          ],
          "word": "аветњаци"
        },
        {
          "code": "sh",
          "lang": "Serbo-Croatian",
          "sense": "insectivorous primate",
          "tags": [
            "Cyrillic"
          ],
          "word": "тарзијери"
        },
        {
          "code": "sh",
          "lang": "Serbo-Croatian",
          "sense": "insectivorous primate",
          "tags": [
            "Roman"
          ],
          "word": "avetnjaci"
        },
        {
          "code": "sh",
          "lang": "Serbo-Croatian",
          "sense": "insectivorous primate",
          "tags": [
            "Roman"
          ],
          "word": "tarzijeri"
        },
        {
          "code": "es",
          "lang": "Spanish",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "tarsero"
        },
        {
          "code": "es",
          "lang": "Spanish",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "tarsio"
        },
        {
          "code": "sv",
          "lang": "Swedish",
          "sense": "insectivorous primate",
          "word": "spökdjur"
        },
        {
          "code": "sv",
          "lang": "Swedish",
          "sense": "insectivorous primate",
          "word": "vristdjur"
        },
        {
          "code": "tl",
          "lang": "Tagalog",
          "sense": "insectivorous primate",
          "word": "malmag"
        },
        {
          "code": "tl",
          "lang": "Tagalog",
          "sense": "insectivorous primate",
          "word": "mamag"
        },
        {
          "code": "th",
          "lang": "Thai",
          "roman": "taa-siia",
          "sense": "insectivorous primate",
          "word": "ทาร์เซียร์"
        },
        {
          "code": "uk",
          "lang": "Ukrainian",
          "roman": "dovhopʺját",
          "sense": "insectivorous primate",
          "tags": [
            "masculine"
          ],
          "word": "довгоп'я́т"
        },
        {
          "code": "vi",
          "lang": "Vietnamese",
          "sense": "insectivorous primate",
          "word": "phủ hầu"
        },
        {
          "code": "war",
          "lang": "Waray-Waray",
          "sense": "insectivorous primate",
          "word": "magô"
        }
      ],
      "wikipedia": [
        "tarsier"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/ˈtɑ(ɹ)si.ə(ɹ)/"
    },
    {
      "ipa": "/ˈtaɹʃi.əɹ/"
    },
    {
      "audio": "LL-Q1860 (eng)-Vealhurl-tarsier.wav",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/5/57/LL-Q1860_%28eng%29-Vealhurl-tarsier.wav/LL-Q1860_%28eng%29-Vealhurl-tarsier.wav.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/5/57/LL-Q1860_%28eng%29-Vealhurl-tarsier.wav/LL-Q1860_%28eng%29-Vealhurl-tarsier.wav.ogg",
      "tags": [
        "Southern-England"
      ],
      "text": "Audio (Southern England)"
    }
  ],
  "word": "tarsier"
}
{
  "derived": [
    {
      "word": "Horsfield's tarsier"
    }
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "fr",
        "3": "tarsier"
      },
      "expansion": "French tarsier",
      "name": "der"
    },
    {
      "args": {
        "1": "en",
        "2": "la",
        "3": "tarsus",
        "4": ""
      },
      "expansion": "Latin tarsus",
      "name": "der"
    },
    {
      "args": {
        "1": "en",
        "2": "grc",
        "3": "ταρσός",
        "4": "",
        "5": "wickerwork mat\"; \"broad, flat surface\""
      },
      "expansion": "Ancient Greek ταρσός (tarsós, “wickerwork mat\"; \"broad, flat surface\"”)",
      "name": "der"
    }
  ],
  "etymology_text": "From French tarsier, from Latin tarsus, from Ancient Greek ταρσός (tarsós, “wickerwork mat\"; \"broad, flat surface\"”).",
  "forms": [
    {
      "form": "tarsiers",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "tarsier (plural tarsiers)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "related": [
    {
      "word": "Tarsius"
    }
  ],
  "senses": [
    {
      "categories": [
        "Arabic terms with non-redundant manual transliterations",
        "English 3-syllable words",
        "English countable nouns",
        "English entries with incorrect language header",
        "English entries with topic categories using raw markup",
        "English lemmas",
        "English nouns",
        "English terms derived from Ancient Greek",
        "English terms derived from French",
        "English terms derived from Latin",
        "English terms with IPA pronunciation",
        "English terms with audio links",
        "Mandarin terms with redundant transliterations",
        "Serbo-Croatian terms with redundant script codes",
        "en:Prosimians"
      ],
      "glosses": [
        "An insectivorous primate of the family Tarsiidae, having very large eyes and long feet, native mainly to several islands of Southeast Asia."
      ],
      "links": [
        [
          "insectivorous",
          "insectivorous"
        ],
        [
          "primate",
          "primate"
        ],
        [
          "Tarsiidae",
          "Tarsiidae#Translingual"
        ],
        [
          "Southeast Asia",
          "Southeast Asia"
        ]
      ],
      "wikipedia": [
        "tarsier"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/ˈtɑ(ɹ)si.ə(ɹ)/"
    },
    {
      "ipa": "/ˈtaɹʃi.əɹ/"
    },
    {
      "audio": "LL-Q1860 (eng)-Vealhurl-tarsier.wav",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/5/57/LL-Q1860_%28eng%29-Vealhurl-tarsier.wav/LL-Q1860_%28eng%29-Vealhurl-tarsier.wav.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/5/57/LL-Q1860_%28eng%29-Vealhurl-tarsier.wav/LL-Q1860_%28eng%29-Vealhurl-tarsier.wav.ogg",
      "tags": [
        "Southern-England"
      ],
      "text": "Audio (Southern England)"
    }
  ],
  "translations": [
    {
      "code": "ar",
      "lang": "Arabic",
      "roman": "tarsīr",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "تَارْسِير"
    },
    {
      "code": "be",
      "lang": "Belarusian",
      "roman": "daŭhapját",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "даўгапя́т"
    },
    {
      "code": "bg",
      "lang": "Bulgarian",
      "roman": "dǎlgopét",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "дългопе́т"
    },
    {
      "code": "ca",
      "lang": "Catalan",
      "sense": "insectivorous primate",
      "word": "tarser"
    },
    {
      "code": "cmn",
      "lang": "Chinese Mandarin",
      "sense": "insectivorous primate",
      "word": "眼鏡猴"
    },
    {
      "code": "cmn",
      "lang": "Chinese Mandarin",
      "roman": "yǎnjìnghóu",
      "sense": "insectivorous primate",
      "word": "眼镜猴"
    },
    {
      "code": "cs",
      "lang": "Czech",
      "sense": "insectivorous primate",
      "word": "nártoun"
    },
    {
      "code": "da",
      "lang": "Danish",
      "sense": "insectivorous primate",
      "word": "spøgelsesabe"
    },
    {
      "code": "nl",
      "lang": "Dutch",
      "sense": "insectivorous primate",
      "tags": [
        "neuter"
      ],
      "word": "spookdiertje"
    },
    {
      "code": "fi",
      "lang": "Finnish",
      "sense": "insectivorous primate",
      "word": "kummituseläin"
    },
    {
      "code": "fr",
      "lang": "French",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "tarsier"
    },
    {
      "code": "de",
      "lang": "German",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "Koboldmaki"
    },
    {
      "code": "he",
      "lang": "Hebrew",
      "sense": "insectivorous primate",
      "word": "קוֹפִיף"
    },
    {
      "code": "hu",
      "lang": "Hungarian",
      "sense": "insectivorous primate",
      "word": "koboldmaki"
    },
    {
      "code": "it",
      "lang": "Italian",
      "sense": "insectivorous primate",
      "word": "tarsio"
    },
    {
      "code": "ja",
      "lang": "Japanese",
      "roman": "meganezaru",
      "sense": "insectivorous primate",
      "word": "メガネザル"
    },
    {
      "alt": "めがねざる",
      "code": "ja",
      "lang": "Japanese",
      "roman": "meganezaru",
      "sense": "insectivorous primate",
      "word": "眼鏡猿"
    },
    {
      "code": "ko",
      "lang": "Korean",
      "roman": "an'gyeong'wonsung'i",
      "sense": "insectivorous primate",
      "word": "안경원숭이"
    },
    {
      "code": "lt",
      "lang": "Lithuanian",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "ilgakulnis"
    },
    {
      "code": "ms",
      "lang": "Malay",
      "sense": "insectivorous primate",
      "word": "kera hantu"
    },
    {
      "code": "nv",
      "lang": "Navajo",
      "sense": "insectivorous primate",
      "word": "mágí binááʼtsohígíí"
    },
    {
      "code": "nrf",
      "lang": "Norman",
      "sense": "insectivorous primate",
      "word": "tarsyi"
    },
    {
      "code": "no",
      "lang": "Norwegian",
      "sense": "insectivorous primate",
      "word": "spøkelsesaper"
    },
    {
      "code": "oc",
      "lang": "Occitan",
      "sense": "insectivorous primate",
      "word": "tarsièr"
    },
    {
      "code": "pl",
      "lang": "Polish",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "wyrak"
    },
    {
      "code": "pl",
      "lang": "Polish",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "tarsjusz"
    },
    {
      "code": "pt",
      "lang": "Portuguese",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "társio"
    },
    {
      "code": "ru",
      "lang": "Russian",
      "roman": "dolgopját",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "долгопя́т"
    },
    {
      "code": "sh",
      "lang": "Serbo-Croatian",
      "sense": "insectivorous primate",
      "tags": [
        "Cyrillic"
      ],
      "word": "аветњаци"
    },
    {
      "code": "sh",
      "lang": "Serbo-Croatian",
      "sense": "insectivorous primate",
      "tags": [
        "Cyrillic"
      ],
      "word": "тарзијери"
    },
    {
      "code": "sh",
      "lang": "Serbo-Croatian",
      "sense": "insectivorous primate",
      "tags": [
        "Roman"
      ],
      "word": "avetnjaci"
    },
    {
      "code": "sh",
      "lang": "Serbo-Croatian",
      "sense": "insectivorous primate",
      "tags": [
        "Roman"
      ],
      "word": "tarzijeri"
    },
    {
      "code": "es",
      "lang": "Spanish",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "tarsero"
    },
    {
      "code": "es",
      "lang": "Spanish",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "tarsio"
    },
    {
      "code": "sv",
      "lang": "Swedish",
      "sense": "insectivorous primate",
      "word": "spökdjur"
    },
    {
      "code": "sv",
      "lang": "Swedish",
      "sense": "insectivorous primate",
      "word": "vristdjur"
    },
    {
      "code": "tl",
      "lang": "Tagalog",
      "sense": "insectivorous primate",
      "word": "malmag"
    },
    {
      "code": "tl",
      "lang": "Tagalog",
      "sense": "insectivorous primate",
      "word": "mamag"
    },
    {
      "code": "th",
      "lang": "Thai",
      "roman": "taa-siia",
      "sense": "insectivorous primate",
      "word": "ทาร์เซียร์"
    },
    {
      "code": "uk",
      "lang": "Ukrainian",
      "roman": "dovhopʺját",
      "sense": "insectivorous primate",
      "tags": [
        "masculine"
      ],
      "word": "довгоп'я́т"
    },
    {
      "code": "vi",
      "lang": "Vietnamese",
      "sense": "insectivorous primate",
      "word": "phủ hầu"
    },
    {
      "code": "war",
      "lang": "Waray-Waray",
      "sense": "insectivorous primate",
      "word": "magô"
    }
  ],
  "word": "tarsier"
}
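
The two records above appear back to back with no separator, and they place "translations" at different levels (inside each sense in the first, at the top level in the second). The following is a minimal Python sketch of one way to read such text, assuming it has been loaded into a string. The helper names `iter_json_objects` and `translations_by_language` and the abbreviated `sample` are hypothetical and are not part of wiktextract or kaikki.org.

```python
import json


def iter_json_objects(text):
    """Yield each top-level JSON value from a string of concatenated objects."""
    decoder = json.JSONDecoder()
    pos = 0
    while pos < len(text):
        # raw_decode does not skip leading whitespace, so skip it here.
        while pos < len(text) and text[pos].isspace():
            pos += 1
        if pos >= len(text):
            break
        obj, pos = decoder.raw_decode(text, pos)
        yield obj


def translations_by_language(entry):
    """Collect translation words per language from either record layout."""
    # Top-level translations (second layout) plus per-sense ones (first layout).
    items = list(entry.get("translations", []))
    for sense in entry.get("senses", []):
        items.extend(sense.get("translations", []))
    found = {}
    for item in items:
        found.setdefault(item["lang"], []).append(item["word"])
    return found


# Abbreviated stand-in for the two records shown above (hypothetical sample).
sample = '''
{"word": "tarsier", "senses": [{"translations": [
  {"code": "es", "lang": "Spanish", "word": "tarsero"},
  {"code": "es", "lang": "Spanish", "word": "tarsio"}]}]}
{"word": "tarsier", "senses": [{"glosses": ["An insectivorous primate"]}],
 "translations": [{"code": "de", "lang": "German", "word": "Koboldmaki"}]}
'''

records = list(iter_json_objects(sample))
for record in records:
    print(record["word"], translations_by_language(record))
```

Using `json.JSONDecoder.raw_decode` avoids guessing where one object ends and the next begins, which a plain `json.loads` call on the whole text cannot do.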

This page is a part of the kaikki.org machine-readable English dictionary. This dictionary is based on structured data extracted on 2024-05-09 from the enwiktionary dump dated 2024-05-02 using wiktextract (4d5d0bb and edd475d). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.