"thesaurus" meaning in All languages combined

See thesaurus on Wiktionary

Noun [Englisch]

IPA: θɪˈsɔːrəs Audio: LL-Q1860 (eng)-Vealhurl-thesaurus.wav
  1. Zusammenstellungen von Wörtern mit ähnlichen Bedeutungen als Buch oder in elektronischer Form
    Sense id: de-thesaurus-en-noun-jvNC9EJM
The following are not (yet) sense-disambiguated
Translations (Zusammenstellungen von Wörtern mit ähnlichen Bedeutungen): Synonymwörterbuch (Deutsch)

Noun [Französisch]

IPA: tezoʁys Audio: LL-Q150 (fra)-WikiLucas00-thesaurus.wav
  1. Thesaurus
    Sense id: de-thesaurus-fr-noun-Oe58OENd
The following are not (yet) sense-disambiguated
Translations (Thesaurus): Thesaurus [masculine] (Deutsch), dizionario dei sinonimi [masculine] (Italienisch), tezaurus (Polnisch), thesaurus (Polnisch), tesauro (Spanisch)

Noun [Latein]

Forms: thēsaurus [nominative], thēsaurī [nominative, singular], thēsaurī [genitive], thēsaurōrum [genitive, singular], thēsaurō [dative], thēsaurīs [dative, singular], thēsaurum [accusative], thēsaurōs [accusative, singular], thēsaure, thēsaurī [singular], thēsaurō [ablative], thēsaurīs [ablative, singular]
  1. Vorrat, Schatz Tags: Classical Latin
    Sense id: de-thesaurus-la-noun-teiM3R1b Categories (other): Klassisches Latein
  2. Schatzkammer Tags: Classical Latin
    Sense id: de-thesaurus-la-noun-C0048qTN Categories (other): Klassisches Latein
  3. Fundgrube, Repertoire, Magazin Tags: Classical Latin, figurative
    Sense id: de-thesaurus-la-noun-4j5Tvcx7 Categories (other): Klassisches Latein
  4. Bargeld Tags: Medieval Latin, figurative
    Sense id: de-thesaurus-la-noun-wkMJ-RcD Categories (other): Mittellatein
  5. Hoheitsrecht auf einen besitzerlosen Fund Tags: Medieval Latin, figurative
    Sense id: de-thesaurus-la-noun-OEsbspH- Categories (other): Mittellatein
The following are not (yet) sense-disambiguated
Translations (siehe Deutsch): Vorrat [masculine] (Deutsch), Schatz [masculine] (Deutsch), Schatzkammer [feminine] (Deutsch), Fundgrube [feminine] (Deutsch), Repertoire [neuter] (Deutsch), Magazin [neuter] (Deutsch), Bargeld [neuter] (Deutsch)

{
  "categories": [
    {
      "kind": "other",
      "name": "Anagramm sortiert (Englisch)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Englisch",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Roter Audiolink",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Rückläufige Wörterliste (Englisch)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Substantiv (Englisch)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Substantiv zwei Pluralformen (Englisch)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Wiktionary:Audio-Datei",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Siehe auch",
      "orig": "siehe auch",
      "parents": [],
      "source": "w"
    }
  ],
  "hyphenation": "the‧sau‧rus",
  "lang": "Englisch",
  "lang_code": "en",
  "pos": "noun",
  "pos_title": "Substantiv",
  "senses": [
    {
      "glosses": [
        "Zusammenstellungen von Wörtern mit ähnlichen Bedeutungen als Buch oder in elektronischer Form"
      ],
      "id": "de-thesaurus-en-noun-jvNC9EJM",
      "sense_index": "1"
    }
  ],
  "sounds": [
    {
      "ipa": "θɪˈsɔːrəs"
    },
    {
      "audio": "LL-Q1860 (eng)-Vealhurl-thesaurus.wav",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/7/75/LL-Q1860_(eng)-Vealhurl-thesaurus.wav/LL-Q1860_(eng)-Vealhurl-thesaurus.wav.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/7/75/LL-Q1860_(eng)-Vealhurl-thesaurus.wav/LL-Q1860_(eng)-Vealhurl-thesaurus.wav.ogg",
      "raw_tags": [
        "britisch"
      ],
      "wav_url": "https://commons.wikimedia.org/wiki/Special:FilePath/LL-Q1860 (eng)-Vealhurl-thesaurus.wav"
    }
  ],
  "translations": [
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "Zusammenstellungen von Wörtern mit ähnlichen Bedeutungen",
      "sense_index": "1",
      "word": "Synonymwörterbuch"
    }
  ],
  "word": "thesaurus"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Anagramm sortiert (Französisch)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Französisch",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Roter Audiolink",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Rückläufige Wörterliste (Französisch)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Substantiv (Französisch)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Substantiv m (Französisch)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Wiktionary:Audio-Datei",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Siehe auch",
      "orig": "siehe auch",
      "parents": [],
      "source": "w"
    }
  ],
  "hyphenation": "the·sau·rus",
  "lang": "Französisch",
  "lang_code": "fr",
  "pos": "noun",
  "pos_title": "Substantiv",
  "senses": [
    {
      "glosses": [
        "Thesaurus"
      ],
      "id": "de-thesaurus-fr-noun-Oe58OENd",
      "sense_index": "1"
    }
  ],
  "sounds": [
    {
      "ipa": "tezoʁys"
    },
    {
      "audio": "LL-Q150 (fra)-WikiLucas00-thesaurus.wav",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/b/b2/LL-Q150_(fra)-WikiLucas00-thesaurus.wav/LL-Q150_(fra)-WikiLucas00-thesaurus.wav.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/b/b2/LL-Q150_(fra)-WikiLucas00-thesaurus.wav/LL-Q150_(fra)-WikiLucas00-thesaurus.wav.ogg",
      "wav_url": "https://commons.wikimedia.org/wiki/Special:FilePath/LL-Q150 (fra)-WikiLucas00-thesaurus.wav"
    }
  ],
  "tags": [
    "masculine"
  ],
  "translations": [
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "Thesaurus",
      "sense_index": "1",
      "tags": [
        "masculine"
      ],
      "word": "Thesaurus"
    },
    {
      "lang": "Italienisch",
      "lang_code": "it",
      "sense": "Thesaurus",
      "sense_index": "1",
      "tags": [
        "masculine"
      ],
      "word": "dizionario dei sinonimi"
    },
    {
      "lang": "Polnisch",
      "lang_code": "pl",
      "sense": "Thesaurus",
      "sense_index": "1",
      "word": "tezaurus"
    },
    {
      "lang": "Polnisch",
      "lang_code": "pl",
      "sense": "Thesaurus",
      "sense_index": "1",
      "word": "thesaurus"
    },
    {
      "lang": "Spanisch",
      "lang_code": "es",
      "sense": "Thesaurus",
      "sense_index": "1",
      "word": "tesauro"
    }
  ],
  "word": "thesaurus"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Anagramm sortiert (Latein)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Latein",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Rückläufige Wörterliste (Latein)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Substantiv (Latein)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Substantiv 2. Deklination (Latein)",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Siehe auch",
      "orig": "siehe auch",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Übersetzungen (Altgriechisch)",
      "parents": [],
      "source": "w"
    }
  ],
  "etymology_texts": [
    "von altgriechisch θησαυρός (thēsauros^☆) ^(→ grc) (der „Schatz“, die „Schatzkammer“)"
  ],
  "forms": [
    {
      "form": "thēsaurus",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "nominative"
      ]
    },
    {
      "form": "thēsaurī",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "thēsaurī",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "genitive"
      ]
    },
    {
      "form": "thēsaurōrum",
      "tags": [
        "genitive",
        "singular"
      ]
    },
    {
      "form": "thēsaurō",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "dative"
      ]
    },
    {
      "form": "thēsaurīs",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "thēsaurum",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "accusative"
      ]
    },
    {
      "form": "thēsaurōs",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "thēsaure",
      "raw_tags": [
        "Vokativ",
        "Kasus"
      ]
    },
    {
      "form": "thēsaurī",
      "raw_tags": [
        "Vokativ"
      ],
      "tags": [
        "singular"
      ]
    },
    {
      "form": "thēsaurō",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "ablative"
      ]
    },
    {
      "form": "thēsaurīs",
      "tags": [
        "ablative",
        "singular"
      ]
    }
  ],
  "hyphenation": "the·sau·rus",
  "lang": "Latein",
  "lang_code": "la",
  "pos": "noun",
  "pos_title": "Substantiv",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Klassisches Latein",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Vorrat, Schatz"
      ],
      "id": "de-thesaurus-la-noun-teiM3R1b",
      "sense_index": "1",
      "tags": [
        "Classical Latin"
      ]
    },
    {
      "categories": [
        {
          "kind": "other",
          "name": "Klassisches Latein",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Schatzkammer"
      ],
      "id": "de-thesaurus-la-noun-C0048qTN",
      "raw_tags": [
        "metonymisch"
      ],
      "sense_index": "2",
      "tags": [
        "Classical Latin"
      ]
    },
    {
      "categories": [
        {
          "kind": "other",
          "name": "Klassisches Latein",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Fundgrube, Repertoire, Magazin"
      ],
      "id": "de-thesaurus-la-noun-4j5Tvcx7",
      "sense_index": "3",
      "tags": [
        "Classical Latin",
        "figurative"
      ]
    },
    {
      "categories": [
        {
          "kind": "other",
          "name": "Mittellatein",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Bargeld"
      ],
      "id": "de-thesaurus-la-noun-wkMJ-RcD",
      "sense_index": "4",
      "tags": [
        "Medieval Latin",
        "figurative"
      ]
    },
    {
      "categories": [
        {
          "kind": "other",
          "name": "Mittellatein",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Hoheitsrecht auf einen besitzerlosen Fund"
      ],
      "id": "de-thesaurus-la-noun-OEsbspH-",
      "sense_index": "5",
      "tags": [
        "Medieval Latin",
        "figurative"
      ]
    }
  ],
  "tags": [
    "masculine"
  ],
  "translations": [
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "1",
      "tags": [
        "masculine"
      ],
      "word": "Vorrat"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "1",
      "tags": [
        "masculine"
      ],
      "word": "Schatz"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "2",
      "tags": [
        "feminine"
      ],
      "word": "Schatzkammer"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "3",
      "tags": [
        "feminine"
      ],
      "word": "Fundgrube"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "3",
      "tags": [
        "neuter"
      ],
      "word": "Repertoire"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "3",
      "tags": [
        "neuter"
      ],
      "word": "Magazin"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "4",
      "tags": [
        "neuter"
      ],
      "word": "Bargeld"
    }
  ],
  "word": "thesaurus"
}

{
  "categories": [
    "Anagramm sortiert (Englisch)",
    "Englisch",
    "Roter Audiolink",
    "Rückläufige Wörterliste (Englisch)",
    "Substantiv (Englisch)",
    "Substantiv zwei Pluralformen (Englisch)",
    "Wiktionary:Audio-Datei",
    "siehe auch"
  ],
  "hyphenation": "the‧sau‧rus",
  "lang": "Englisch",
  "lang_code": "en",
  "pos": "noun",
  "pos_title": "Substantiv",
  "senses": [
    {
      "glosses": [
        "Zusammenstellungen von Wörtern mit ähnlichen Bedeutungen als Buch oder in elektronischer Form"
      ],
      "sense_index": "1"
    }
  ],
  "sounds": [
    {
      "ipa": "θɪˈsɔːrəs"
    },
    {
      "audio": "LL-Q1860 (eng)-Vealhurl-thesaurus.wav",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/7/75/LL-Q1860_(eng)-Vealhurl-thesaurus.wav/LL-Q1860_(eng)-Vealhurl-thesaurus.wav.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/7/75/LL-Q1860_(eng)-Vealhurl-thesaurus.wav/LL-Q1860_(eng)-Vealhurl-thesaurus.wav.ogg",
      "raw_tags": [
        "britisch"
      ],
      "wav_url": "https://commons.wikimedia.org/wiki/Special:FilePath/LL-Q1860 (eng)-Vealhurl-thesaurus.wav"
    }
  ],
  "translations": [
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "Zusammenstellungen von Wörtern mit ähnlichen Bedeutungen",
      "sense_index": "1",
      "word": "Synonymwörterbuch"
    }
  ],
  "word": "thesaurus"
}

{
  "categories": [
    "Anagramm sortiert (Französisch)",
    "Französisch",
    "Roter Audiolink",
    "Rückläufige Wörterliste (Französisch)",
    "Substantiv (Französisch)",
    "Substantiv m (Französisch)",
    "Wiktionary:Audio-Datei",
    "siehe auch"
  ],
  "hyphenation": "the·sau·rus",
  "lang": "Französisch",
  "lang_code": "fr",
  "pos": "noun",
  "pos_title": "Substantiv",
  "senses": [
    {
      "glosses": [
        "Thesaurus"
      ],
      "sense_index": "1"
    }
  ],
  "sounds": [
    {
      "ipa": "tezoʁys"
    },
    {
      "audio": "LL-Q150 (fra)-WikiLucas00-thesaurus.wav",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/b/b2/LL-Q150_(fra)-WikiLucas00-thesaurus.wav/LL-Q150_(fra)-WikiLucas00-thesaurus.wav.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/b/b2/LL-Q150_(fra)-WikiLucas00-thesaurus.wav/LL-Q150_(fra)-WikiLucas00-thesaurus.wav.ogg",
      "wav_url": "https://commons.wikimedia.org/wiki/Special:FilePath/LL-Q150 (fra)-WikiLucas00-thesaurus.wav"
    }
  ],
  "tags": [
    "masculine"
  ],
  "translations": [
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "Thesaurus",
      "sense_index": "1",
      "tags": [
        "masculine"
      ],
      "word": "Thesaurus"
    },
    {
      "lang": "Italienisch",
      "lang_code": "it",
      "sense": "Thesaurus",
      "sense_index": "1",
      "tags": [
        "masculine"
      ],
      "word": "dizionario dei sinonimi"
    },
    {
      "lang": "Polnisch",
      "lang_code": "pl",
      "sense": "Thesaurus",
      "sense_index": "1",
      "word": "tezaurus"
    },
    {
      "lang": "Polnisch",
      "lang_code": "pl",
      "sense": "Thesaurus",
      "sense_index": "1",
      "word": "thesaurus"
    },
    {
      "lang": "Spanisch",
      "lang_code": "es",
      "sense": "Thesaurus",
      "sense_index": "1",
      "word": "tesauro"
    }
  ],
  "word": "thesaurus"
}

{
  "categories": [
    "Anagramm sortiert (Latein)",
    "Latein",
    "Rückläufige Wörterliste (Latein)",
    "Substantiv (Latein)",
    "Substantiv 2. Deklination (Latein)",
    "siehe auch",
    "Übersetzungen (Altgriechisch)"
  ],
  "etymology_texts": [
    "von altgriechisch θησαυρός (thēsauros^☆) ^(→ grc) (der „Schatz“, die „Schatzkammer“)"
  ],
  "forms": [
    {
      "form": "thēsaurus",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "nominative"
      ]
    },
    {
      "form": "thēsaurī",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "thēsaurī",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "genitive"
      ]
    },
    {
      "form": "thēsaurōrum",
      "tags": [
        "genitive",
        "singular"
      ]
    },
    {
      "form": "thēsaurō",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "dative"
      ]
    },
    {
      "form": "thēsaurīs",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "thēsaurum",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "accusative"
      ]
    },
    {
      "form": "thēsaurōs",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "thēsaure",
      "raw_tags": [
        "Vokativ",
        "Kasus"
      ]
    },
    {
      "form": "thēsaurī",
      "raw_tags": [
        "Vokativ"
      ],
      "tags": [
        "singular"
      ]
    },
    {
      "form": "thēsaurō",
      "raw_tags": [
        "Kasus"
      ],
      "tags": [
        "ablative"
      ]
    },
    {
      "form": "thēsaurīs",
      "tags": [
        "ablative",
        "singular"
      ]
    }
  ],
  "hyphenation": "the·sau·rus",
  "lang": "Latein",
  "lang_code": "la",
  "pos": "noun",
  "pos_title": "Substantiv",
  "senses": [
    {
      "categories": [
        "Klassisches Latein"
      ],
      "glosses": [
        "Vorrat, Schatz"
      ],
      "sense_index": "1",
      "tags": [
        "Classical Latin"
      ]
    },
    {
      "categories": [
        "Klassisches Latein"
      ],
      "glosses": [
        "Schatzkammer"
      ],
      "raw_tags": [
        "metonymisch"
      ],
      "sense_index": "2",
      "tags": [
        "Classical Latin"
      ]
    },
    {
      "categories": [
        "Klassisches Latein"
      ],
      "glosses": [
        "Fundgrube, Repertoire, Magazin"
      ],
      "sense_index": "3",
      "tags": [
        "Classical Latin",
        "figurative"
      ]
    },
    {
      "categories": [
        "Mittellatein"
      ],
      "glosses": [
        "Bargeld"
      ],
      "sense_index": "4",
      "tags": [
        "Medieval Latin",
        "figurative"
      ]
    },
    {
      "categories": [
        "Mittellatein"
      ],
      "glosses": [
        "Hoheitsrecht auf einen besitzerlosen Fund"
      ],
      "sense_index": "5",
      "tags": [
        "Medieval Latin",
        "figurative"
      ]
    }
  ],
  "tags": [
    "masculine"
  ],
  "translations": [
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "1",
      "tags": [
        "masculine"
      ],
      "word": "Vorrat"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "1",
      "tags": [
        "masculine"
      ],
      "word": "Schatz"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "2",
      "tags": [
        "feminine"
      ],
      "word": "Schatzkammer"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "3",
      "tags": [
        "feminine"
      ],
      "word": "Fundgrube"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "3",
      "tags": [
        "neuter"
      ],
      "word": "Repertoire"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "3",
      "tags": [
        "neuter"
      ],
      "word": "Magazin"
    },
    {
      "lang": "Deutsch",
      "lang_code": "de",
      "sense": "siehe Deutsch",
      "sense_index": "4",
      "tags": [
        "neuter"
      ],
      "word": "Bargeld"
    }
  ],
  "word": "thesaurus"
}
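
The records above carry a `sense_index` on both the `senses` and the `translations` entries, so that field can serve as a join key when a sense has no `id`. The following is a minimal sketch of grouping translations by sense, using an abbreviated copy of the Latin entry's `translations` list; the function name `translations_by_sense` is hypothetical and is not part of wiktextract or kaikki.org.

```python
from collections import defaultdict

# Abbreviated copy of the Latin entry's "translations" list shown on this page.
translations = [
    {"lang_code": "de", "sense_index": "1", "word": "Vorrat"},
    {"lang_code": "de", "sense_index": "1", "word": "Schatz"},
    {"lang_code": "de", "sense_index": "2", "word": "Schatzkammer"},
    {"lang_code": "de", "sense_index": "3", "word": "Fundgrube"},
]

def translations_by_sense(items):
    """Group translation words under the sense_index they refer to.

    Hypothetical helper: entries without a sense_index land under "".
    """
    grouped = defaultdict(list)
    for item in items:
        grouped[item.get("sense_index", "")].append(item["word"])
    return dict(grouped)

for index, words in translations_by_sense(translations).items():
    print(index, ", ".join(words))
```

Keeping `sense_index` as a string mirrors how it appears in the data, which avoids assuming that every index is a plain integer.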

Download raw JSONL data for thesaurus meaning in All languages combined (5.5kB)


This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2025-05-15 from the dewiktionary dump dated 2025-05-01 using wiktextract (142890b and 1d3fdbf). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
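
As a rough sketch of how such JSONL data might be consumed, the snippet below assumes one JSON object per line and relies only on field names that appear on this page (`word`, `lang_code`, `senses`, `glosses`). The sample records are abbreviated, and the function name `glosses_by_language` is illustrative, not something kaikki.org or wiktextract provides.

```python
import json
from collections import defaultdict

# Two abbreviated sample records, serialized one per line as in a JSONL file.
SAMPLE_JSONL = "\n".join([
    json.dumps({"word": "thesaurus", "lang_code": "fr",
                "senses": [{"glosses": ["Thesaurus"], "sense_index": "1"}]}),
    json.dumps({"word": "thesaurus", "lang_code": "la",
                "senses": [{"glosses": ["Vorrat, Schatz"], "sense_index": "1"},
                           {"glosses": ["Schatzkammer"], "sense_index": "2"}]}),
])

def glosses_by_language(jsonl_text):
    """Map each lang_code to the glosses found in its records (assumed layout)."""
    result = defaultdict(list)
    for line in jsonl_text.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        entry = json.loads(line)
        for sense in entry.get("senses", []):
            result[entry.get("lang_code", "?")].extend(sense.get("glosses", []))
    return dict(result)

print(glosses_by_language(SAMPLE_JSONL))
```

For a downloaded file, the same function could be fed the file's text, or the loop could iterate over the open file handle line by line to avoid loading everything into memory.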

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.