"bulok" meaning in Tagalog

Adjective

IPA: /buˈlok/, [bʊˈlok]
Forms: bulók [canonical], ᜊᜓᜎᜓᜃ᜔ [Baybayin]
Etymology: From Malay buruk, from Proto-Malayic *buruk, from Proto-Malayo-Polynesian *buʀuk, from Proto-Austronesian *buʀuk. Doublet of bugok.
Etymology templates: {{der|tl|ms|buruk}} Malay buruk, {{der|tl|poz-mly-pro|*buruk}} Proto-Malayic *buruk, {{der|tl|poz-pro|*buʀuk}} Proto-Malayo-Polynesian *buʀuk, {{der|tl|map-pro|*buʀuk}} Proto-Austronesian *buʀuk, {{doublet|tl|bugok}} Doublet of bugok
Head templates: {{tl-adj|bulók|b=+}} bulók (Baybayin spelling ᜊᜓᜎᜓᜃ᜔)
  1. rotten; decomposed; decayed Synonyms: agnas, sira, panis [food, lifestyle], bilasa [fish, ichthyology, zoology, biology, natural-sciences], bugok (english: egg)
    Sense id: en-bulok-tl-adj-FH4cDnaj Categories (other): Tagalog entries with incorrect language header, Tagalog terms with Baybayin script, Tagalog terms with missing Baybayin script entries Disambiguation of Tagalog entries with incorrect language header: 32 15 29 16 3 5 Disambiguation of Tagalog terms with Baybayin script: 24 24 22 20 4 7 Disambiguation of Tagalog terms with missing Baybayin script entries: 23 22 23 20 4 7
  2. stinking; ill-smelling
    Sense id: en-bulok-tl-adj-zS60iPaU Categories (other): Tagalog entries with incorrect language header, Tagalog terms with Baybayin script, Tagalog terms with missing Baybayin script entries Disambiguation of Tagalog entries with incorrect language header: 32 15 29 16 3 5 Disambiguation of Tagalog terms with Baybayin script: 24 24 22 20 4 7 Disambiguation of Tagalog terms with missing Baybayin script entries: 23 22 23 20 4 7
  3. (figuratively) morally corrupt; wicked; immoral Tags: figuratively Synonyms: masama
    Sense id: en-bulok-tl-adj-hk9Fn514 Categories (other): Tagalog entries with incorrect language header, Tagalog terms with Baybayin script, Tagalog terms with missing Baybayin script entries Disambiguation of Tagalog entries with incorrect language header: 32 15 29 16 3 5 Disambiguation of Tagalog terms with Baybayin script: 24 24 22 20 4 7 Disambiguation of Tagalog terms with missing Baybayin script entries: 23 22 23 20 4 7
  4. (figuratively) of poor quality; inferior; bum Tags: figuratively
    Sense id: en-bulok-tl-adj-m8D4njMm Categories (other): Tagalog entries with incorrect language header, Tagalog terms with Baybayin script, Tagalog terms with missing Baybayin script entries Disambiguation of Tagalog entries with incorrect language header: 32 15 29 16 3 5 Disambiguation of Tagalog terms with Baybayin script: 24 24 22 20 4 7 Disambiguation of Tagalog terms with missing Baybayin script entries: 23 22 23 20 4 7
  5. (figuratively) inefficient; incapable (in work, school, etc.) Tags: figuratively
    Sense id: en-bulok-tl-adj-OXO~EKGU
The following are not (yet) sense-disambiguated
Synonyms: boloc [obsolete], Spanish-based orthography Derived forms: bulukin, bumulok, di-nabubulok, kabulukan, mabulok, nabubulok, nakabubulok, pagkabulok Related terms: bungkok

Noun

IPA: /buˈlok/, [bʊˈlok]
Forms: bulók [canonical], ᜊᜓᜎᜓᜃ᜔ [Baybayin]
Etymology: From Malay buruk, from Proto-Malayic *buruk, from Proto-Malayo-Polynesian *buʀuk, from Proto-Austronesian *buʀuk. Doublet of bugok.
Etymology templates: {{der|tl|ms|buruk}} Malay buruk, {{der|tl|poz-mly-pro|*buruk}} Proto-Malayic *buruk, {{der|tl|poz-pro|*buʀuk}} Proto-Malayo-Polynesian *buʀuk, {{der|tl|map-pro|*buʀuk}} Proto-Austronesian *buʀuk, {{doublet|tl|bugok}} Doublet of bugok
Head templates: {{tl-noun|bulók|b=+}} bulók (Baybayin spelling ᜊᜓᜎᜓᜃ᜔)
  1. putrid smell (of rotting meat, flesh, garbage, etc.)
    Sense id: en-bulok-tl-noun-DIJl1del
The following are not (yet) sense-disambiguated
Synonyms: boloc [obsolete], Spanish-based orthography

Download JSON data for bulok meaning in Tagalog (6.4kB)

{
  "derived": [
    {
      "_dis1": "0 0 0 0 0",
      "word": "bulukin"
    },
    {
      "_dis1": "0 0 0 0 0",
      "word": "bumulok"
    },
    {
      "_dis1": "0 0 0 0 0",
      "word": "di-nabubulok"
    },
    {
      "_dis1": "0 0 0 0 0",
      "word": "kabulukan"
    },
    {
      "_dis1": "0 0 0 0 0",
      "word": "mabulok"
    },
    {
      "_dis1": "0 0 0 0 0",
      "word": "nabubulok"
    },
    {
      "_dis1": "0 0 0 0 0",
      "word": "nakabubulok"
    },
    {
      "_dis1": "0 0 0 0 0",
      "word": "pagkabulok"
    }
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "tl",
        "2": "ms",
        "3": "buruk"
      },
      "expansion": "Malay buruk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "poz-mly-pro",
        "3": "*buruk"
      },
      "expansion": "Proto-Malayic *buruk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "poz-pro",
        "3": "*buʀuk"
      },
      "expansion": "Proto-Malayo-Polynesian *buʀuk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "map-pro",
        "3": "*buʀuk"
      },
      "expansion": "Proto-Austronesian *buʀuk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "bugok"
      },
      "expansion": "Doublet of bugok",
      "name": "doublet"
    }
  ],
  "etymology_text": "From Malay buruk, from Proto-Malayic *buruk, from Proto-Malayo-Polynesian *buʀuk, from Proto-Austronesian *buʀuk. Doublet of bugok.",
  "forms": [
    {
      "form": "bulók",
      "tags": [
        "canonical"
      ]
    },
    {
      "form": "ᜊᜓᜎᜓᜃ᜔",
      "tags": [
        "Baybayin"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bulók",
        "b": "+"
      },
      "expansion": "bulók (Baybayin spelling ᜊᜓᜎᜓᜃ᜔)",
      "name": "tl-adj"
    }
  ],
  "hyphenation": [
    "bu‧lok"
  ],
  "lang": "Tagalog",
  "lang_code": "tl",
  "pos": "adj",
  "related": [
    {
      "_dis1": "0 0 0 0 0",
      "word": "bungkok"
    }
  ],
  "senses": [
    {
      "categories": [
        {
          "_dis": "32 15 29 16 3 5",
          "kind": "other",
          "name": "Tagalog entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "24 24 22 20 4 7",
          "kind": "other",
          "name": "Tagalog terms with Baybayin script",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "23 22 23 20 4 7",
          "kind": "other",
          "name": "Tagalog terms with missing Baybayin script entries",
          "parents": [],
          "source": "w+disamb"
        }
      ],
      "glosses": [
        "rotten; decomposed; decayed"
      ],
      "id": "en-bulok-tl-adj-FH4cDnaj",
      "links": [
        [
          "rotten",
          "rotten"
        ],
        [
          "decomposed",
          "decomposed"
        ],
        [
          "decayed",
          "decayed"
        ]
      ],
      "synonyms": [
        {
          "word": "agnas"
        },
        {
          "word": "sira"
        },
        {
          "topics": [
            "food",
            "lifestyle"
          ],
          "word": "panis"
        },
        {
          "topics": [
            "fish",
            "ichthyology",
            "zoology",
            "biology",
            "natural-sciences"
          ],
          "word": "bilasa"
        },
        {
          "english": "egg",
          "word": "bugok"
        }
      ]
    },
    {
      "categories": [
        {
          "_dis": "32 15 29 16 3 5",
          "kind": "other",
          "name": "Tagalog entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "24 24 22 20 4 7",
          "kind": "other",
          "name": "Tagalog terms with Baybayin script",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "23 22 23 20 4 7",
          "kind": "other",
          "name": "Tagalog terms with missing Baybayin script entries",
          "parents": [],
          "source": "w+disamb"
        }
      ],
      "glosses": [
        "stinking; ill-smelling"
      ],
      "id": "en-bulok-tl-adj-zS60iPaU",
      "links": [
        [
          "stinking",
          "stinking"
        ],
        [
          "ill",
          "ill"
        ],
        [
          "smelling",
          "smelling"
        ]
      ]
    },
    {
      "categories": [
        {
          "_dis": "32 15 29 16 3 5",
          "kind": "other",
          "name": "Tagalog entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "24 24 22 20 4 7",
          "kind": "other",
          "name": "Tagalog terms with Baybayin script",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "23 22 23 20 4 7",
          "kind": "other",
          "name": "Tagalog terms with missing Baybayin script entries",
          "parents": [],
          "source": "w+disamb"
        }
      ],
      "examples": [
        {
          "english": "rotten system",
          "text": "bulok na sistema",
          "type": "example"
        }
      ],
      "glosses": [
        "morally corrupt; wicked; immoral"
      ],
      "id": "en-bulok-tl-adj-hk9Fn514",
      "links": [
        [
          "morally",
          "morally"
        ],
        [
          "corrupt",
          "corrupt"
        ],
        [
          "wicked",
          "wicked"
        ],
        [
          "immoral",
          "immoral"
        ]
      ],
      "raw_glosses": [
        "(figuratively) morally corrupt; wicked; immoral"
      ],
      "synonyms": [
        {
          "word": "masama"
        }
      ],
      "tags": [
        "figuratively"
      ]
    },
    {
      "categories": [
        {
          "_dis": "32 15 29 16 3 5",
          "kind": "other",
          "name": "Tagalog entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "24 24 22 20 4 7",
          "kind": "other",
          "name": "Tagalog terms with Baybayin script",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "23 22 23 20 4 7",
          "kind": "other",
          "name": "Tagalog terms with missing Baybayin script entries",
          "parents": [],
          "source": "w+disamb"
        }
      ],
      "glosses": [
        "of poor quality; inferior; bum"
      ],
      "id": "en-bulok-tl-adj-m8D4njMm",
      "links": [
        [
          "poor",
          "poor"
        ],
        [
          "quality",
          "quality"
        ],
        [
          "inferior",
          "inferior"
        ],
        [
          "bum",
          "bum"
        ]
      ],
      "raw_glosses": [
        "(figuratively) of poor quality; inferior; bum"
      ],
      "tags": [
        "figuratively"
      ]
    },
    {
      "glosses": [
        "inefficient; incapable (in work, school, etc.)"
      ],
      "id": "en-bulok-tl-adj-OXO~EKGU",
      "links": [
        [
          "inefficient",
          "inefficient"
        ],
        [
          "incapable",
          "incapable"
        ]
      ],
      "raw_glosses": [
        "(figuratively) inefficient; incapable (in work, school, etc.)"
      ],
      "tags": [
        "figuratively"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/buˈlok/"
    },
    {
      "ipa": "[bʊˈlok]"
    }
  ],
  "synonyms": [
    {
      "_dis1": "0 0 0 0 0 0",
      "tags": [
        "obsolete"
      ],
      "word": "boloc"
    },
    {
      "_dis1": "0 0 0 0 0 0",
      "word": "Spanish-based orthography"
    }
  ],
  "word": "bulok"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "tl",
        "2": "ms",
        "3": "buruk"
      },
      "expansion": "Malay buruk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "poz-mly-pro",
        "3": "*buruk"
      },
      "expansion": "Proto-Malayic *buruk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "poz-pro",
        "3": "*buʀuk"
      },
      "expansion": "Proto-Malayo-Polynesian *buʀuk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "map-pro",
        "3": "*buʀuk"
      },
      "expansion": "Proto-Austronesian *buʀuk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "bugok"
      },
      "expansion": "Doublet of bugok",
      "name": "doublet"
    }
  ],
  "etymology_text": "From Malay buruk, from Proto-Malayic *buruk, from Proto-Malayo-Polynesian *buʀuk, from Proto-Austronesian *buʀuk. Doublet of bugok.",
  "forms": [
    {
      "form": "bulók",
      "tags": [
        "canonical"
      ]
    },
    {
      "form": "ᜊᜓᜎᜓᜃ᜔",
      "tags": [
        "Baybayin"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bulók",
        "b": "+"
      },
      "expansion": "bulók (Baybayin spelling ᜊᜓᜎᜓᜃ᜔)",
      "name": "tl-noun"
    }
  ],
  "hyphenation": [
    "bu‧lok"
  ],
  "lang": "Tagalog",
  "lang_code": "tl",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "putrid smell (of rotting meat, flesh, garbage, etc.)"
      ],
      "id": "en-bulok-tl-noun-DIJl1del",
      "links": [
        [
          "putrid",
          "putrid"
        ],
        [
          "smell",
          "smell"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/buˈlok/"
    },
    {
      "ipa": "[bʊˈlok]"
    }
  ],
  "synonyms": [
    {
      "_dis1": "0 0 0 0 0 0",
      "tags": [
        "obsolete"
      ],
      "word": "boloc"
    },
    {
      "_dis1": "0 0 0 0 0 0",
      "word": "Spanish-based orthography"
    }
  ],
  "word": "bulok"
}
{
  "categories": [
    "Tagalog 2-syllable words",
    "Tagalog adjectives",
    "Tagalog doublets",
    "Tagalog entries with incorrect language header",
    "Tagalog lemmas",
    "Tagalog nouns",
    "Tagalog terms derived from Malay",
    "Tagalog terms derived from Proto-Austronesian",
    "Tagalog terms derived from Proto-Malayic",
    "Tagalog terms derived from Proto-Malayo-Polynesian",
    "Tagalog terms with Baybayin script",
    "Tagalog terms with IPA pronunciation",
    "Tagalog terms with missing Baybayin script entries"
  ],
  "derived": [
    {
      "word": "bulukin"
    },
    {
      "word": "bumulok"
    },
    {
      "word": "di-nabubulok"
    },
    {
      "word": "kabulukan"
    },
    {
      "word": "mabulok"
    },
    {
      "word": "nabubulok"
    },
    {
      "word": "nakabubulok"
    },
    {
      "word": "pagkabulok"
    }
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "tl",
        "2": "ms",
        "3": "buruk"
      },
      "expansion": "Malay buruk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "poz-mly-pro",
        "3": "*buruk"
      },
      "expansion": "Proto-Malayic *buruk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "poz-pro",
        "3": "*buʀuk"
      },
      "expansion": "Proto-Malayo-Polynesian *buʀuk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "map-pro",
        "3": "*buʀuk"
      },
      "expansion": "Proto-Austronesian *buʀuk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "bugok"
      },
      "expansion": "Doublet of bugok",
      "name": "doublet"
    }
  ],
  "etymology_text": "From Malay buruk, from Proto-Malayic *buruk, from Proto-Malayo-Polynesian *buʀuk, from Proto-Austronesian *buʀuk. Doublet of bugok.",
  "forms": [
    {
      "form": "bulók",
      "tags": [
        "canonical"
      ]
    },
    {
      "form": "ᜊᜓᜎᜓᜃ᜔",
      "tags": [
        "Baybayin"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bulók",
        "b": "+"
      },
      "expansion": "bulók (Baybayin spelling ᜊᜓᜎᜓᜃ᜔)",
      "name": "tl-adj"
    }
  ],
  "hyphenation": [
    "bu‧lok"
  ],
  "lang": "Tagalog",
  "lang_code": "tl",
  "pos": "adj",
  "related": [
    {
      "word": "bungkok"
    }
  ],
  "senses": [
    {
      "glosses": [
        "rotten; decomposed; decayed"
      ],
      "links": [
        [
          "rotten",
          "rotten"
        ],
        [
          "decomposed",
          "decomposed"
        ],
        [
          "decayed",
          "decayed"
        ]
      ],
      "synonyms": [
        {
          "word": "agnas"
        },
        {
          "word": "sira"
        },
        {
          "topics": [
            "food",
            "lifestyle"
          ],
          "word": "panis"
        },
        {
          "topics": [
            "fish",
            "ichthyology",
            "zoology",
            "biology",
            "natural-sciences"
          ],
          "word": "bilasa"
        },
        {
          "english": "egg",
          "word": "bugok"
        }
      ]
    },
    {
      "glosses": [
        "stinking; ill-smelling"
      ],
      "links": [
        [
          "stinking",
          "stinking"
        ],
        [
          "ill",
          "ill"
        ],
        [
          "smelling",
          "smelling"
        ]
      ]
    },
    {
      "categories": [
        "Tagalog terms with usage examples"
      ],
      "examples": [
        {
          "english": "rotten system",
          "text": "bulok na sistema",
          "type": "example"
        }
      ],
      "glosses": [
        "morally corrupt; wicked; immoral"
      ],
      "links": [
        [
          "morally",
          "morally"
        ],
        [
          "corrupt",
          "corrupt"
        ],
        [
          "wicked",
          "wicked"
        ],
        [
          "immoral",
          "immoral"
        ]
      ],
      "raw_glosses": [
        "(figuratively) morally corrupt; wicked; immoral"
      ],
      "synonyms": [
        {
          "word": "masama"
        }
      ],
      "tags": [
        "figuratively"
      ]
    },
    {
      "glosses": [
        "of poor quality; inferior; bum"
      ],
      "links": [
        [
          "poor",
          "poor"
        ],
        [
          "quality",
          "quality"
        ],
        [
          "inferior",
          "inferior"
        ],
        [
          "bum",
          "bum"
        ]
      ],
      "raw_glosses": [
        "(figuratively) of poor quality; inferior; bum"
      ],
      "tags": [
        "figuratively"
      ]
    },
    {
      "glosses": [
        "inefficient; incapable (in work, school, etc.)"
      ],
      "links": [
        [
          "inefficient",
          "inefficient"
        ],
        [
          "incapable",
          "incapable"
        ]
      ],
      "raw_glosses": [
        "(figuratively) inefficient; incapable (in work, school, etc.)"
      ],
      "tags": [
        "figuratively"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/buˈlok/"
    },
    {
      "ipa": "[bʊˈlok]"
    }
  ],
  "synonyms": [
    {
      "tags": [
        "obsolete"
      ],
      "word": "boloc"
    },
    {
      "word": "Spanish-based orthography"
    }
  ],
  "word": "bulok"
}

{
  "categories": [
    "Tagalog 2-syllable words",
    "Tagalog adjectives",
    "Tagalog doublets",
    "Tagalog entries with incorrect language header",
    "Tagalog lemmas",
    "Tagalog nouns",
    "Tagalog terms derived from Malay",
    "Tagalog terms derived from Proto-Austronesian",
    "Tagalog terms derived from Proto-Malayic",
    "Tagalog terms derived from Proto-Malayo-Polynesian",
    "Tagalog terms with Baybayin script",
    "Tagalog terms with IPA pronunciation",
    "Tagalog terms with missing Baybayin script entries"
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "tl",
        "2": "ms",
        "3": "buruk"
      },
      "expansion": "Malay buruk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "poz-mly-pro",
        "3": "*buruk"
      },
      "expansion": "Proto-Malayic *buruk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "poz-pro",
        "3": "*buʀuk"
      },
      "expansion": "Proto-Malayo-Polynesian *buʀuk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "map-pro",
        "3": "*buʀuk"
      },
      "expansion": "Proto-Austronesian *buʀuk",
      "name": "der"
    },
    {
      "args": {
        "1": "tl",
        "2": "bugok"
      },
      "expansion": "Doublet of bugok",
      "name": "doublet"
    }
  ],
  "etymology_text": "From Malay buruk, from Proto-Malayic *buruk, from Proto-Malayo-Polynesian *buʀuk, from Proto-Austronesian *buʀuk. Doublet of bugok.",
  "forms": [
    {
      "form": "bulók",
      "tags": [
        "canonical"
      ]
    },
    {
      "form": "ᜊᜓᜎᜓᜃ᜔",
      "tags": [
        "Baybayin"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "bulók",
        "b": "+"
      },
      "expansion": "bulók (Baybayin spelling ᜊᜓᜎᜓᜃ᜔)",
      "name": "tl-noun"
    }
  ],
  "hyphenation": [
    "bu‧lok"
  ],
  "lang": "Tagalog",
  "lang_code": "tl",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "putrid smell (of rotting meat, flesh, garbage, etc.)"
      ],
      "links": [
        [
          "putrid",
          "putrid"
        ],
        [
          "smell",
          "smell"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/buˈlok/"
    },
    {
      "ipa": "[bʊˈlok]"
    }
  ],
  "synonyms": [
    {
      "tags": [
        "obsolete"
      ],
      "word": "boloc"
    },
    {
      "word": "Spanish-based orthography"
    }
  ],
  "word": "bulok"
}

This page is a part of the kaikki.org machine-readable Tagalog dictionary. This dictionary is based on structured data extracted on 2024-05-10 from the enwiktionary dump dated 2024-05-02 using wiktextract (a644e18 and edd475d). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.
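
As a rough illustration of how the structured data on this page might be consumed, the Python sketch below reads entries and lists their glosses by part of speech. It assumes the data arrives as JSON Lines (one entry object per line), which may not match every download. The sample string is a trimmed copy of the two entries above, and the helper names `iter_entries` and `glosses_by_pos` are hypothetical, not part of wiktextract.

```python
import json

# Trimmed copy of the two "bulok" entries, one JSON object per line.
# The one-object-per-line layout is an assumption about the download format.
SAMPLE_JSONL = '''
{"word": "bulok", "lang_code": "tl", "pos": "adj", "senses": [{"glosses": ["rotten; decomposed; decayed"]}, {"glosses": ["morally corrupt; wicked; immoral"], "tags": ["figuratively"]}]}
{"word": "bulok", "lang_code": "tl", "pos": "noun", "senses": [{"glosses": ["putrid smell (of rotting meat, flesh, garbage, etc.)"]}]}
'''


def iter_entries(text):
    """Yield one dict per non-empty line of JSON Lines text (hypothetical helper)."""
    for line in text.splitlines():
        line = line.strip()
        if line:
            yield json.loads(line)


def glosses_by_pos(entries, word):
    """Map part of speech to a list of (gloss, tags) pairs for the given word."""
    result = {}
    for entry in entries:
        if entry.get("word") != word:
            continue
        collected = result.setdefault(entry.get("pos", "unknown"), [])
        for sense in entry.get("senses", []):
            # Not every sense carries "tags", so fall back to an empty list.
            for gloss in sense.get("glosses", []):
                collected.append((gloss, sense.get("tags", [])))
    return result


summary = glosses_by_pos(iter_entries(SAMPLE_JSONL), "bulok")
for pos, senses in summary.items():
    for gloss, tags in senses:
        label = f" [{', '.join(tags)}]" if tags else ""
        print(f"{pos}: {gloss}{label}")
```

Keys such as `tags`, `synonyms`, and `examples` appear only on some senses in the data above, so the sketch uses `dict.get` with defaults instead of direct indexing.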

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.