"さん" meaning in All languages combined

See さん on Wiktionary

Suffix [Japanese]

Forms: -san [romanization]
Etymology: Derived from 様(さま) (sama). Etymology templates: {{ja-see-kango|三|山|参|産|酸|賛|餐}}, {{ja-r|様|さま}} 様(さま) (sama) Head templates: {{ja-pos|suffix}} さん • (-san)
  1. title used after a person's name (family, given or full) or job title, or a company name, to show respect; Mr, Ms, Mrs, Miss Tags: morpheme Synonyms: さま
    Sense id: en-さん-ja-suffix-4FI4nZq4 Categories (other): Japanese entries with incorrect language header, Japanese links with redundant wikilinks, Japanese terms with redundant sortkeys, Pages with 2 entries, Pages with entries Disambiguation of Japanese entries with incorrect language header: 65 9 26 Disambiguation of Japanese links with redundant wikilinks: 62 14 24 Disambiguation of Japanese terms with redundant sortkeys: 63 14 23 Disambiguation of Pages with 2 entries: 56 6 31 2 5 Disambiguation of Pages with entries: 68 4 23 1 3
  2. (colloquial) used after a shop name Tags: colloquial, morpheme
    Sense id: en-さん-ja-suffix-~Xkcx9mA
  3. (polite) attaching to nouns or other nominals: a politeness marker that often has no direct translation, replacing copula です (desu) Tags: morpheme, polite Synonyms: さま
    Sense id: en-さん-ja-suffix-mhtwHmVW Categories (other): Japanese terms with non-redundant manual script codes
The following are not (yet) sense-disambiguated
Related terms: (sama) (ruby: (さま)) (english: more respectful), ちゃん (chan) (english: more familiar, especially of young women and children), (kun) (ruby: (くん)) (english: more familiar, especially of men), 殿 (dono) (ruby: 殿(どの)) (english: more respectful)
Etymology number: 2

Noun [Okinawan]

Forms: san [romanization]
Head templates: {{ryu-noun}} さん (san)
  1. 山: mountain Categories (place): Landforms
    Sense id: en-さん-ryu-noun-uMwAQRUu Disambiguation of Landforms: 54 46
The following are not (yet) sense-disambiguated
Etymology number: 1

Suffix [Okinawan]

Forms: -san [romanization]
Etymology: Generally held to be a combination of an adjective nominalizer suffix cognate to Japanese さ (-sa) and the verb 有ん (an, “to be, exist, have”). Head templates: {{ryu-head|suffix}} さん (-san)
  1. Terminal-form ending for inflected adjectives. Tags: morpheme Categories (place): Landforms
    Sense id: en-さん-ryu-suffix-xBQKM4Qz Disambiguation of Landforms: 54 46 Categories (other): Okinawan entries with incorrect language header, Okinawan hiragana, Okinawan terms with redundant sortkeys Disambiguation of Okinawan entries with incorrect language header: 13 87 Disambiguation of Okinawan hiragana: 26 74 Disambiguation of Okinawan terms with redundant sortkeys: 5 95
The following are not (yet) sense-disambiguated
Etymology number: 2

Alternative forms

{
  "descendants": [
    {
      "depth": 1,
      "templates": [
        {
          "args": {
            "1": "en",
            "2": "-san",
            "bor": "1"
          },
          "expansion": "→ English: -san",
          "name": "desc"
        }
      ],
      "text": "→ English: -san"
    },
    {
      "depth": 1,
      "templates": [
        {
          "args": {
            "1": "ko",
            "2": "상",
            "bor": "1"
          },
          "expansion": "→ Korean: 상 (sang)",
          "name": "desc"
        }
      ],
      "text": "→ Korean: 상 (sang)"
    },
    {
      "depth": 1,
      "templates": [
        {
          "args": {
            "1": "cmn",
            "2": "桑",
            "bor": "1"
          },
          "expansion": "→ Mandarin: 桑 (sāng)",
          "name": "desc"
        }
      ],
      "text": "→ Mandarin: 桑 (sāng)"
    }
  ],
  "etymology_number": 2,
  "etymology_templates": [
    {
      "args": {
        "1": "三",
        "2": "山",
        "3": "参",
        "4": "産",
        "5": "酸",
        "6": "賛",
        "7": "餐"
      },
      "expansion": "",
      "name": "ja-see-kango"
    },
    {
      "args": {
        "1": "様",
        "2": "さま"
      },
      "expansion": "様(さま) (sama)",
      "name": "ja-r"
    }
  ],
  "etymology_text": "Derived from 様(さま) (sama).",
  "forms": [
    {
      "form": "-san",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "suffix"
      },
      "expansion": "さん • (-san)",
      "name": "ja-pos"
    }
  ],
  "lang": "Japanese",
  "lang_code": "ja",
  "pos": "suffix",
  "redirects": [
    "三",
    "山",
    "参",
    "産",
    "酸",
    "賛",
    "餐"
  ],
  "related": [
    {
      "_dis1": "0 0 0",
      "english": "more respectful",
      "roman": "sama",
      "ruby": [
        [
          "様",
          "さま"
        ]
      ],
      "word": "様"
    },
    {
      "_dis1": "0 0 0",
      "english": "more familiar, especially of young women and children",
      "roman": "chan",
      "word": "ちゃん"
    },
    {
      "_dis1": "0 0 0",
      "english": "more familiar, especially of men",
      "roman": "kun",
      "ruby": [
        [
          "君",
          "くん"
        ]
      ],
      "word": "君"
    },
    {
      "_dis1": "0 0 0",
      "english": "more respectful",
      "roman": "dono",
      "ruby": [
        [
          "殿",
          "どの"
        ]
      ],
      "word": "殿"
    }
  ],
  "senses": [
    {
      "categories": [
        {
          "_dis": "65 9 26",
          "kind": "other",
          "name": "Japanese entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "62 14 24",
          "kind": "other",
          "name": "Japanese links with redundant wikilinks",
          "parents": [
            "Links with redundant wikilinks",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "63 14 23",
          "kind": "other",
          "name": "Japanese terms with redundant sortkeys",
          "parents": [
            "Terms with redundant sortkeys",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "56 6 31 2 5",
          "kind": "other",
          "name": "Pages with 2 entries",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "68 4 23 1 3",
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w+disamb"
        }
      ],
      "examples": [
        {
          "ruby": [
            [
              "山",
              "やま"
            ],
            [
              "田",
              "だ"
            ]
          ],
          "text": "山田さん ― Yamada-san ― Mr/Ms Yamada"
        },
        {
          "text": "あきらさん ― Akira-san ― Akira",
          "type": "example"
        },
        {
          "ruby": [
            [
              "山",
              "やま"
            ],
            [
              "田",
              "だ"
            ]
          ],
          "text": "山田あきらさん ― Yamada Akira-san ― Mr/Ms Akira Yamada"
        },
        {
          "english": "Sir/Madam (when talking to a shop clerk) (literally, “Mr/Ms shop clerk”)",
          "roman": "ten'in-san",
          "ruby": [
            [
              "店",
              "てん"
            ],
            [
              "員",
              "いん"
            ]
          ],
          "text": "店員さん"
        },
        {
          "english": "Sir/Madam (when talking to a taxi/bus driver) (literally, “Mr/Ms driver”)",
          "roman": "untenshu-san",
          "ruby": [
            [
              "運",
              "うん"
            ],
            [
              "転",
              "てん"
            ],
            [
              "手",
              "しゅ"
            ]
          ],
          "text": "運転手さん"
        },
        {
          "english": "Sir/Madam (used in business by people meeting Sony)",
          "roman": "Sonī-san",
          "text": "ソニーさん",
          "type": "example"
        }
      ],
      "glosses": [
        "title used after a person's name (family, given or full) or job title, or a company name, to show respect; Mr, Ms, Mrs, Miss"
      ],
      "id": "en-さん-ja-suffix-4FI4nZq4",
      "links": [
        [
          "Mr",
          "Mr"
        ],
        [
          "Ms",
          "Ms"
        ],
        [
          "Mrs",
          "Mrs"
        ],
        [
          "Miss",
          "Miss"
        ]
      ],
      "synonyms": [
        {
          "word": "さま"
        }
      ],
      "tags": [
        "morpheme"
      ]
    },
    {
      "categories": [],
      "examples": [
        {
          "english": "There's a barber's in front of the school.",
          "roman": "Gakkō no mae ni tokoya-san ga aru.",
          "ruby": [
            [
              "学",
              "がっ"
            ],
            [
              "校",
              "こう"
            ],
            [
              "前",
              "まえ"
            ],
            [
              "床",
              "とこ"
            ],
            [
              "屋",
              "や"
            ]
          ],
          "text": "学校の前に床屋さんがある。",
          "type": "example"
        }
      ],
      "glosses": [
        "used after a shop name"
      ],
      "id": "en-さん-ja-suffix-~Xkcx9mA",
      "raw_glosses": [
        "(colloquial) used after a shop name"
      ],
      "tags": [
        "colloquial",
        "morpheme"
      ]
    },
    {
      "categories": [
        {
          "kind": "other",
          "name": "Japanese terms with non-redundant manual script codes",
          "parents": [
            "Terms with non-redundant manual script codes",
            "Entry maintenance"
          ],
          "source": "w"
        }
      ],
      "examples": [
        {
          "english": "(polite, uncommon) thank you",
          "roman": "arigatōsan",
          "text": "ありがとうさん",
          "type": "example"
        }
      ],
      "glosses": [
        "attaching to nouns or other nominals: a politeness marker that often has no direct translation, replacing copula です (desu)"
      ],
      "id": "en-さん-ja-suffix-mhtwHmVW",
      "links": [
        [
          "politeness",
          "politeness"
        ],
        [
          "copula",
          "copula"
        ]
      ],
      "raw_glosses": [
        "(polite) attaching to nouns or other nominals: a politeness marker that often has no direct translation, replacing copula です (desu)"
      ],
      "synonyms": [
        {
          "word": "さま"
        }
      ],
      "tags": [
        "morpheme",
        "polite"
      ]
    }
  ],
  "word": "さん"
}

{
  "etymology_number": 1,
  "forms": [
    {
      "form": "san",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "さん (san)",
      "name": "ryu-noun"
    }
  ],
  "lang": "Okinawan",
  "lang_code": "ryu",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "_dis": "54 46",
          "kind": "place",
          "langcode": "ryu",
          "name": "Landforms",
          "orig": "ryu:Landforms",
          "parents": [
            "Earth",
            "Places",
            "Nature",
            "Names",
            "All topics",
            "Proper nouns",
            "Terms by semantic function",
            "Fundamental",
            "Nouns",
            "Lemmas"
          ],
          "source": "w+disamb"
        }
      ],
      "glosses": [
        "山: mountain"
      ],
      "id": "en-さん-ryu-noun-uMwAQRUu",
      "links": [
        [
          "山",
          "山#Okinawan"
        ],
        [
          "mountain",
          "mountain"
        ]
      ]
    }
  ],
  "word": "さん"
}

{
  "etymology_number": 2,
  "etymology_text": "Generally held to be a combination of an adjective nominalizer suffix cognate to Japanese さ (-sa) and the verb 有ん (an, “to be, exist, have”).",
  "forms": [
    {
      "form": "-san",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "suffix"
      },
      "expansion": "さん (-san)",
      "name": "ryu-head"
    }
  ],
  "lang": "Okinawan",
  "lang_code": "ryu",
  "pos": "suffix",
  "senses": [
    {
      "categories": [
        {
          "_dis": "13 87",
          "kind": "other",
          "name": "Okinawan entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "26 74",
          "kind": "other",
          "name": "Okinawan hiragana",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "5 95",
          "kind": "other",
          "name": "Okinawan terms with redundant sortkeys",
          "parents": [
            "Terms with redundant sortkeys",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "54 46",
          "kind": "place",
          "langcode": "ryu",
          "name": "Landforms",
          "orig": "ryu:Landforms",
          "parents": [
            "Earth",
            "Places",
            "Nature",
            "Names",
            "All topics",
            "Proper nouns",
            "Terms by semantic function",
            "Fundamental",
            "Nouns",
            "Lemmas"
          ],
          "source": "w+disamb"
        }
      ],
      "examples": [
        {
          "english": "It is white.",
          "roman": "shirusan",
          "text": "白(しる)さん",
          "type": "example"
        }
      ],
      "glosses": [
        "Terminal-form ending for inflected adjectives."
      ],
      "id": "en-さん-ryu-suffix-xBQKM4Qz",
      "tags": [
        "morpheme"
      ]
    }
  ],
  "word": "さん"
}
{
  "categories": [
    "Japanese entries with incorrect language header",
    "Japanese hiragana",
    "Japanese lemmas",
    "Japanese links with redundant wikilinks",
    "Japanese suffixes",
    "Japanese terms with redundant sortkeys",
    "Pages with 2 entries",
    "Pages with entries",
    "ryu:Landforms"
  ],
  "descendants": [
    {
      "depth": 1,
      "templates": [
        {
          "args": {
            "1": "en",
            "2": "-san",
            "bor": "1"
          },
          "expansion": "→ English: -san",
          "name": "desc"
        }
      ],
      "text": "→ English: -san"
    },
    {
      "depth": 1,
      "templates": [
        {
          "args": {
            "1": "ko",
            "2": "상",
            "bor": "1"
          },
          "expansion": "→ Korean: 상 (sang)",
          "name": "desc"
        }
      ],
      "text": "→ Korean: 상 (sang)"
    },
    {
      "depth": 1,
      "templates": [
        {
          "args": {
            "1": "cmn",
            "2": "桑",
            "bor": "1"
          },
          "expansion": "→ Mandarin: 桑 (sāng)",
          "name": "desc"
        }
      ],
      "text": "→ Mandarin: 桑 (sāng)"
    }
  ],
  "etymology_number": 2,
  "etymology_templates": [
    {
      "args": {
        "1": "三",
        "2": "山",
        "3": "参",
        "4": "産",
        "5": "酸",
        "6": "賛",
        "7": "餐"
      },
      "expansion": "",
      "name": "ja-see-kango"
    },
    {
      "args": {
        "1": "様",
        "2": "さま"
      },
      "expansion": "様(さま) (sama)",
      "name": "ja-r"
    }
  ],
  "etymology_text": "Derived from 様(さま) (sama).",
  "forms": [
    {
      "form": "-san",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "suffix"
      },
      "expansion": "さん • (-san)",
      "name": "ja-pos"
    }
  ],
  "lang": "Japanese",
  "lang_code": "ja",
  "pos": "suffix",
  "redirects": [
    "三",
    "山",
    "参",
    "産",
    "酸",
    "賛",
    "餐"
  ],
  "related": [
    {
      "english": "more respectful",
      "roman": "sama",
      "ruby": [
        [
          "様",
          "さま"
        ]
      ],
      "word": "様"
    },
    {
      "english": "more familiar, especially of young women and children",
      "roman": "chan",
      "word": "ちゃん"
    },
    {
      "english": "more familiar, especially of men",
      "roman": "kun",
      "ruby": [
        [
          "君",
          "くん"
        ]
      ],
      "word": "君"
    },
    {
      "english": "more respectful",
      "roman": "dono",
      "ruby": [
        [
          "殿",
          "どの"
        ]
      ],
      "word": "殿"
    }
  ],
  "senses": [
    {
      "categories": [
        "Japanese terms with usage examples"
      ],
      "examples": [
        {
          "ruby": [
            [
              "山",
              "やま"
            ],
            [
              "田",
              "だ"
            ]
          ],
          "text": "山田さん ― Yamada-san ― Mr/Ms Yamada"
        },
        {
          "text": "あきらさん ― Akira-san ― Akira",
          "type": "example"
        },
        {
          "ruby": [
            [
              "山",
              "やま"
            ],
            [
              "田",
              "だ"
            ]
          ],
          "text": "山田あきらさん ― Yamada Akira-san ― Mr/Ms Akira Yamada"
        },
        {
          "english": "Sir/Madam (when talking to a shop clerk) (literally, “Mr/Ms shop clerk”)",
          "roman": "ten'in-san",
          "ruby": [
            [
              "店",
              "てん"
            ],
            [
              "員",
              "いん"
            ]
          ],
          "text": "店員さん"
        },
        {
          "english": "Sir/Madam (when talking to a taxi/bus driver) (literally, “Mr/Ms driver”)",
          "roman": "untenshu-san",
          "ruby": [
            [
              "運",
              "うん"
            ],
            [
              "転",
              "てん"
            ],
            [
              "手",
              "しゅ"
            ]
          ],
          "text": "運転手さん"
        },
        {
          "english": "Sir/Madam (used in business by people meeting Sony)",
          "roman": "Sonī-san",
          "text": "ソニーさん",
          "type": "example"
        }
      ],
      "glosses": [
        "title used after a person's name (family, given or full) or job title, or a company name, to show respect; Mr, Ms, Mrs, Miss"
      ],
      "links": [
        [
          "Mr",
          "Mr"
        ],
        [
          "Ms",
          "Ms"
        ],
        [
          "Mrs",
          "Mrs"
        ],
        [
          "Miss",
          "Miss"
        ]
      ],
      "synonyms": [
        {
          "word": "さま"
        }
      ],
      "tags": [
        "morpheme"
      ]
    },
    {
      "categories": [
        "Japanese colloquialisms",
        "Japanese terms with usage examples"
      ],
      "examples": [
        {
          "english": "There's a barber's in front of the school.",
          "roman": "Gakkō no mae ni tokoya-san ga aru.",
          "ruby": [
            [
              "学",
              "がっ"
            ],
            [
              "校",
              "こう"
            ],
            [
              "前",
              "まえ"
            ],
            [
              "床",
              "とこ"
            ],
            [
              "屋",
              "や"
            ]
          ],
          "text": "学校の前に床屋さんがある。",
          "type": "example"
        }
      ],
      "glosses": [
        "used after a shop name"
      ],
      "raw_glosses": [
        "(colloquial) used after a shop name"
      ],
      "tags": [
        "colloquial",
        "morpheme"
      ]
    },
    {
      "categories": [
        "Japanese polite terms",
        "Japanese terms with non-redundant manual script codes",
        "Japanese terms with usage examples"
      ],
      "examples": [
        {
          "english": "(polite, uncommon) thank you",
          "roman": "arigatōsan",
          "text": "ありがとうさん",
          "type": "example"
        }
      ],
      "glosses": [
        "attaching to nouns or other nominals: a politeness marker that often has no direct translation, replacing copula です (desu)"
      ],
      "links": [
        [
          "politeness",
          "politeness"
        ],
        [
          "copula",
          "copula"
        ]
      ],
      "raw_glosses": [
        "(polite) attaching to nouns or other nominals: a politeness marker that often has no direct translation, replacing copula です (desu)"
      ],
      "synonyms": [
        {
          "word": "さま"
        }
      ],
      "tags": [
        "morpheme",
        "polite"
      ]
    }
  ],
  "word": "さん"
}

{
  "categories": [
    "Okinawan entries with incorrect language header",
    "Okinawan hiragana",
    "Okinawan lemmas",
    "Okinawan nouns",
    "Okinawan suffixes",
    "Okinawan terms with redundant sortkeys",
    "Pages with 2 entries",
    "Pages with entries",
    "ryu:Landforms"
  ],
  "etymology_number": 1,
  "forms": [
    {
      "form": "san",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "さん (san)",
      "name": "ryu-noun"
    }
  ],
  "lang": "Okinawan",
  "lang_code": "ryu",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "山: mountain"
      ],
      "links": [
        [
          "山",
          "山#Okinawan"
        ],
        [
          "mountain",
          "mountain"
        ]
      ]
    }
  ],
  "word": "さん"
}

{
  "categories": [
    "Okinawan entries with incorrect language header",
    "Okinawan hiragana",
    "Okinawan lemmas",
    "Okinawan suffixes",
    "Okinawan terms with redundant sortkeys",
    "Pages with 2 entries",
    "Pages with entries",
    "ryu:Landforms"
  ],
  "etymology_number": 2,
  "etymology_text": "Generally held to be a combination of an adjective nominalizer suffix cognate to Japanese さ (-sa) and the verb 有ん (an, “to be, exist, have”).",
  "forms": [
    {
      "form": "-san",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "suffix"
      },
      "expansion": "さん (-san)",
      "name": "ryu-head"
    }
  ],
  "lang": "Okinawan",
  "lang_code": "ryu",
  "pos": "suffix",
  "senses": [
    {
      "categories": [
        "Okinawan terms with usage examples"
      ],
      "examples": [
        {
          "english": "It is white.",
          "roman": "shirusan",
          "text": "白(しる)さん",
          "type": "example"
        }
      ],
      "glosses": [
        "Terminal-form ending for inflected adjectives."
      ],
      "tags": [
        "morpheme"
      ]
    }
  ],
  "word": "さん"
}

Download raw JSONL data for さん meaning in All languages combined (5.6kB)

{
  "called_from": "parser/1336",
  "msg": "no corresponding start tag found for </span>",
  "path": [
    "さん"
  ],
  "section": "Japanese",
  "subsection": "suffix",
  "title": "さん",
  "trace": ""
}

{
  "called_from": "luaexec/683",
  "msg": "LUA error in #invoke('form of/templates', 'form_of_t', 'short for', 'cat=short forms', 'withcap=1', 'withdot=1') parent ('Template:short for', {1: 'ja', 2: '参議院', 'tr': 'Sangiin', 'nodot': '1'})",
  "path": [
    "さん",
    "Template:ja-see-kango",
    "#invoke",
    "#invoke",
    "Lua:Module:ja-see:show_kango()",
    "frame:preprocess()",
    "Template:short for",
    "Template:no deprecated lang param usage",
    "ARGVAL-1",
    "#invoke",
    "#invoke"
  ],
  "section": "Japanese",
  "subsection": "",
  "title": "さん",
  "trace": "[string \"Module:form of/templates\"]:550: attempt to call upvalue 'get_force_cat' (a nil value)"
}

{
  "called_from": "luaexec/683",
  "msg": "LUA error in #invoke('form of/templates', 'form_of_t', 'short for', 'cat=short forms', 'withcap=1', 'withdot=1') parent ('Template:short for', {1: 'ja', 2: '[[三河]][[三河国|国]]', 'tr': 'Mikawa-no-kuni', 'dot': ':'})",
  "path": [
    "さん",
    "Template:ja-see-kango",
    "#invoke",
    "#invoke",
    "Lua:Module:ja-see:show_kango()",
    "frame:preprocess()",
    "Template:short for",
    "Template:no deprecated lang param usage",
    "ARGVAL-1",
    "#invoke",
    "#invoke"
  ],
  "section": "Japanese",
  "subsection": "",
  "title": "さん",
  "trace": "[string \"Module:form of/templates\"]:550: attempt to call upvalue 'get_force_cat' (a nil value)"
}

This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2025-02-03 from the enwiktionary dump dated 2025-01-20 using wiktextract (05fdf6b and 9dbd323). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.