"語料庫" meaning in 漢語

See 語料庫 in All languages combined, or Wiktionary

Noun

IPA: /y²¹⁴⁻²¹ li̯ɑʊ̯⁵¹⁻⁵³ kʰu⁵¹/ [Mandarin, Standard Chinese, Sinological-IPA], /jyː¹³ liːu̯²²⁻³⁵ fuː³³/ [Cantonese, IPA], /ɲi²⁴⁻¹¹ li̯au̯⁵⁵ kʰu⁵⁵/ [IPA] Forms: 语料库 [Simplified Chinese]
  1. 通常經過整理,具有既定格式與標記的大量文本
    Sense id: zh-語料庫-zh-noun-o4-pYqGM Categories (other): 漢語 計算機, 漢語 語言學 Topics: computing, linguistics
The following are not (yet) sense-disambiguated
Derived forms: 語料庫語言學 (yǔliàokù yǔyánxué) [Traditional Chinese], 语料库语言学 (yǔliàokù yǔyánxué) [Simplified Chinese] Translations (經過整理、具有既定格式與標記的大量文本): tekstaro (世界語), korpuso (世界語), korpus [neuter] (丹麥語), ко́рпус [masculine] (俄語), собра́ние [neuter] (俄語), ко́рпус [masculine] (保加利亞語), corpus [masculine] (加泰羅尼亞語), korpusz (匈牙利語), külliyat (土耳其語), σώμα [neuter] (希臘語), συλλογή [feminine] (希臘語), Korpus [neuter] (德語), Textkorpus [neuter] (德語), corpus [masculine] (意大利語), korpus (愛沙尼亞語), korpus [masculine] (挪威語), korpus [masculine] (捷克語), korpus [masculine] (斯洛伐克語), korpus [masculine] (斯洛文尼亞語), コーパス (kōpasu) (日語), 말뭉치 (朝鮮語), 코퍼스 (朝鮮語), putunga kōrero (毛利語), whakaputunga (毛利語), corpus [masculine] (法語), ко́рпус [masculine] (烏克蘭語), збі́рник [masculine] (烏克蘭語), korpus [common] (瑞典語), språkbank [common] (瑞典語), ко́рпус [masculine] (白俄羅斯語), збор [masculine] (白俄羅斯語), korpus (芬蘭語), corpus (英語), corpus [neuter] (荷蘭語), corpus [masculine] (葡萄牙語), corpus [masculine] (西班牙語), مَتْن [masculine] (阿拉伯語), مَكْنَز لُغَوِيّ [masculine] (阿拉伯語), ко́рпус [masculine] (馬其頓語)
{
  "categories": [
    {
      "kind": "other",
      "name": "官話名詞",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "官話詞元",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "客家語名詞",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "客家語詞元",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "帶「庫」的漢語詞",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "帶「料」的漢語詞",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "帶「語」的漢語詞",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "有1個詞條的頁面",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "有國際音標的漢語詞",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "漢語名詞",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "漢語詞元",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "粵語名詞",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "粵語詞元",
      "parents": [],
      "source": "w"
    }
  ],
  "derived": [
    {
      "roman": "yǔliàokù yǔyánxué",
      "tags": [
        "Traditional Chinese"
      ],
      "word": "語料庫語言學"
    },
    {
      "roman": "yǔliàokù yǔyánxué",
      "tags": [
        "Simplified Chinese"
      ],
      "word": "语料库语言学"
    }
  ],
  "forms": [
    {
      "form": "语料库",
      "tags": [
        "Simplified Chinese"
      ]
    }
  ],
  "lang": "漢語",
  "lang_code": "zh",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "漢語 計算機",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "漢語 語言學",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "通常經過整理,具有既定格式與標記的大量文本"
      ],
      "id": "zh-語料庫-zh-noun-o4-pYqGM",
      "topics": [
        "computing",
        "linguistics"
      ]
    }
  ],
  "sounds": [
    {
      "tags": [
        "Mandarin",
        "Pinyin"
      ],
      "zh_pron": "yǔliàokù"
    },
    {
      "tags": [
        "Mandarin",
        "Bopomofo"
      ],
      "zh_pron": "ㄩˇ ㄌㄧㄠˋ ㄎㄨˋ"
    },
    {
      "tags": [
        "Cantonese",
        "Jyutping"
      ],
      "zh_pron": "jyu⁵ liu⁶⁻² fu³"
    },
    {
      "raw_tags": [
        "客家語",
        "四縣",
        "白話字"
      ],
      "zh_pron": "ngî-liau-khu"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Pinyin"
      ],
      "zh_pron": "yǔliàokù"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Bopomofo"
      ],
      "zh_pron": "ㄩˇ ㄌㄧㄠˋ ㄎㄨˋ"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Tongyong-Pinyin"
      ],
      "zh_pron": "yǔliàokù"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Wade–Giles"
      ],
      "zh_pron": "yü³-liao⁴-kʻu⁴"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Yale"
      ],
      "zh_pron": "yǔ-lyàu-kù"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Gwoyeu-Romatsyh"
      ],
      "zh_pron": "yeuliawkuh"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Palladius"
      ],
      "zh_pron": "юйляоку (jujljaoku)"
    },
    {
      "ipa": "/y²¹⁴⁻²¹ li̯ɑʊ̯⁵¹⁻⁵³ kʰu⁵¹/",
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Sinological-IPA"
      ]
    },
    {
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "Jyutping"
      ],
      "zh_pron": "jyu⁵ liu⁶⁻² fu³"
    },
    {
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "Yale",
        "Jyutping"
      ],
      "zh_pron": "yúh líu fu"
    },
    {
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "Cantonese",
        "Pinyin"
      ],
      "zh_pron": "jy⁵ liu⁶⁻² fu³"
    },
    {
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "Guangdong-Romanization"
      ],
      "zh_pron": "yu⁵ liu⁶⁻² fu³"
    },
    {
      "ipa": "/jyː¹³ liːu̯²²⁻³⁵ fuː³³/",
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "IPA"
      ]
    },
    {
      "raw_tags": [
        "客家語",
        "四縣話",
        "包括苗栗和美濃",
        "白話字"
      ],
      "zh_pron": "ngî-liau-khu"
    },
    {
      "raw_tags": [
        "客家語",
        "四縣話",
        "包括苗栗和美濃",
        "客家語拼音"
      ],
      "zh_pron": "ngi´ liau ku"
    },
    {
      "raw_tags": [
        "客家語",
        "四縣話",
        "包括苗栗和美濃",
        "客家話拼音"
      ],
      "zh_pron": "ngi¹ liau⁴ ku⁴"
    },
    {
      "ipa": "/ɲi²⁴⁻¹¹ li̯au̯⁵⁵ kʰu⁵⁵/",
      "raw_tags": [
        "客家語",
        "四縣話",
        "包括苗栗和美濃"
      ],
      "tags": [
        "IPA"
      ]
    }
  ],
  "translations": [
    {
      "lang": "阿拉伯語",
      "lang_code": "ar",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "مَتْن"
    },
    {
      "lang": "阿拉伯語",
      "lang_code": "ar",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "مَكْنَز لُغَوِيّ"
    },
    {
      "lang": "白俄羅斯語",
      "lang_code": "be",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "白俄羅斯語",
      "lang_code": "be",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "збор"
    },
    {
      "lang": "保加利亞語",
      "lang_code": "bg",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "加泰羅尼亞語",
      "lang_code": "ca",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "捷克語",
      "lang_code": "cs",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "korpus"
    },
    {
      "lang": "丹麥語",
      "lang_code": "da",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "korpus"
    },
    {
      "lang": "荷蘭語",
      "lang_code": "nl",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "corpus"
    },
    {
      "lang": "英語",
      "lang_code": "en",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "corpus"
    },
    {
      "lang": "世界語",
      "lang_code": "eo",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "tekstaro"
    },
    {
      "lang": "世界語",
      "lang_code": "eo",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "korpuso"
    },
    {
      "lang": "愛沙尼亞語",
      "lang_code": "et",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "korpus"
    },
    {
      "lang": "芬蘭語",
      "lang_code": "fi",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "korpus"
    },
    {
      "lang": "法語",
      "lang_code": "fr",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "德語",
      "lang_code": "de",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "Korpus"
    },
    {
      "lang": "德語",
      "lang_code": "de",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "Textkorpus"
    },
    {
      "lang": "希臘語",
      "lang_code": "el",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "σώμα"
    },
    {
      "lang": "希臘語",
      "lang_code": "el",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "feminine"
      ],
      "word": "συλλογή"
    },
    {
      "lang": "匈牙利語",
      "lang_code": "hu",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "korpusz"
    },
    {
      "lang": "意大利語",
      "lang_code": "it",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "日語",
      "lang_code": "ja",
      "roman": "kōpasu",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "コーパス"
    },
    {
      "lang": "朝鮮語",
      "lang_code": "ko",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "말뭉치"
    },
    {
      "lang": "朝鮮語",
      "lang_code": "ko",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "코퍼스"
    },
    {
      "lang": "馬其頓語",
      "lang_code": "mk",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "毛利語",
      "lang_code": "mi",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "putunga kōrero"
    },
    {
      "lang": "毛利語",
      "lang_code": "mi",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "whakaputunga"
    },
    {
      "lang": "挪威語",
      "lang_code": "no",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "korpus"
    },
    {
      "lang": "葡萄牙語",
      "lang_code": "pt",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "俄語",
      "lang_code": "ru",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "俄語",
      "lang_code": "ru",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "собра́ние"
    },
    {
      "lang": "斯洛伐克語",
      "lang_code": "sk",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "korpus"
    },
    {
      "lang": "斯洛文尼亞語",
      "lang_code": "sl",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "korpus"
    },
    {
      "lang": "西班牙語",
      "lang_code": "es",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "瑞典語",
      "lang_code": "sv",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "common"
      ],
      "word": "korpus"
    },
    {
      "lang": "瑞典語",
      "lang_code": "sv",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "common"
      ],
      "word": "språkbank"
    },
    {
      "lang": "土耳其語",
      "lang_code": "tr",
      "raw_tags": [
        "單一作者的所有作品"
      ],
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "külliyat"
    },
    {
      "lang": "烏克蘭語",
      "lang_code": "uk",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "烏克蘭語",
      "lang_code": "uk",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "збі́рник"
    }
  ],
  "word": "語料庫"
}
{
  "categories": [
    "官話名詞",
    "官話詞元",
    "客家語名詞",
    "客家語詞元",
    "帶「庫」的漢語詞",
    "帶「料」的漢語詞",
    "帶「語」的漢語詞",
    "有1個詞條的頁面",
    "有國際音標的漢語詞",
    "漢語名詞",
    "漢語詞元",
    "粵語名詞",
    "粵語詞元"
  ],
  "derived": [
    {
      "roman": "yǔliàokù yǔyánxué",
      "tags": [
        "Traditional Chinese"
      ],
      "word": "語料庫語言學"
    },
    {
      "roman": "yǔliàokù yǔyánxué",
      "tags": [
        "Simplified Chinese"
      ],
      "word": "语料库语言学"
    }
  ],
  "forms": [
    {
      "form": "语料库",
      "tags": [
        "Simplified Chinese"
      ]
    }
  ],
  "lang": "漢語",
  "lang_code": "zh",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "漢語 計算機",
        "漢語 語言學"
      ],
      "glosses": [
        "通常經過整理,具有既定格式與標記的大量文本"
      ],
      "topics": [
        "computing",
        "linguistics"
      ]
    }
  ],
  "sounds": [
    {
      "tags": [
        "Mandarin",
        "Pinyin"
      ],
      "zh_pron": "yǔliàokù"
    },
    {
      "tags": [
        "Mandarin",
        "Bopomofo"
      ],
      "zh_pron": "ㄩˇ ㄌㄧㄠˋ ㄎㄨˋ"
    },
    {
      "tags": [
        "Cantonese",
        "Jyutping"
      ],
      "zh_pron": "jyu⁵ liu⁶⁻² fu³"
    },
    {
      "raw_tags": [
        "客家語",
        "四縣",
        "白話字"
      ],
      "zh_pron": "ngî-liau-khu"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Pinyin"
      ],
      "zh_pron": "yǔliàokù"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Bopomofo"
      ],
      "zh_pron": "ㄩˇ ㄌㄧㄠˋ ㄎㄨˋ"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Tongyong-Pinyin"
      ],
      "zh_pron": "yǔliàokù"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Wade–Giles"
      ],
      "zh_pron": "yü³-liao⁴-kʻu⁴"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Yale"
      ],
      "zh_pron": "yǔ-lyàu-kù"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Gwoyeu-Romatsyh"
      ],
      "zh_pron": "yeuliawkuh"
    },
    {
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Palladius"
      ],
      "zh_pron": "юйляоку (jujljaoku)"
    },
    {
      "ipa": "/y²¹⁴⁻²¹ li̯ɑʊ̯⁵¹⁻⁵³ kʰu⁵¹/",
      "tags": [
        "Mandarin",
        "Standard Chinese",
        "Sinological-IPA"
      ]
    },
    {
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "Jyutping"
      ],
      "zh_pron": "jyu⁵ liu⁶⁻² fu³"
    },
    {
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "Yale",
        "Jyutping"
      ],
      "zh_pron": "yúh líu fu"
    },
    {
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "Cantonese",
        "Pinyin"
      ],
      "zh_pron": "jy⁵ liu⁶⁻² fu³"
    },
    {
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "Guangdong-Romanization"
      ],
      "zh_pron": "yu⁵ liu⁶⁻² fu³"
    },
    {
      "ipa": "/jyː¹³ liːu̯²²⁻³⁵ fuː³³/",
      "raw_tags": [
        "標準粵語",
        "廣州–香港話"
      ],
      "tags": [
        "Cantonese",
        "IPA"
      ]
    },
    {
      "raw_tags": [
        "客家語",
        "四縣話",
        "包括苗栗和美濃",
        "白話字"
      ],
      "zh_pron": "ngî-liau-khu"
    },
    {
      "raw_tags": [
        "客家語",
        "四縣話",
        "包括苗栗和美濃",
        "客家語拼音"
      ],
      "zh_pron": "ngi´ liau ku"
    },
    {
      "raw_tags": [
        "客家語",
        "四縣話",
        "包括苗栗和美濃",
        "客家話拼音"
      ],
      "zh_pron": "ngi¹ liau⁴ ku⁴"
    },
    {
      "ipa": "/ɲi²⁴⁻¹¹ li̯au̯⁵⁵ kʰu⁵⁵/",
      "raw_tags": [
        "客家語",
        "四縣話",
        "包括苗栗和美濃"
      ],
      "tags": [
        "IPA"
      ]
    }
  ],
  "translations": [
    {
      "lang": "阿拉伯語",
      "lang_code": "ar",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "مَتْن"
    },
    {
      "lang": "阿拉伯語",
      "lang_code": "ar",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "مَكْنَز لُغَوِيّ"
    },
    {
      "lang": "白俄羅斯語",
      "lang_code": "be",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "白俄羅斯語",
      "lang_code": "be",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "збор"
    },
    {
      "lang": "保加利亞語",
      "lang_code": "bg",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "加泰羅尼亞語",
      "lang_code": "ca",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "捷克語",
      "lang_code": "cs",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "korpus"
    },
    {
      "lang": "丹麥語",
      "lang_code": "da",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "korpus"
    },
    {
      "lang": "荷蘭語",
      "lang_code": "nl",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "corpus"
    },
    {
      "lang": "英語",
      "lang_code": "en",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "corpus"
    },
    {
      "lang": "世界語",
      "lang_code": "eo",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "tekstaro"
    },
    {
      "lang": "世界語",
      "lang_code": "eo",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "korpuso"
    },
    {
      "lang": "愛沙尼亞語",
      "lang_code": "et",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "korpus"
    },
    {
      "lang": "芬蘭語",
      "lang_code": "fi",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "korpus"
    },
    {
      "lang": "法語",
      "lang_code": "fr",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "德語",
      "lang_code": "de",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "Korpus"
    },
    {
      "lang": "德語",
      "lang_code": "de",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "Textkorpus"
    },
    {
      "lang": "希臘語",
      "lang_code": "el",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "σώμα"
    },
    {
      "lang": "希臘語",
      "lang_code": "el",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "feminine"
      ],
      "word": "συλλογή"
    },
    {
      "lang": "匈牙利語",
      "lang_code": "hu",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "korpusz"
    },
    {
      "lang": "意大利語",
      "lang_code": "it",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "日語",
      "lang_code": "ja",
      "roman": "kōpasu",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "コーパス"
    },
    {
      "lang": "朝鮮語",
      "lang_code": "ko",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "말뭉치"
    },
    {
      "lang": "朝鮮語",
      "lang_code": "ko",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "코퍼스"
    },
    {
      "lang": "馬其頓語",
      "lang_code": "mk",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "毛利語",
      "lang_code": "mi",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "putunga kōrero"
    },
    {
      "lang": "毛利語",
      "lang_code": "mi",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "whakaputunga"
    },
    {
      "lang": "挪威語",
      "lang_code": "no",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "korpus"
    },
    {
      "lang": "葡萄牙語",
      "lang_code": "pt",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "俄語",
      "lang_code": "ru",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "俄語",
      "lang_code": "ru",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "neuter"
      ],
      "word": "собра́ние"
    },
    {
      "lang": "斯洛伐克語",
      "lang_code": "sk",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "korpus"
    },
    {
      "lang": "斯洛文尼亞語",
      "lang_code": "sl",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "korpus"
    },
    {
      "lang": "西班牙語",
      "lang_code": "es",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "corpus"
    },
    {
      "lang": "瑞典語",
      "lang_code": "sv",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "common"
      ],
      "word": "korpus"
    },
    {
      "lang": "瑞典語",
      "lang_code": "sv",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "common"
      ],
      "word": "språkbank"
    },
    {
      "lang": "土耳其語",
      "lang_code": "tr",
      "raw_tags": [
        "單一作者的所有作品"
      ],
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "word": "külliyat"
    },
    {
      "lang": "烏克蘭語",
      "lang_code": "uk",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "ко́рпус"
    },
    {
      "lang": "烏克蘭語",
      "lang_code": "uk",
      "sense": "經過整理、具有既定格式與標記的大量文本",
      "tags": [
        "masculine"
      ],
      "word": "збі́рник"
    }
  ],
  "word": "語料庫"
}

Download raw JSONL data for 語料庫 meaning in 漢語 (8.8kB)


This page is a part of the kaikki.org machine-readable 漢語 dictionary. This dictionary is based on structured data extracted on 2024-10-03 from the zhwiktionary dump dated 2024-10-02 using wiktextract (593e81e and 59b8406). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.