"corpus linguistics" meaning in English

See corpus linguistics in All languages combined, or Wiktionary

Noun

Head templates: {{en-noun|-}} corpus linguistics (uncountable)
  1. (computing, linguistics) A branch of linguistics that studies large samples (corpora) of real-world text, usually with the aid of computer software. Wikipedia link: corpus linguistics Tags: uncountable Categories (topical): Computing, Linguistics Related terms: cotext, KWIC Translations (a branch of linguistics that studies large samples): korpuslinguistiek (Afrikaans), 語料庫語言學 (Chinese Mandarin), 语料库语言学 (yǔliàokù yǔyánxué) (Chinese Mandarin), korpustutkimus (Finnish), korpuslingvistiikka (Finnish), korpusznyelvészet (Hungarian), korpuslingvistik [common-gender] (Swedish)

Download JSON data for corpus linguistics meaning in English (3.1kB)

{
  "head_templates": [
    {
      "args": {
        "1": "-"
      },
      "expansion": "corpus linguistics (uncountable)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Mandarin terms with redundant transliterations",
          "parents": [
            "Terms with redundant transliterations",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Computing",
          "orig": "en:Computing",
          "parents": [
            "Technology",
            "All topics",
            "Fundamental"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Linguistics",
          "orig": "en:Linguistics",
          "parents": [
            "Language",
            "Social sciences",
            "Communication",
            "Sciences",
            "Society",
            "All topics",
            "Fundamental"
          ],
          "source": "w"
        }
      ],
      "examples": [
        {
          "ref": "2018, Clarence Green, James Lambert, “Position vectors, homologous chromosomes and gamma rays: Promoting disciplinary literacy through Secondary Phrase Lists”, in English for Specific Purposes, →DOI, page 2",
          "text": "ESP, using the tools of corpus linguistics, has advanced the methodologies for investigating discipline-specific language, yet there has been little cross-fertilization so far with disciplinary literacy in secondary education.",
          "type": "quotation"
        }
      ],
      "glosses": [
        "A branch of linguistics that studies large samples (corpora) of real-world text, usually with the aid of computer software."
      ],
      "id": "en-corpus_linguistics-en-noun-frNw-xcN",
      "links": [
        [
          "computing",
          "computing#Noun"
        ],
        [
          "linguistics",
          "linguistics"
        ],
        [
          "corpora",
          "corpus"
        ],
        [
          "computer",
          "computer"
        ],
        [
          "software",
          "software"
        ]
      ],
      "raw_glosses": [
        "(computing, linguistics) A branch of linguistics that studies large samples (corpora) of real-world text, usually with the aid of computer software."
      ],
      "related": [
        {
          "word": "cotext"
        },
        {
          "word": "KWIC"
        }
      ],
      "tags": [
        "uncountable"
      ],
      "topics": [
        "computing",
        "engineering",
        "human-sciences",
        "linguistics",
        "mathematics",
        "natural-sciences",
        "physical-sciences",
        "sciences"
      ],
      "translations": [
        {
          "code": "af",
          "lang": "Afrikaans",
          "sense": "a branch of linguistics that studies large samples",
          "word": "korpuslinguistiek"
        },
        {
          "code": "cmn",
          "lang": "Chinese Mandarin",
          "sense": "a branch of linguistics that studies large samples",
          "word": "語料庫語言學"
        },
        {
          "code": "cmn",
          "lang": "Chinese Mandarin",
          "roman": "yǔliàokù yǔyánxué",
          "sense": "a branch of linguistics that studies large samples",
          "word": "语料库语言学"
        },
        {
          "code": "fi",
          "lang": "Finnish",
          "sense": "a branch of linguistics that studies large samples",
          "word": "korpustutkimus"
        },
        {
          "code": "fi",
          "lang": "Finnish",
          "sense": "a branch of linguistics that studies large samples",
          "word": "korpuslingvistiikka"
        },
        {
          "code": "hu",
          "lang": "Hungarian",
          "sense": "a branch of linguistics that studies large samples",
          "word": "korpusznyelvészet"
        },
        {
          "code": "sv",
          "lang": "Swedish",
          "sense": "a branch of linguistics that studies large samples",
          "tags": [
            "common-gender"
          ],
          "word": "korpuslingvistik"
        }
      ],
      "wikipedia": [
        "corpus linguistics"
      ]
    }
  ],
  "word": "corpus linguistics"
}
{
  "head_templates": [
    {
      "args": {
        "1": "-"
      },
      "expansion": "corpus linguistics (uncountable)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "related": [
    {
      "word": "cotext"
    },
    {
      "word": "KWIC"
    }
  ],
  "senses": [
    {
      "categories": [
        "English entries with incorrect language header",
        "English lemmas",
        "English multiword terms",
        "English nouns",
        "English terms with quotations",
        "English uncountable nouns",
        "Mandarin terms with redundant transliterations",
        "en:Computing",
        "en:Linguistics"
      ],
      "examples": [
        {
          "ref": "2018, Clarence Green, James Lambert, “Position vectors, homologous chromosomes and gamma rays: Promoting disciplinary literacy through Secondary Phrase Lists”, in English for Specific Purposes, →DOI, page 2",
          "text": "ESP, using the tools of corpus linguistics, has advanced the methodologies for investigating discipline-specific language, yet there has been little cross-fertilization so far with disciplinary literacy in secondary education.",
          "type": "quotation"
        }
      ],
      "glosses": [
        "A branch of linguistics that studies large samples (corpora) of real-world text, usually with the aid of computer software."
      ],
      "links": [
        [
          "computing",
          "computing#Noun"
        ],
        [
          "linguistics",
          "linguistics"
        ],
        [
          "corpora",
          "corpus"
        ],
        [
          "computer",
          "computer"
        ],
        [
          "software",
          "software"
        ]
      ],
      "raw_glosses": [
        "(computing, linguistics) A branch of linguistics that studies large samples (corpora) of real-world text, usually with the aid of computer software."
      ],
      "tags": [
        "uncountable"
      ],
      "topics": [
        "computing",
        "engineering",
        "human-sciences",
        "linguistics",
        "mathematics",
        "natural-sciences",
        "physical-sciences",
        "sciences"
      ],
      "wikipedia": [
        "corpus linguistics"
      ]
    }
  ],
  "translations": [
    {
      "code": "af",
      "lang": "Afrikaans",
      "sense": "a branch of linguistics that studies large samples",
      "word": "korpuslinguistiek"
    },
    {
      "code": "cmn",
      "lang": "Chinese Mandarin",
      "sense": "a branch of linguistics that studies large samples",
      "word": "語料庫語言學"
    },
    {
      "code": "cmn",
      "lang": "Chinese Mandarin",
      "roman": "yǔliàokù yǔyánxué",
      "sense": "a branch of linguistics that studies large samples",
      "word": "语料库语言学"
    },
    {
      "code": "fi",
      "lang": "Finnish",
      "sense": "a branch of linguistics that studies large samples",
      "word": "korpustutkimus"
    },
    {
      "code": "fi",
      "lang": "Finnish",
      "sense": "a branch of linguistics that studies large samples",
      "word": "korpuslingvistiikka"
    },
    {
      "code": "hu",
      "lang": "Hungarian",
      "sense": "a branch of linguistics that studies large samples",
      "word": "korpusznyelvészet"
    },
    {
      "code": "sv",
      "lang": "Swedish",
      "sense": "a branch of linguistics that studies large samples",
      "tags": [
        "common-gender"
      ],
      "word": "korpuslingvistik"
    }
  ],
  "word": "corpus linguistics"
}

This page is a part of the kaikki.org machine-readable English dictionary. This dictionary is based on structured data extracted on 2024-05-03 from the enwiktionary dump dated 2024-05-02 using wiktextract (f4fd8c9 and c9440ce). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.