"subcorpus" meaning in English

See subcorpus in All languages combined, or Wiktionary

Noun

Forms: subcorpora [plural]
Etymology: sub- + corpus Etymology templates: {{prefix|en|sub|corpus}} sub- + corpus Head templates: {{en-noun|subcorpora}} subcorpus (plural subcorpora)
  1. A subset of a corpus.
    Sense id: en-subcorpus-en-noun-JgUU8WLj Categories (other): English entries with incorrect language header, English terms prefixed with sub-

Inflected forms

Download JSON data for subcorpus meaning in English (1.3kB)

{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "sub",
        "3": "corpus"
      },
      "expansion": "sub- + corpus",
      "name": "prefix"
    }
  ],
  "etymology_text": "sub- + corpus",
  "forms": [
    {
      "form": "subcorpora",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "subcorpora"
      },
      "expansion": "subcorpus (plural subcorpora)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English terms prefixed with sub-",
          "parents": [],
          "source": "w"
        }
      ],
      "examples": [
        {
          "ref": "2018, Clarence Green, James Lambert, “Advancing disciplinary literacy through English for academic purposes: Discipline-specific wordlists, collocations and word families for eight secondary subjects”, in Journal of English for Academic Purposes, volume 35, →DOI, page 110",
          "text": "Thus the word react occurs 2331 times in the Chemistry subcorpus, reacts occurs 2195 times, etc., and adding all members together, the REACT family occurs 27,991 times throughout Chemistry.",
          "type": "quotation"
        }
      ],
      "glosses": [
        "A subset of a corpus."
      ],
      "id": "en-subcorpus-en-noun-JgUU8WLj",
      "links": [
        [
          "subset",
          "subset"
        ],
        [
          "corpus",
          "corpus"
        ]
      ]
    }
  ],
  "word": "subcorpus"
}
{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "sub",
        "3": "corpus"
      },
      "expansion": "sub- + corpus",
      "name": "prefix"
    }
  ],
  "etymology_text": "sub- + corpus",
  "forms": [
    {
      "form": "subcorpora",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "subcorpora"
      },
      "expansion": "subcorpus (plural subcorpora)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "English countable nouns",
        "English entries with incorrect language header",
        "English lemmas",
        "English nouns",
        "English nouns with irregular plurals",
        "English terms prefixed with sub-",
        "English terms with quotations"
      ],
      "examples": [
        {
          "ref": "2018, Clarence Green, James Lambert, “Advancing disciplinary literacy through English for academic purposes: Discipline-specific wordlists, collocations and word families for eight secondary subjects”, in Journal of English for Academic Purposes, volume 35, →DOI, page 110",
          "text": "Thus the word react occurs 2331 times in the Chemistry subcorpus, reacts occurs 2195 times, etc., and adding all members together, the REACT family occurs 27,991 times throughout Chemistry.",
          "type": "quotation"
        }
      ],
      "glosses": [
        "A subset of a corpus."
      ],
      "links": [
        [
          "subset",
          "subset"
        ],
        [
          "corpus",
          "corpus"
        ]
      ]
    }
  ],
  "word": "subcorpus"
}

This page is a part of the kaikki.org machine-readable English dictionary. This dictionary is based on structured data extracted on 2024-06-04 from the enwiktionary dump dated 2024-05-02 using wiktextract (e9e0a99 and db5a844). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.