"Heaps' law" meaning in All languages combined

See Heaps' law on Wiktionary

Noun [English]

Etymology: Named after Harold Stanley Heaps, but was originally discovered by Gustav Herdan. Head templates: {{en-proper noun|head=Heaps' law}} Heaps' law
  1. (linguistics) An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words. Wikipedia link: Heaps' law Categories (topical): Computational linguistics, Linguistics

Download JSON data for Heaps' law meaning in All languages combined (2.2kB)

{
  "etymology_text": "Named after Harold Stanley Heaps, but was originally discovered by Gustav Herdan.",
  "head_templates": [
    {
      "args": {
        "head": "Heaps' law"
      },
      "expansion": "Heaps' law",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English entries with language name categories using raw markup",
          "parents": [
            "Entries with language name categories using raw markup",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English entries with topic categories using raw markup",
          "parents": [
            "Entries with topic categories using raw markup",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English terms with non-redundant non-automated sortkeys",
          "parents": [
            "Terms with non-redundant non-automated sortkeys",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Computational linguistics",
          "orig": "en:Computational linguistics",
          "parents": [
            "Computer science",
            "Linguistics",
            "Computing",
            "Sciences",
            "Language",
            "Social sciences",
            "Technology",
            "All topics",
            "Communication",
            "Society",
            "Fundamental"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Linguistics",
          "orig": "en:Linguistics",
          "parents": [
            "Language",
            "Social sciences",
            "Communication",
            "Sciences",
            "Society",
            "All topics",
            "Fundamental"
          ],
          "source": "w"
        }
      ],
      "glosses": [
        "An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words."
      ],
      "id": "en-Heaps'_law-en-noun-iEAu0aL3",
      "links": [
        [
          "linguistics",
          "linguistics"
        ],
        [
          "empirical",
          "empirical"
        ],
        [
          "correlation",
          "correlation"
        ],
        [
          "length",
          "length"
        ],
        [
          "document",
          "document"
        ],
        [
          "word",
          "word"
        ]
      ],
      "raw_glosses": [
        "(linguistics) An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words."
      ],
      "topics": [
        "human-sciences",
        "linguistics",
        "sciences"
      ],
      "wikipedia": [
        "Heaps' law"
      ]
    }
  ],
  "word": "Heaps' law"
}
{
  "etymology_text": "Named after Harold Stanley Heaps, but was originally discovered by Gustav Herdan.",
  "head_templates": [
    {
      "args": {
        "head": "Heaps' law"
      },
      "expansion": "Heaps' law",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "English entries with incorrect language header",
        "English entries with language name categories using raw markup",
        "English entries with topic categories using raw markup",
        "English eponyms",
        "English lemmas",
        "English multiword terms",
        "English proper nouns",
        "English terms with non-redundant non-automated sortkeys",
        "English uncountable nouns",
        "en:Computational linguistics",
        "en:Linguistics"
      ],
      "glosses": [
        "An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words."
      ],
      "links": [
        [
          "linguistics",
          "linguistics"
        ],
        [
          "empirical",
          "empirical"
        ],
        [
          "correlation",
          "correlation"
        ],
        [
          "length",
          "length"
        ],
        [
          "document",
          "document"
        ],
        [
          "word",
          "word"
        ]
      ],
      "raw_glosses": [
        "(linguistics) An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words."
      ],
      "topics": [
        "human-sciences",
        "linguistics",
        "sciences"
      ],
      "wikipedia": [
        "Heaps' law"
      ]
    }
  ],
  "word": "Heaps' law"
}

This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2024-05-10 from the enwiktionary dump dated 2024-05-02 using wiktextract (a644e18 and edd475d). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.