"Heaps' law" meaning in All languages combined

Noun [English]

Etymology: Named after Harold Stanley Heaps, although the law was originally discovered by Gustav Herdan.
Head templates: {{en-proper noun|head=Heaps' law}} Heaps' law
  1. (linguistics) An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words.
    Wikipedia link: Heaps' law
    Categories (topical): Computational linguistics, Linguistics
    Sense id: en-Heaps'_law-en-noun-iEAu0aL3
    Categories (other): English entries with incorrect language header, Pages with 1 entry, Pages with entries
    Topics: human-sciences, linguistics, sciences
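
The gloss above does not state a formula. The form in which the law is commonly cited is V(n) = K * n^beta, where n is the number of word tokens read so far, V(n) is the number of distinct words among them, and K and beta are constants fitted to the corpus. The following is a minimal sketch under that assumption: the function names `vocabulary_growth` and `fit_heaps` and the simple regular-expression tokenizer are illustrative choices, not part of the dictionary entry.

```python
import math
import re


def vocabulary_growth(text):
    """Return (n, V) pairs: tokens read so far and distinct tokens so far."""
    seen = set()
    points = []
    # Assumption: a token is a run of lowercase letters or apostrophes.
    for n, token in enumerate(re.findall(r"[a-z']+", text.lower()), start=1):
        seen.add(token)
        points.append((n, len(seen)))
    return points


def fit_heaps(points):
    """Fit log V = log K + beta * log n by least squares; return (K, beta)."""
    xs = [math.log(n) for n, _ in points]
    ys = [math.log(v) for _, v in points]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = sxy / sxx
    K = math.exp(mean_y - beta * mean_x)
    return K, beta


points = vocabulary_growth("the cat saw the dog and the dog saw the cat")
print(points[-1])  # (11, 5): 11 tokens, 5 distinct words
```

Fitting in log-log space keeps the sketch dependency-free; a text this short is only a demonstration of the mechanics, and a meaningful estimate of K and beta would need a much larger corpus.
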
{
  "etymology_text": "Named after Harold Stanley Heaps, but was originally discovered by Gustav Herdan.",
  "head_templates": [
    {
      "args": {
        "head": "Heaps' law"
      },
      "expansion": "Heaps' law",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 1 entry",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Computational linguistics",
          "orig": "en:Computational linguistics",
          "parents": [
            "Computer science",
            "Linguistics",
            "Computing",
            "Sciences",
            "Language",
            "Social sciences",
            "Technology",
            "All topics",
            "Communication",
            "Society",
            "Fundamental"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Linguistics",
          "orig": "en:Linguistics",
          "parents": [
            "Language",
            "Social sciences",
            "Communication",
            "Sciences",
            "Society",
            "All topics",
            "Fundamental"
          ],
          "source": "w"
        }
      ],
      "glosses": [
        "An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words."
      ],
      "id": "en-Heaps'_law-en-noun-iEAu0aL3",
      "links": [
        [
          "linguistics",
          "linguistics"
        ],
        [
          "empirical",
          "empirical"
        ],
        [
          "correlation",
          "correlation"
        ],
        [
          "length",
          "length"
        ],
        [
          "document",
          "document"
        ],
        [
          "word",
          "word"
        ]
      ],
      "raw_glosses": [
        "(linguistics) An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words."
      ],
      "topics": [
        "human-sciences",
        "linguistics",
        "sciences"
      ],
      "wikipedia": [
        "Heaps' law"
      ]
    }
  ],
  "word": "Heaps' law"
}
{
  "etymology_text": "Named after Harold Stanley Heaps, but was originally discovered by Gustav Herdan.",
  "head_templates": [
    {
      "args": {
        "head": "Heaps' law"
      },
      "expansion": "Heaps' law",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "English entries with incorrect language header",
        "English eponyms",
        "English lemmas",
        "English multiword terms",
        "English proper nouns",
        "English uncountable nouns",
        "Pages with 1 entry",
        "Pages with entries",
        "en:Computational linguistics",
        "en:Linguistics"
      ],
      "glosses": [
        "An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words."
      ],
      "links": [
        [
          "linguistics",
          "linguistics"
        ],
        [
          "empirical",
          "empirical"
        ],
        [
          "correlation",
          "correlation"
        ],
        [
          "length",
          "length"
        ],
        [
          "document",
          "document"
        ],
        [
          "word",
          "word"
        ]
      ],
      "raw_glosses": [
        "(linguistics) An empirical law that expresses the correlation between the length of a document or set of documents and the corresponding number of distinct words."
      ],
      "topics": [
        "human-sciences",
        "linguistics",
        "sciences"
      ],
      "wikipedia": [
        "Heaps' law"
      ]
    }
  ],
  "word": "Heaps' law"
}
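
Records like the two above can be read with any JSON parser. The sketch below assumes the data arrives as JSON Lines, that is, one complete record per line rather than the pretty-printed layout shown here, and that each record carries the `word`, `senses` and `glosses` fields seen above. The helper name `glosses_by_word` is hypothetical.

```python
import json


def glosses_by_word(lines):
    """Map each entry's "word" to the list of glosses found in its senses."""
    result = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        entry = json.loads(line)
        for sense in entry.get("senses", []):
            result.setdefault(entry["word"], []).extend(sense.get("glosses", []))
    return result


sample = json.dumps({
    "word": "Heaps' law",
    "senses": [{"glosses": ["An empirical law."]}],  # shortened gloss for illustration
})
print(glosses_by_word([sample]))
```

Any iterable of strings works as input, so an open file object can be passed directly in place of the list.
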

This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2024-11-06 from the enwiktionary dump dated 2024-10-02 using wiktextract (fbeafe8 and 7f03c9b). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.