"tokenization" meaning in English


Noun

Forms: tokenizations [plural]
Etymology: token + -ization
Etymology templates: {{suffix|en|token|ization}} token + -ization
Head templates: {{en-noun|~}} tokenization (countable and uncountable, plural tokenizations)
  1. The act or process of tokenizing. Tags: countable, uncountable
    Sense id: en-tokenization-en-noun-Haufhfbs
  2. Something tokenized. Tags: countable, uncountable
    Sense id: en-tokenization-en-noun-PZEdrkEY
    Categories (other): English entries with incorrect language header, English terms suffixed with -ization
    Disambiguation of English entries with incorrect language header: 40 60
    Disambiguation of English terms suffixed with -ization: 35 65

Download JSON data for tokenization meaning in English (1.2kB)

{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "token",
        "3": "ization"
      },
      "expansion": "token + -ization",
      "name": "suffix"
    }
  ],
  "etymology_text": "token + -ization",
  "forms": [
    {
      "form": "tokenizations",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "~"
      },
      "expansion": "tokenization (countable and uncountable, plural tokenizations)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "The act or process of tokenizing."
      ],
      "id": "en-tokenization-en-noun-Haufhfbs",
      "links": [
        [
          "tokenizing",
          "tokenize"
        ]
      ],
      "tags": [
        "countable",
        "uncountable"
      ]
    },
    {
      "categories": [
        {
          "_dis": "40 60",
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "35 65",
          "kind": "other",
          "name": "English terms suffixed with -ization",
          "parents": [],
          "source": "w+disamb"
        }
      ],
      "examples": [
        {
          "text": "This was an unlikely tokenization of the input string."
        }
      ],
      "glosses": [
        "Something tokenized."
      ],
      "id": "en-tokenization-en-noun-PZEdrkEY",
      "tags": [
        "countable",
        "uncountable"
      ]
    }
  ],
  "wikipedia": [
    "tokenization"
  ],
  "word": "tokenization"
}
{
  "categories": [
    "English countable nouns",
    "English entries with incorrect language header",
    "English lemmas",
    "English nouns",
    "English terms suffixed with -ization",
    "English uncountable nouns"
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "token",
        "3": "ization"
      },
      "expansion": "token + -ization",
      "name": "suffix"
    }
  ],
  "etymology_text": "token + -ization",
  "forms": [
    {
      "form": "tokenizations",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "~"
      },
      "expansion": "tokenization (countable and uncountable, plural tokenizations)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "glosses": [
        "The act or process of tokenizing."
      ],
      "links": [
        [
          "tokenizing",
          "tokenize"
        ]
      ],
      "tags": [
        "countable",
        "uncountable"
      ]
    },
    {
      "examples": [
        {
          "text": "This was an unlikely tokenization of the input string."
        }
      ],
      "glosses": [
        "Something tokenized."
      ],
      "tags": [
        "countable",
        "uncountable"
      ]
    }
  ],
  "wikipedia": [
    "tokenization"
  ],
  "word": "tokenization"
}
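
As a rough illustration of how an entry shaped like the JSON above might be read, the following Python sketch loads a trimmed copy of the record and lists each sense with its glosses and example sentences. The helper name summarize_entry and the trimmed ENTRY_JSON string are assumptions made for this example only; they are not part of kaikki.org or wiktextract, and fields such as "examples" are treated as optional because not every sense carries them.

```python
import json

# Trimmed copy of the entry shown above; only the fields used below are kept.
ENTRY_JSON = """
{
  "word": "tokenization",
  "pos": "noun",
  "etymology_text": "token + -ization",
  "forms": [{"form": "tokenizations", "tags": ["plural"]}],
  "senses": [
    {"glosses": ["The act or process of tokenizing."],
     "tags": ["countable", "uncountable"]},
    {"glosses": ["Something tokenized."],
     "examples": [{"text": "This was an unlikely tokenization of the input string."}],
     "tags": ["countable", "uncountable"]}
  ]
}
"""


def summarize_entry(entry):
    """Return (word, pos, senses), where senses is a list of (gloss, examples).

    Hypothetical helper: missing "senses", "glosses", or "examples" keys are
    treated as empty rather than as errors.
    """
    senses = []
    for sense in entry.get("senses", []):
        gloss = "; ".join(sense.get("glosses", []))
        examples = [ex["text"] for ex in sense.get("examples", []) if "text" in ex]
        senses.append((gloss, examples))
    return entry["word"], entry["pos"], senses


entry = json.loads(ENTRY_JSON)
word, pos, senses = summarize_entry(entry)

print(f"{word} ({pos})")
for number, (gloss, examples) in enumerate(senses, start=1):
    print(f"  {number}. {gloss}")
    for text in examples:
        print(f"     e.g. {text}")
```

Reading optional keys with dict.get keeps the sketch tolerant of senses that lack examples, as the first sense of this entry does.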

This page is a part of the kaikki.org machine-readable English dictionary. This dictionary is based on structured data extracted on 2024-05-03 from the enwiktionary dump dated 2024-05-02 using wiktextract (f4fd8c9 and c9440ce). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.