"tokenize" meaning in All languages combined

See tokenize on Wiktionary

Verb [Anglais]

Forms: to tokenize [infinitive], tokenizes [present, third-person, singular], tokenized [preterite], tokenized [participle, past], tokenizing [participle, present]
  1. Parser un texte composé de tokens.
    Sense id: fr-tokenize-en-verb-BInBBwcH Categories (other): Lexique en anglais de l’informatique Topics: computing
The following are not (yet) sense-disambiguated
Related terms: tokenise, tokenization, tokenizer

Inflected forms

Download JSONL data for tokenize meaning in All languages combined (1.4kB)

{
  "categories": [
    {
      "kind": "other",
      "name": "Lemmes en anglais",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Verbes en anglais",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Anglais",
      "orig": "anglais",
      "parents": [],
      "source": "w"
    }
  ],
  "etymology_texts": [
    "Dérivé de token, avec le suffixe -ize."
  ],
  "forms": [
    {
      "form": "to tokenize",
      "ipas": [
        "\\toʊ.kə.naɪz\\"
      ],
      "tags": [
        "infinitive"
      ]
    },
    {
      "form": "tokenizes",
      "ipas": [
        "\\toʊ.kə.naɪz.ɪz\\"
      ],
      "tags": [
        "present",
        "third-person",
        "singular"
      ]
    },
    {
      "form": "tokenized",
      "ipas": [
        "\\toʊ.kə.naɪzd\\"
      ],
      "tags": [
        "preterite"
      ]
    },
    {
      "form": "tokenized",
      "ipas": [
        "\\toʊ.kə.naɪzd\\"
      ],
      "tags": [
        "participle",
        "past"
      ]
    },
    {
      "form": "tokenizing",
      "ipas": [
        "\\toʊ.kə.naɪz.ɪŋ\\"
      ],
      "tags": [
        "participle",
        "present"
      ]
    }
  ],
  "lang": "Anglais",
  "lang_code": "en",
  "pos": "verb",
  "pos_id": "en-verb-1",
  "pos_title": "Verbe",
  "related": [
    {
      "word": "tokenise"
    },
    {
      "word": "tokenization"
    },
    {
      "word": "tokenizer"
    }
  ],
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Lexique en anglais de l’informatique",
          "parents": [],
          "source": "w"
        }
      ],
      "examples": [
        {
          "ref": "Ben Stephenson, The Python Workbook (2nde édition),Springer, 2019, p. 90",
          "text": "Tokenizing is the process of converting a string into a list of substrings, known as tokens."
        }
      ],
      "glosses": [
        "Parser un texte composé de tokens."
      ],
      "id": "fr-tokenize-en-verb-BInBBwcH",
      "topics": [
        "computing"
      ]
    }
  ],
  "word": "tokenize"
}
{
  "categories": [
    "Lemmes en anglais",
    "Verbes en anglais",
    "anglais"
  ],
  "etymology_texts": [
    "Dérivé de token, avec le suffixe -ize."
  ],
  "forms": [
    {
      "form": "to tokenize",
      "ipas": [
        "\\toʊ.kə.naɪz\\"
      ],
      "tags": [
        "infinitive"
      ]
    },
    {
      "form": "tokenizes",
      "ipas": [
        "\\toʊ.kə.naɪz.ɪz\\"
      ],
      "tags": [
        "present",
        "third-person",
        "singular"
      ]
    },
    {
      "form": "tokenized",
      "ipas": [
        "\\toʊ.kə.naɪzd\\"
      ],
      "tags": [
        "preterite"
      ]
    },
    {
      "form": "tokenized",
      "ipas": [
        "\\toʊ.kə.naɪzd\\"
      ],
      "tags": [
        "participle",
        "past"
      ]
    },
    {
      "form": "tokenizing",
      "ipas": [
        "\\toʊ.kə.naɪz.ɪŋ\\"
      ],
      "tags": [
        "participle",
        "present"
      ]
    }
  ],
  "lang": "Anglais",
  "lang_code": "en",
  "pos": "verb",
  "pos_id": "en-verb-1",
  "pos_title": "Verbe",
  "related": [
    {
      "word": "tokenise"
    },
    {
      "word": "tokenization"
    },
    {
      "word": "tokenizer"
    }
  ],
  "senses": [
    {
      "categories": [
        "Lexique en anglais de l’informatique"
      ],
      "examples": [
        {
          "ref": "Ben Stephenson, The Python Workbook (2nde édition),Springer, 2019, p. 90",
          "text": "Tokenizing is the process of converting a string into a list of substrings, known as tokens."
        }
      ],
      "glosses": [
        "Parser un texte composé de tokens."
      ],
      "topics": [
        "computing"
      ]
    }
  ],
  "word": "tokenize"
}

This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2024-07-01 from the frwiktionary dump dated 2024-06-20 using wiktextract (e79c026 and b863ecc). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.