"UTF-8" meaning in English

See UTF-8 in All languages combined, or Wiktionary

Proper name

Forms: UTF8 [alternative]
Head templates: {{en-proper noun|head=UTF-8}} UTF-8
  1. (Unicode) Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (8-bit) code units per character, and the default encoding for numerous internet protocols and applications. Coordinate_terms: UTF-7, UTF-16, UTF-32
    Sense id: en-UTF-8-en-name-T-vd8pje Categories (other): English entries with incorrect language header, Pages with 1 entry, Pages with entries, Eight, Unicode

Alternative forms

{
  "forms": [
    {
      "form": "UTF8",
      "tags": [
        "alternative"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "head": "UTF-8"
      },
      "expansion": "UTF-8",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "name",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 1 entry",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "langcode": "en",
          "name": "Eight",
          "orig": "en:Eight",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "langcode": "en",
          "name": "Unicode",
          "orig": "en:Unicode",
          "parents": [],
          "source": "w"
        }
      ],
      "coordinate_terms": [
        {
          "word": "UTF-7"
        },
        {
          "word": "UTF-16"
        },
        {
          "word": "UTF-32"
        }
      ],
      "glosses": [
        "Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (8-bit) code units per character, and the default encoding for numerous internet protocols and applications."
      ],
      "id": "en-UTF-8-en-name-T-vd8pje",
      "links": [
        [
          "Unicode",
          "Unicode"
        ],
        [
          "character",
          "character"
        ],
        [
          "sequence",
          "sequence"
        ],
        [
          "byte",
          "byte"
        ],
        [
          "8-bit",
          "8-bit"
        ],
        [
          "code unit",
          "code unit"
        ]
      ],
      "qualifier": "Unicode",
      "raw_glosses": [
        "(Unicode) Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (8-bit) code units per character, and the default encoding for numerous internet protocols and applications."
      ]
    }
  ],
  "word": "UTF-8"
}
{
  "coordinate_terms": [
    {
      "word": "UTF-7"
    },
    {
      "word": "UTF-16"
    },
    {
      "word": "UTF-32"
    }
  ],
  "forms": [
    {
      "form": "UTF8",
      "tags": [
        "alternative"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "head": "UTF-8"
      },
      "expansion": "UTF-8",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "name",
  "senses": [
    {
      "categories": [
        "English entries with incorrect language header",
        "English lemmas",
        "English multiword terms",
        "English proper nouns",
        "English terms spelled with numbers",
        "English uncountable nouns",
        "Pages with 1 entry",
        "Pages with entries",
        "en:Eight",
        "en:Unicode"
      ],
      "glosses": [
        "Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (8-bit) code units per character, and the default encoding for numerous internet protocols and applications."
      ],
      "links": [
        [
          "Unicode",
          "Unicode"
        ],
        [
          "character",
          "character"
        ],
        [
          "sequence",
          "sequence"
        ],
        [
          "byte",
          "byte"
        ],
        [
          "8-bit",
          "8-bit"
        ],
        [
          "code unit",
          "code unit"
        ]
      ],
      "qualifier": "Unicode",
      "raw_glosses": [
        "(Unicode) Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (8-bit) code units per character, and the default encoding for numerous internet protocols and applications."
      ]
    }
  ],
  "word": "UTF-8"
}

Download raw JSONL data for UTF-8 meaning in English (1.3kB)


This page is a part of the kaikki.org machine-readable English dictionary. This dictionary is based on structured data extracted on 2026-01-13 from the enwiktionary dump dated 2026-01-01 using wiktextract (96027d6 and 9905b1f). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.