"UTF-8" meaning in English


Proper name

Head templates: {{en-proper noun}} UTF-8
  1. (computing) Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (eight-bit) code units per character.
     Categories (topical): Computing, Eight
     Synonyms: UTF8
     Coordinate terms: UTF-7, UTF-16, UTF-32
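
The "one to four one-byte code units per character" part of the definition can be checked directly. The following is a minimal sketch using Python's built-in `str.encode`; the four sample characters and the helper name `utf8_length` are chosen here for illustration and do not come from the entry.

```python
# Sample code points picked for illustration (not from the entry):
# U+0041 LATIN CAPITAL LETTER A, U+00E9 e with acute,
# U+20AC EURO SIGN, U+1F600 GRINNING FACE.
samples = ["\u0041", "\u00e9", "\u20ac", "\U0001F600"]


def utf8_length(ch: str) -> int:
    """Return how many one-byte code units UTF-8 uses for one character."""
    return len(ch.encode("utf-8"))


for ch in samples:
    encoded = ch.encode("utf-8")
    # bytes.hex(sep) requires Python 3.8 or later.
    print(f"U+{ord(ch):04X}: {utf8_length(ch)} byte(s) -> {encoded.hex(' ')}")
```

Under these assumptions the four samples take one, two, three, and four bytes respectively, which covers the whole range named in the definition.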

{
  "head_templates": [
    {
      "args": {},
      "expansion": "UTF-8",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "name",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 1 entry",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Computing",
          "orig": "en:Computing",
          "parents": [
            "Technology",
            "All topics",
            "Fundamental"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Eight",
          "orig": "en:Eight",
          "parents": [
            "Numbers",
            "All topics",
            "Terms by semantic function",
            "Fundamental"
          ],
          "source": "w"
        }
      ],
      "coordinate_terms": [
        {
          "word": "UTF-7"
        },
        {
          "word": "UTF-16"
        },
        {
          "word": "UTF-32"
        }
      ],
      "glosses": [
        "Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (eight-bit) code units per character."
      ],
      "id": "en-UTF-8-en-name-iKryYBBp",
      "links": [
        [
          "computing",
          "computing#Noun"
        ],
        [
          "Unicode",
          "Unicode"
        ],
        [
          "character",
          "character"
        ],
        [
          "sequence",
          "sequence"
        ],
        [
          "byte",
          "byte"
        ],
        [
          "bit",
          "bit"
        ],
        [
          "code unit",
          "code unit"
        ]
      ],
      "raw_glosses": [
        "(computing) Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (eight-bit) code units per character."
      ],
      "synonyms": [
        {
          "word": "UTF8"
        }
      ],
      "topics": [
        "computing",
        "engineering",
        "mathematics",
        "natural-sciences",
        "physical-sciences",
        "sciences"
      ]
    }
  ],
  "word": "UTF-8"
}
{
  "coordinate_terms": [
    {
      "word": "UTF-7"
    },
    {
      "word": "UTF-16"
    },
    {
      "word": "UTF-32"
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "UTF-8",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "name",
  "senses": [
    {
      "categories": [
        "English entries with incorrect language header",
        "English lemmas",
        "English multiword terms",
        "English proper nouns",
        "English terms spelled with numbers",
        "English uncountable nouns",
        "Pages with 1 entry",
        "Pages with entries",
        "en:Computing",
        "en:Eight"
      ],
      "glosses": [
        "Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (eight-bit) code units per character."
      ],
      "links": [
        [
          "computing",
          "computing#Noun"
        ],
        [
          "Unicode",
          "Unicode"
        ],
        [
          "character",
          "character"
        ],
        [
          "sequence",
          "sequence"
        ],
        [
          "byte",
          "byte"
        ],
        [
          "bit",
          "bit"
        ],
        [
          "code unit",
          "code unit"
        ]
      ],
      "raw_glosses": [
        "(computing) Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (eight-bit) code units per character."
      ],
      "topics": [
        "computing",
        "engineering",
        "mathematics",
        "natural-sciences",
        "physical-sciences",
        "sciences"
      ]
    }
  ],
  "synonyms": [
    {
      "word": "UTF8"
    }
  ],
  "word": "UTF-8"
}
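
Records like the ones above are distributed as JSONL, that is, one JSON object per line. The following is a rough sketch of reading such data with Python's standard `json` module. It assumes the layout of the second record above, where `synonyms` and `coordinate_terms` sit at the top level; the function name `summarize` and the trimmed sample record are illustrative, not part of wiktextract.

```python
import json


def summarize(jsonl_text: str) -> list[dict]:
    """Extract a few fields from each non-empty JSONL line."""
    entries = []
    for line in jsonl_text.splitlines():
        if not line.strip():
            continue  # tolerate blank lines between records
        record = json.loads(line)
        entries.append({
            "word": record["word"],
            "pos": record.get("pos"),
            "synonyms": [s["word"] for s in record.get("synonyms", [])],
            "coordinate_terms": [
                c["word"] for c in record.get("coordinate_terms", [])
            ],
            "glosses": [
                gloss
                for sense in record.get("senses", [])
                for gloss in sense.get("glosses", [])
            ],
        })
    return entries


# A trimmed copy of the record shown above, serialized onto a single line.
sample = json.dumps({
    "word": "UTF-8",
    "pos": "name",
    "lang_code": "en",
    "coordinate_terms": [{"word": "UTF-7"}, {"word": "UTF-16"}, {"word": "UTF-32"}],
    "synonyms": [{"word": "UTF8"}],
    "senses": [{"glosses": [
        "Unicode Transformation Format-8, a variable-width encoding scheme "
        "for Unicode characters, using sequences of one to four one-byte "
        "(eight-bit) code units per character."
    ]}],
})

for entry in summarize(sample):
    print(entry["word"], entry["pos"], entry["synonyms"], entry["coordinate_terms"])
```

The `.get(..., [])` defaults are a deliberate choice: in the first record above the synonyms and coordinate terms are nested inside the sense rather than at the top level, so a reader of that form would need to look under `senses` instead.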

This page is a part of the kaikki.org machine-readable English dictionary. This dictionary is based on structured data extracted on 2024-11-06 from the enwiktionary dump dated 2024-10-02 using wiktextract (fbeafe8 and 7f03c9b). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.