"UTF-8" meaning in All languages combined

See UTF-8 on Wiktionary

Noun [Conventions internationales]

  1. Format de codage de caractères.
    Sense id: fr-UTF-8-conv-noun-hi0~vAqO Categories (other): Lexique en conventions internationales de l’informatique Topics: computing
The following are not (yet) sense-disambiguated

Noun [Français]

IPA: \y.te.ɛf ɥit\
  1. Format de codage de caractères.
    Sense id: fr-UTF-8-fr-noun-hi0~vAqO Categories (other): Exemples en français, Lexique en français de l’informatique Topics: computing
The following are not (yet) sense-disambiguated
{
  "categories": [
    {
      "kind": "other",
      "name": "Noms scientifiques",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Conventions internationales",
      "orig": "conventions internationales",
      "parents": [],
      "source": "w"
    }
  ],
  "etymology_texts": [
    "Du U de UCS (Universal Character Set), du T de transformation, du F de format et de 8 de 8 bits."
  ],
  "lang": "Conventions internationales",
  "lang_code": "conv",
  "pos": "noun",
  "pos_title": "Nom scientifique",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Lexique en conventions internationales de l’informatique",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Format de codage de caractères."
      ],
      "id": "fr-UTF-8-conv-noun-hi0~vAqO",
      "topics": [
        "computing"
      ]
    }
  ],
  "tags": [
    "invariable"
  ],
  "word": "UTF-8"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Codages de caractères",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Lemmes en français",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Noms communs en français",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Français",
      "orig": "français",
      "parents": [],
      "source": "w"
    }
  ],
  "etymology_texts": [
    "Du U de UCS (Universal Character Set), du T de transformation, du F de format et de 8 de 8 bits."
  ],
  "lang": "Français",
  "lang_code": "fr",
  "pos": "noun",
  "pos_title": "Nom commun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Exemples en français",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Lexique en français de l’informatique",
          "parents": [],
          "source": "w"
        }
      ],
      "examples": [
        {
          "bold_text_offsets": [
            [
              53,
              58
            ]
          ],
          "ref": "site www.tuteurs.ens.fr",
          "text": "Mon éditeur n'a pas détecté que mon fichier était en UTF-8, comment le lui dire ?."
        }
      ],
      "glosses": [
        "Format de codage de caractères."
      ],
      "id": "fr-UTF-8-fr-noun-hi0~vAqO",
      "topics": [
        "computing"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "\\y.te.ɛf ɥit\\"
    }
  ],
  "tags": [
    "invariable"
  ],
  "word": "UTF-8"
}
{
  "categories": [
    "Noms scientifiques",
    "conventions internationales"
  ],
  "etymology_texts": [
    "Du U de UCS (Universal Character Set), du T de transformation, du F de format et de 8 de 8 bits."
  ],
  "lang": "Conventions internationales",
  "lang_code": "conv",
  "pos": "noun",
  "pos_title": "Nom scientifique",
  "senses": [
    {
      "categories": [
        "Lexique en conventions internationales de l’informatique"
      ],
      "glosses": [
        "Format de codage de caractères."
      ],
      "topics": [
        "computing"
      ]
    }
  ],
  "tags": [
    "invariable"
  ],
  "word": "UTF-8"
}

{
  "categories": [
    "Codages de caractères",
    "Lemmes en français",
    "Noms communs en français",
    "français"
  ],
  "etymology_texts": [
    "Du U de UCS (Universal Character Set), du T de transformation, du F de format et de 8 de 8 bits."
  ],
  "lang": "Français",
  "lang_code": "fr",
  "pos": "noun",
  "pos_title": "Nom commun",
  "senses": [
    {
      "categories": [
        "Exemples en français",
        "Lexique en français de l’informatique"
      ],
      "examples": [
        {
          "bold_text_offsets": [
            [
              53,
              58
            ]
          ],
          "ref": "site www.tuteurs.ens.fr",
          "text": "Mon éditeur n'a pas détecté que mon fichier était en UTF-8, comment le lui dire ?."
        }
      ],
      "glosses": [
        "Format de codage de caractères."
      ],
      "topics": [
        "computing"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "\\y.te.ɛf ɥit\\"
    }
  ],
  "tags": [
    "invariable"
  ],
  "word": "UTF-8"
}

Download raw JSONL data for UTF-8 meaning in All languages combined (1.2kB)


This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2025-06-17 from the frwiktionary dump dated 2025-06-01 using wiktextract (074e7de and f1c2b61). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.