"w-shingling" meaning in All languages combined

See w-shingling on Wiktionary

Noun [English]

Forms: w-shinglings [plural]
Etymology: w denotes the number of tokens in each shingle in the set.
Head templates: {{en-noun}} w-shingling (plural w-shinglings)
  1. (computing) In natural language processing, a set of unique "shingles" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents.
     Wikipedia link: w-shingling
     Categories (topical): Computing
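
The definition above can be illustrated with a minimal Python sketch. It assumes whitespace tokenization and uses the Jaccard coefficient as the similarity measure. The entry names neither, so both are illustrative choices, and the function names `w_shingling` and `jaccard` are hypothetical.

```python
def w_shingling(tokens, w):
    """Return the set of unique contiguous w-token subsequences (shingles)."""
    if w <= 0:
        raise ValueError("w must be a positive integer")
    # Slide a window of width w over the tokens; the set drops repeated shingles.
    return {tuple(tokens[i:i + w]) for i in range(len(tokens) - w + 1)}


def jaccard(a, b):
    """Jaccard coefficient of two sets (an assumed similarity measure)."""
    if not a and not b:
        return 1.0  # convention chosen here: two empty sets count as identical
    return len(a & b) / len(a | b)


# Assumption: tokens are whitespace-separated words.
doc_a = "a rose is a rose is a rose".split()
doc_b = "a rose is a flower which is a rose".split()

shingles_a = w_shingling(doc_a, 4)
shingles_b = w_shingling(doc_b, 4)
similarity = jaccard(shingles_a, shingles_b)
```

With w = 4, the first document has five windows but only three unique shingles, which is why the definition speaks of a set. The two documents share a single shingle, so the coefficient in this sketch is low.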

{
  "etymology_text": "w denotes the number of tokens in each shingle in the set.",
  "forms": [
    {
      "form": "w-shinglings",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "w-shingling (plural w-shinglings)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 1 entry",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Computing",
          "orig": "en:Computing",
          "parents": [
            "Technology",
            "All topics",
            "Fundamental"
          ],
          "source": "w"
        }
      ],
      "glosses": [
        "In natural language processing, a set of unique \"shingles\" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents."
      ],
      "id": "en-w-shingling-en-noun-JKWu6agl",
      "links": [
        [
          "computing",
          "computing#Noun"
        ],
        [
          "natural language processing",
          "natural language processing"
        ],
        [
          "set",
          "set"
        ],
        [
          "unique",
          "unique"
        ],
        [
          "shingle",
          "shingle"
        ],
        [
          "contiguous",
          "contiguous"
        ],
        [
          "subsequence",
          "subsequence"
        ],
        [
          "token",
          "token"
        ],
        [
          "document",
          "document"
        ],
        [
          "gauge",
          "gauge"
        ],
        [
          "similarity",
          "similarity"
        ]
      ],
      "raw_glosses": [
        "(computing) In natural language processing, a set of unique \"shingles\" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents."
      ],
      "topics": [
        "computing",
        "engineering",
        "mathematics",
        "natural-sciences",
        "physical-sciences",
        "sciences"
      ],
      "wikipedia": [
        "w-shingling"
      ]
    }
  ],
  "word": "w-shingling"
}
{
  "etymology_text": "w denotes the number of tokens in each shingle in the set.",
  "forms": [
    {
      "form": "w-shinglings",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "w-shingling (plural w-shinglings)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "English countable nouns",
        "English entries with incorrect language header",
        "English lemmas",
        "English multiword terms",
        "English nouns",
        "Pages with 1 entry",
        "Pages with entries",
        "en:Computing"
      ],
      "glosses": [
        "In natural language processing, a set of unique \"shingles\" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents."
      ],
      "links": [
        [
          "computing",
          "computing#Noun"
        ],
        [
          "natural language processing",
          "natural language processing"
        ],
        [
          "set",
          "set"
        ],
        [
          "unique",
          "unique"
        ],
        [
          "shingle",
          "shingle"
        ],
        [
          "contiguous",
          "contiguous"
        ],
        [
          "subsequence",
          "subsequence"
        ],
        [
          "token",
          "token"
        ],
        [
          "document",
          "document"
        ],
        [
          "gauge",
          "gauge"
        ],
        [
          "similarity",
          "similarity"
        ]
      ],
      "raw_glosses": [
        "(computing) In natural language processing, a set of unique \"shingles\" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents."
      ],
      "topics": [
        "computing",
        "engineering",
        "mathematics",
        "natural-sciences",
        "physical-sciences",
        "sciences"
      ],
      "wikipedia": [
        "w-shingling"
      ]
    }
  ],
  "word": "w-shingling"
}
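
The records above can be read with any JSON parser. The following Python sketch pulls a few fields out of a trimmed copy of one record. The field names (`word`, `pos`, `forms`, `senses`, `glosses`, `topics`) are taken from the data shown here; the helper name `summarize` and the choice of fields are assumptions made for illustration only.

```python
import json

# A trimmed copy of the record shown above (raw string so the \" escapes survive).
record_text = r'''
{
  "word": "w-shingling",
  "pos": "noun",
  "forms": [{"form": "w-shinglings", "tags": ["plural"]}],
  "senses": [
    {
      "glosses": [
        "In natural language processing, a set of unique \"shingles\" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents."
      ],
      "topics": ["computing", "engineering", "mathematics"]
    }
  ]
}
'''


def summarize(entry):
    """Collect the headword, part of speech, plural forms, and glosses."""
    plurals = [
        form["form"]
        for form in entry.get("forms", [])
        if "plural" in form.get("tags", [])
    ]
    glosses = [
        gloss
        for sense in entry.get("senses", [])
        for gloss in sense.get("glosses", [])
    ]
    return {
        "word": entry["word"],
        "pos": entry.get("pos"),
        "plurals": plurals,
        "glosses": glosses,
    }


summary = summarize(json.loads(record_text))
```

Using `.get` with defaults keeps the sketch tolerant of records that lack optional fields such as `forms`.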


This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2024-12-15 from the enwiktionary dump dated 2024-12-04 using wiktextract (8a39820 and 4401a4c). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.