"w-shingling" meaning in English

See w-shingling in All languages combined, or Wiktionary

Noun

Forms: w-shinglings [plural]
Etymology: w denotes the number of tokens in each shingle in the set. Head templates: {{en-noun}} w-shingling (plural w-shinglings)
  1. (computing) In natural language processing, a set of unique "shingles" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents. Wikipedia link: w-shingling Categories (topical): Computing
    Sense id: en-w-shingling-en-noun-JKWu6agl Categories (other): English entries with incorrect language header Topics: computing, engineering, mathematics, natural-sciences, physical-sciences, sciences

Inflected forms

Download JSON data for w-shingling meaning in English (1.5kB)

{
  "etymology_text": "w denotes the number of tokens in each shingle in the set.",
  "forms": [
    {
      "form": "w-shinglings",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "w-shingling (plural w-shinglings)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "topical",
          "langcode": "en",
          "name": "Computing",
          "orig": "en:Computing",
          "parents": [
            "Technology",
            "All topics",
            "Fundamental"
          ],
          "source": "w"
        }
      ],
      "glosses": [
        "In natural language processing, a set of unique \"shingles\" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents."
      ],
      "id": "en-w-shingling-en-noun-JKWu6agl",
      "links": [
        [
          "computing",
          "computing#Noun"
        ],
        [
          "natural language processing",
          "natural language processing"
        ],
        [
          "set",
          "set"
        ],
        [
          "unique",
          "unique"
        ],
        [
          "shingle",
          "shingle"
        ],
        [
          "contiguous",
          "contiguous"
        ],
        [
          "subsequence",
          "subsequence"
        ],
        [
          "token",
          "token"
        ],
        [
          "document",
          "document"
        ],
        [
          "gauge",
          "gauge"
        ],
        [
          "similarity",
          "similarity"
        ]
      ],
      "raw_glosses": [
        "(computing) In natural language processing, a set of unique \"shingles\" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents."
      ],
      "topics": [
        "computing",
        "engineering",
        "mathematics",
        "natural-sciences",
        "physical-sciences",
        "sciences"
      ],
      "wikipedia": [
        "w-shingling"
      ]
    }
  ],
  "word": "w-shingling"
}
{
  "etymology_text": "w denotes the number of tokens in each shingle in the set.",
  "forms": [
    {
      "form": "w-shinglings",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "w-shingling (plural w-shinglings)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "English countable nouns",
        "English entries with incorrect language header",
        "English lemmas",
        "English multiword terms",
        "English nouns",
        "en:Computing"
      ],
      "glosses": [
        "In natural language processing, a set of unique \"shingles\" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents."
      ],
      "links": [
        [
          "computing",
          "computing#Noun"
        ],
        [
          "natural language processing",
          "natural language processing"
        ],
        [
          "set",
          "set"
        ],
        [
          "unique",
          "unique"
        ],
        [
          "shingle",
          "shingle"
        ],
        [
          "contiguous",
          "contiguous"
        ],
        [
          "subsequence",
          "subsequence"
        ],
        [
          "token",
          "token"
        ],
        [
          "document",
          "document"
        ],
        [
          "gauge",
          "gauge"
        ],
        [
          "similarity",
          "similarity"
        ]
      ],
      "raw_glosses": [
        "(computing) In natural language processing, a set of unique \"shingles\" (contiguous subsequences of tokens in a document) that can be used to gauge the similarity of documents."
      ],
      "topics": [
        "computing",
        "engineering",
        "mathematics",
        "natural-sciences",
        "physical-sciences",
        "sciences"
      ],
      "wikipedia": [
        "w-shingling"
      ]
    }
  ],
  "word": "w-shingling"
}

This page is a part of the kaikki.org machine-readable English dictionary. This dictionary is based on structured data extracted on 2024-05-01 from the enwiktionary dump dated 2024-04-21 using wiktextract (f4fd8c9 and c9440ce). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.