Wiktionary data extraction errors and warnings

aardvark/English/noun

Return to 'Debug messages subpage 2280'

aardvark (English noun) aardvark/English/noun: invalid uppercase tag Received-Pronunciation not in or uppercase_tags: {"derived": [{"word": "aardvark cucumber"}, {"word": "aardvarking"}, {"word": "aardvarklike"}, {"word": "Brazilian aardvark"}, {"word": "aardvark to zymurgy"}], "etymology_templates": [{"args": {"1": "en", "2": "ine-pro", "3": "*h₁er-"}, "expansion": "", "name": "root"}, {"args": {"1": "en", "2": "af", "3": "aardvark"}, "expansion": "Borrowed from Afrikaans aardvark", "name": "bor+"}, {"args": {"1": "obsolete"}, "expansion": "(obsolete)", "name": "q"}, {"args": {"1": "en", "2": "dum", "3": "aerde"}, "expansion": "Middle Dutch aerde", "name": "der"}, {"args": {"1": "en", "2": "dum", "3": "varken"}, "expansion": "Middle Dutch varken", "name": "der"}, {"args": {"1": "af", "2": "aarde", "3": "vark", "nocat": "1", "pos1": "from Middle Dutch <i class=\"Latn mention\" lang=\"dum\">aerde</i>", "pos2": "from Middle Dutch <i class=\"Latn mention\" lang=\"dum\">varken</i>", "t1": "earth", "t2": "pig"}, "expansion": "aarde (“earth”, from Middle Dutch aerde) + vark (“pig”, from Middle Dutch varken)", "name": "af"}], "etymology_text": "Borrowed from Afrikaans aardvark (obsolete), erdvark, from aarde (“earth”, from Middle Dutch aerde) + vark (“pig”, from Middle Dutch varken). Early European colonists in South Africa noticed that the animal was similar to a pig, while aarde hints at the animal's habit of burrowing.", "forms": [{"form": "aardvarks", "tags": ["plural"]}], "head_templates": [{"args": {}, "expansion": "aardvark (plural aardvarks)", "name": "en-noun"}], "lang": "English", "lang_code": "en", "pos": "noun", "senses": [{"categories": ["Amharic terms with redundant script codes", "Bengali terms with redundant script codes", "English countable nouns", "English entries with incorrect language header", "English lemmas", "English nouns", "English terms borrowed from Afrikaans", "English terms derived from Afrikaans", "English terms derived from Middle Dutch", "English terms derived from Proto-Indo-European", "English terms derived from the Proto-Indo-European root *h₁er-", "English terms with quotations", "English terms with usage examples", "Entries with translation boxes", "Greek terms with non-redundant manual transliterations", "Mandarin terms with redundant transliterations", "Pages with 3 entries", "Pages with entries", "Requests for gender in Oromo entries", "Terms with Afrikaans translations", "Terms with Amharic translations", "Terms with Arabic translations", "Terms with Armenian translations", "Terms with Azerbaijani translations", "Terms with Basque translations", "Terms with Bavarian translations", "Terms with Bengali translations", "Terms with Bulgarian translations", "Terms with Catalan translations", "Terms with Cherokee translations", "Terms with Cornish translations", "Terms with Czech translations", "Terms with Danish translations", "Terms with Dutch translations", "Terms with Erzya translations", "Terms with Esperanto translations", "Terms with Estonian translations", "Terms with Finnish translations", "Terms with French translations", "Terms with Galician translations", "Terms with Georgian translations", "Terms with German translations", "Terms with Greek translations", "Terms with Hadza translations", "Terms with Hausa translations", "Terms with Hebrew translations", "Terms with Hungarian translations", "Terms with Icelandic translations", "Terms with Igala translations", "Terms with Interlingua translations", "Terms with Irish translations", "Terms with Italian translations", "Terms with Japanese translations", "Terms with Juǀ'hoan translations", "Terms with Korean translations", "Terms with Latin translations", "Terms with Limburgish translations", "Terms with Lithuanian translations", "Terms with Malay translations", "Terms with Maltese translations", "Terms with Mandarin translations", "Terms with Manx translations", "Terms with Norman translations", "Terms with Norwegian translations", "Terms with Oromo translations", "Terms with Persian translations", "Terms with Polish translations", "Terms with Portuguese translations", "Terms with Romanian translations", "Terms with Russian translations", "Terms with Scottish Gaelic translations", "Terms with Spanish translations", "Terms with Swahili translations", "Terms with Swedish translations", "Terms with Thai translations", "Terms with Tswana translations", "Terms with Turkish translations", "Terms with Ukrainian translations", "Terms with Welsh translations", "Terms with Wolof translations", "Terms with Yiddish translations", "Terms with Yoruba translations", "Terms with Zulu translations", "en:African insectivores"], "examples": [{"text": "The aardvark burrows in the ground and feeds mostly on termites, which it catches with its long, slimy tongue.", "type": "example"}, {"ref": "2007 August 15, Wayne A. Davis, “Replies to Green, Szabó, Jeshion, and Siebel”, in Philosophical Studies, volume 137, number 3, →DOI:", "text": "If ‘aardvark lover’ has that syntactic structure, then it determinately means “lover of aardvarks.", "type": "quote"}], "glosses": ["The nocturnal, insectivorous primarily eating ants and termites, burrowing, mammal Orycteropus afer, of the order Tubulidentata, somewhat resembling a pig, common in some parts of sub-Saharan Africa."], "links": [["nocturnal", "nocturnal"], ["insectivorous", "insectivorous"], ["mammal", "mammal"], ["Orycteropus afer", "Orycteropus afer#Translingual"], ["Tubulidentata", "Tubulidentata#Translingual"], ["pig", "pig"]], "synonyms": [{"word": "African anteater"}, {"word": "antbear"}, {"word": "ant bear"}, {"word": "anteater"}, {"word": "earth pig"}], "wikipedia": ["aardvark"]}], "sounds": [{"ipa": "/ˈɑːd.vɑːk/", "tags": ["Received-Pronunciation"]}, {"ipa": "/ˈɑɹd.vɑɹk/", "tags": ["US"]}, {"audio": "en-us-aardvark.ogg", "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/4/40/En-us-aardvark.ogg/En-us-aardvark.ogg.mp3", "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/4/40/En-us-aardvark.ogg"}, {"audio": "aardvark.ogg", "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/f/fe/Aardvark.ogg/Aardvark.ogg.mp3", "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/f/fe/Aardvark.ogg"}], "synonyms": [{"word": "aard-vark"}, {"word": "erdvark"}], "translations": [{"code": "af", "lang": "Afrikaans", "sense": "mammal", "word": "erdvark"}, {"code": "af", "lang": "Afrikaans", "sense": "mammal", "tags": ["obsolete"], "word": "aardvark"}, {"code": "am", "lang": "Amharic", "roman": "ǧart", "sense": "mammal", "word": "ጃርት"}, {"code": "ar", "lang": "Arabic", "roman": "ḵinzīr al-ʔarḍ", "sense": "mammal", "tags": ["masculine"], "word": "خِنْزِير اَلْأَرْض"}, {"code": "hy", "lang": "Armenian", "roman": "xoġovakatam", "sense": "mammal", "word": "խողովակատամ"}, {"code": "hy", "lang": "Armenian", "roman": "afrikyan mrǰnaker", "sense": "mammal", "word": "աֆրիկյան մրջնակեր"}, {"code": "az", "lang": "Azerbaijani", "sense": "mammal", "word": "borudişli"}, {"code": "eu", "lang": "Basque", "sense": "mammal", "word": "urde inurrijale"}, {"code": "bar", "lang": "Bavarian", "sense": "mammal", "word": "erdfàggi"}, {"code": "bn", "lang": "Bengali", "roman": "arḍobhark", "sense": "mammal", "word": "আর্ডভার্ক"}, {"code": "bn", "lang": "Bengali", "roman": "dokkhin aphrikar jontubiśeś", "sense": "mammal", "word": "দক্ষিণ আফ্রিকার জন্তুবিশেষ"}, {"code": "bg", "lang": "Bulgarian", "roman": "mravojad", "sense": "mammal", "tags": ["masculine"], "word": "мравояд"}, {"code": "ca", "lang": "Catalan", "sense": "mammal", "tags": ["masculine"], "word": "porc formiguer"}, {"code": "chr", "lang": "Cherokee", "roman": "dosvdali digayesgi", "sense": "mammal", "word": "ᏙᏒᏓᎵ ᏗᎦᏰᏍᎩ"}, {"code": "cmn", "lang": "Chinese Mandarin", "roman": "tǔtún", "sense": "mammal", "word": "土豚"}, {"code": "kw", "lang": "Cornish", "sense": "mammal", "tags": ["masculine"], "word": "porhel dor"}, {"code": "cs", "lang": "Czech", "sense": "mammal", "tags": ["masculine"], "word": "hrabáč"}, {"code": "cs", "lang": "Czech", "sense": "mammal", "tags": ["masculine"], "word": "hrabáč kapský"}, {"code": "da", "lang": "Danish", "sense": "mammal", "tags": ["neuter"], "word": "jordsvin"}, {"code": "nl", "lang": "Dutch", "sense": "mammal", "tags": ["neuter"], "word": "aardvarken"}, {"code": "myv", "lang": "Erzya", "roman": "modatuvo", "sense": "mammal", "word": "модатуво"}, {"code": "eo", "lang": "Esperanto", "sense": "mammal", "word": "orikteropo"}, {"code": "eo", "lang": "Esperanto", "sense": "mammal", "word": "terporko"}, {"code": "et", "lang": "Estonian", "sense": "mammal", "word": "tuhnik"}, {"code": "fi", "lang": "Finnish", "sense": "mammal", "word": "maasika"}, {"code": "fr", "lang": "French", "sense": "mammal", "tags": ["masculine"], "word": "oryctérope"}, {"code": "gl", "lang": "Galician", "sense": "mammal", "tags": ["masculine"], "word": "porco formigueiro"}, {"code": "gl", "lang": "Galician", "sense": "mammal", "tags": ["masculine"], "word": "oricteropo"}, {"code": "ka", "lang": "Georgian", "roman": "milḳbila", "sense": "mammal", "word": "მილკბილა"}, {"code": "de", "lang": "German", "sense": "mammal", "tags": ["neuter"], "word": "Erdferkel"}, {"code": "el", "lang": "Greek", "roman": "myrmigkofágos", "sense": "mammal", "tags": ["masculine"], "word": "μυρμηγκοφάγος"}, {"code": "el", "lang": "Greek", "note": "oρυκτερόπους m (orykterópous, literally “\"Digging/Mining Feet\"”)", "sense": "mammal"}, {"code": "hts", "lang": "Hadza", "sense": "mammal", "word": "usai"}, {"code": "ha", "lang": "Hausa", "sense": "mammal", "word": "dagbi"}, {"code": "he", "lang": "Hebrew", "sense": "mammal", "tags": ["masculine"], "word": "שנבוב"}, {"code": "hu", "lang": "Hungarian", "sense": "mammal", "word": "földimalac"}, {"code": "is", "lang": "Icelandic", "sense": "mammal", "tags": ["neuter"], "word": "jarðsvín"}, {"code": "igl", "lang": "Igala", "sense": "mammal", "word": "ọ̀gbọ̀wù"}, {"code": "ia", "lang": "Interlingua", "sense": "mammal", "word": "orycteropo"}, {"code": "ga", "lang": "Irish", "sense": "mammal", "tags": ["masculine"], "word": "arcán talún"}, {"code": "it", "lang": "Italian", "sense": "mammal", "tags": ["masculine"], "word": "oritteropo"}, {"code": "ja", "lang": "Japanese", "roman": "tsuchibuta", "sense": "mammal", "word": "土豚"}, {"code": "ktz", "lang": "Juǀ'hoan", "sense": "mammal", "word": "gǃkún"}, {"code": "ko", "lang": "Korean", "roman": "ttangdwaeji", "sense": "mammal", "word": "땅돼지"}, {"code": "la", "lang": "Latin", "sense": "mammal", "tags": ["masculine"], "word": "orycteropus"}, {"code": "li", "lang": "Limburgish", "sense": "mammal", "word": "eerdverke"}, {"code": "lt", "lang": "Lithuanian", "sense": "mammal", "word": "vamzdžiadančiai"}, {"code": "ms", "lang": "Malay", "sense": "mammal", "word": "ardvark"}, {"code": "ms", "lang": "Malay", "sense": "mammal", "word": "babi tanah"}, {"code": "mt", "lang": "Maltese", "sense": "mammal", "tags": ["masculine"], "word": "orikteropu"}, {"code": "gv", "lang": "Manx", "sense": "mammal", "tags": ["feminine"], "word": "muc hallooin"}, {"code": "nrf", "lang": "Norman", "sense": "mammal", "tags": ["masculine"], "word": "couochon d’tèrre"}, {"code": "no", "lang": "Norwegian", "sense": "mammal", "tags": ["neuter"], "word": "jordsvin"}, {"code": "om", "lang": "Oromo", "sense": "mammal", "word": "awaaldiigessa"}, {"code": "om", "lang": "Oromo", "sense": "mammal", "word": "olokee"}, {"code": "om", "lang": "Oromo", "sense": "mammal", "word": "waldiigessa"}, {"code": "fa", "lang": "Persian", "note": "خوک خاکی (xuk-e xâki, literally “earth pig”)", "sense": "mammal"}, {"code": "pl", "lang": "Polish", "sense": "mammal", "tags": ["masculine"], "word": "mrównik"}, {"code": "pl", "lang": "Polish", "sense": "mammal", "tags": ["neuter"], "word": "prosię ziemne"}, {"code": "pt", "lang": "Portuguese", "sense": "mammal", "tags": ["masculine"], "word": "oricterope"}, {"code": "pt", "lang": "Portuguese", "sense": "mammal", "tags": ["masculine"], "word": "porco-da-terra"}, {"code": "pt", "lang": "Portuguese", "sense": "mammal", "tags": ["masculine"], "word": "jimbo"}, {"code": "pt", "lang": "Portuguese", "sense": "mammal", "tags": ["masculine"], "word": "porco-formigueiro"}, {"code": "pt", "lang": "Portuguese", "sense": "mammal", "tags": ["feminine"], "word": "timba"}, {"code": "pt", "lang": "Portuguese", "sense": "mammal", "tags": ["masculine"], "word": "timbo"}, {"code": "pt", "lang": "Portuguese", "sense": "mammal", "tags": ["masculine"], "word": "aardvark"}, {"code": "ro", "lang": "Romanian", "sense": "mammal", "tags": ["masculine"], "word": "porcul termitelor"}, {"code": "ru", "lang": "Russian", "roman": "trubkozúb", "sense": "mammal", "tags": ["masculine"], "word": "трубкозу́б"}, {"code": "gd", "lang": "Scottish Gaelic", "sense": "mammal", "tags": ["masculine"], "word": "mathan-sheangan"}, {"code": "es", "lang": "Spanish", "sense": "mammal", "tags": ["masculine"], "word": "cerdo hormiguero"}, {"code": "sw", "lang": "Swahili", "sense": "mammal", "tags": ["class-1", "class-2"], "word": "mhanga"}, {"code": "sv", "lang": "Swedish", "sense": "mammal", "tags": ["neuter"], "word": "jordsvin"}, {"code": "th", "lang": "Thai", "sense": "mammal", "word": "อาร์ดวาร์ก"}, {"code": "tn", "lang": "Tswana", "sense": "mammal", "tags": ["class-10", "class-9"], "word": "thakadu"}, {"code": "tr", "lang": "Turkish", "sense": "mammal", "word": "karıncayiyen"}, {"code": "tr", "lang": "Turkish", "sense": "mammal", "word": "aardvark"}, {"code": "tr", "lang": "Turkish", "sense": "mammal", "word": "borudişli"}, {"code": "uk", "lang": "Ukrainian", "roman": "trubkozúb", "sense": "mammal", "tags": ["masculine"], "word": "трубкозу́б"}, {"code": "cy", "lang": "Welsh", "sense": "mammal", "tags": ["masculine"], "word": "baedd daear"}, {"code": "cy", "lang": "Welsh", "sense": "mammal", "tags": ["feminine"], "word": "grugarth"}, {"code": "wo", "lang": "Wolof", "sense": "mammal", "word": "njaxat bi"}, {"code": "yi", "lang": "Yiddish", "roman": "erdshvayn", "sense": "mammal", "tags": ["masculine"], "word": "ערדשווײַן"}, {"code": "yo", "lang": "Yoruba", "sense": "mammal", "word": "àfèìmòjò"}, {"code": "zu", "lang": "Zulu", "sense": "mammal", "tags": ["class-7", "class-8"], "word": "isambane"}], "word": "aardvark"}


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2024-12-21 from the enwiktionary dump dated 2024-12-04 using wiktextract (d8cb2f3 and 4e554ae). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.