Wiktionary data extraction errors and warnings

Sinhalese inflections

Download data in csv format

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
මහනවා verb 996 mahanawā romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ඇතැම් det 62 ætæm romanization
det 62 ඇතම් alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
බ්‍රාහ්මණයා noun 40 බ්රාහ්මණයා canonical
noun 40 brāhmaṇayā romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ගුළිය noun 39 guḷiya romanization
noun 39 ගුළි plural

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ව්‍යාඝ්‍රයා noun 4 ව්යාඝ්රයා canonical
noun 4 wyāghrayā romanization
noun 4 කොටි plural

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
ග්‍රහලෝක noun 3 ග්රහලෝක canonical
noun 3 grahalōka romanization
noun 3 ග්රහලොව alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
හුත්තා noun 2 huttā romanization
noun 2 හුත්තලා plural
noun 2 හුත්ති feminine

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
𑇤 num 2 𑇤 alternative

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
සත්‍තු noun 1 සත්තු plural canonical
noun 1 sattu romanization

Word Part of speech Count Form Tags Other examples (may be other parts of speech)
තුවරා noun 1 * IPA⁽ᵏᵉʸ⁾: canonical
noun 1 t̪uʋəraː canonical


This page is a part of the kaikki.org machine-readable dictionary. This dictionary is based on structured data extracted on 2025-12-23 from the enwiktionary dump dated 2025-12-02 using wiktextract (6fdc867 and 9905b1f). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.