Lemmatization, Morphological Annotation and Disambiguation of the Slovak Text - Web Interface Cover Image

Lematizácia, morfologická anotácia a dezambiguácia slovenského textu - webové rozhranie
Lemmatization, Morphological Annotation and Disambiguation of the Slovak Text - Web Interface

Author(s): Radovan Garabík, Kristína Bobeková
Subject(s): Theoretical Linguistics, Applied Linguistics, Morphology, Computational linguistics, Western Slavic Languages
Published by: Jazykovedný ústav Ľudovíta Štúra Slovenskej akadémie vied
Keywords: lemmatization; MSD; POS tagging; Slovak; web interface; morphological annotation;NLP;

Summary/Abstract: Lemmatization, morphological (or morphosyntactic) annotation (MSD) and disambiguation is a basic and indispensable step in Natural Language Processing of languages with a moderate level of inflection. We present a web interface demonstrating the de facto default lemmatization and MSD for Slovak, as used in major Slovak corpora (with several enhancements yet to be applied in the corpora). The interface can be used chiefly for presentation or pedagogical purposes, with the morphological tags expanded and explained using plain language in several languages, including two different terminological registers of Slovak (professional linguistic or a "common" one).

  • Issue Year: 86/2021
  • Issue No: 1
  • Page Range: 104-109
  • Page Count: 6
  • Language: Slovak