Sharing data through specialized corpus-based tools: The case of GramatiKat Cover Image

Sharing data through specialized corpus-based tools: The case of GramatiKat
Sharing data through specialized corpus-based tools: The case of GramatiKat

Author(s): Dominika Kováříková
Subject(s): Language and Literature Studies, Applied Linguistics
Published by: Jazykovedný ústav Ľudovíta Štúra Slovenskej akadémie vied
Keywords: specialized corpus tools; grammatical category; morphology; lexicography;Open Science;

Summary/Abstract: This paper presents a specialized corpus tool GramatiKat in the context of Open Science principles, namely data sharing, which offers opportunities for original research and facilitates verifiability of research and building on previous research. The tool is designed primarily for examining grammatical categories from the quantitative point of view. It offers grammatical profiles of particular lemmas (currently 14 thousand Czech nouns) and the proportion of individual grammatical categories within a part of speech, i.e., the standard behavior of a word class. The data in GramatiKat are pre-processed, statistically evaluated, and presented in charts and tables for clarity, and they are available to other linguists, especially from fields of morphology and lexicography. This article is aimed at providing inspiration and support to corpus and non-corpus linguists with utilization and enhanced use of the existing tools and with the creation of new specialized tools available to other users.

  • Issue Year: 72/2021
  • Issue No: 2
  • Page Range: 531-544
  • Page Count: 14
  • Language: English