Исследование языковой семантики с помощью формализации запросов к корпусным данным
Study of Linguistic Semantics by Means of Formalisation of Queries to Corpus Data

Author(s): Alfia Makarimovna Galieva, Olga Avenirovna Nevzorova
Subject(s): Theoretical Linguistics, Semantics, Eastern Slavic Languages
Published by: Kazan (Volga Region) Federal University
Keywords: corpus; Tatar language; search query; grammar; semantics

Summary/Abstract: The advantages of using linguistic corpus data in education and research are obvious and well covered in the specialized literature. Corpora considerably simplify the acquisition and processing of linguistic data. Two main corpora have been built for the Tatar language so far, both openly accessible: the Corpus of Written Tatar, compiled at Kazan Federal University (http://search.corpus.tatar/en), and the Tatar National Corpus (http://corpus.antat.ru/?lang=en), developed by researchers of the Institute of Applied Semiotics, Tatarstan Academy of Sciences, Russia. These corpora are replenished hourly; the text collections are updated mainly with media texts, which provides a constant flow of fresh linguistic material. The Tatar language has complicated syntax and intricate agglutinative morphology, and corpus data is a reliable tool for enriching and deepening linguistic descriptions of Tatar. This paper is the first attempt to describe examples of complex queries to the search system of the “Tugan Tel” Tatar National Corpus; these queries are aimed at studying complicated phenomena of Tatar linguistic semantics.

  • Issue Year: 158/2016
  • Issue No: 5
  • Page Range: 1315-1324
  • Page Count: 10
  • Language: Russian