High Quality Syntactic Annotated Corpus of Lithuanian – VILSINTEKS Cover Image

Aukštos kokybės sintaksiškai anotuotas lietuvių kalbos tekstynas VILSINTEKS
High Quality Syntactic Annotated Corpus of Lithuanian – VILSINTEKS

Author(s): Daiva Šveikauskienė
Subject(s): Theoretical Linguistics, Morphology, Semantics, Baltic Languages
Published by: Lietuvių Kalbos Institutas
Keywords: syntactic annotated corpus; graph representation of the syntactic structure; layers of annotation in the Lithuanian corpus; insufficiency of tree-representation for Lithuanian sentences;

Summary/Abstract: This paper presents a twofold annotation, which is used for the high quality annotation of the Lithuanian corpus. Comprehensive information about a sentence is given in a table and the syntactic structure of a sentence is presented in a picture. The experience of other languages is being used, and specific features of the Lithuanian language are taken into account. The insufficiency of the tree-representation for the syntactic structure of Lithuanian sentences is shown through the statistically annotated examples. The goal of the creation of the annotated corpus bearing exhaustive information is also clearly emphasized. The examples of the annotated sentences are given, which reflect the specific features of the Lithuanian language.

  • Issue Year: 2015
  • Issue No: 73
  • Page Range: 252-267
  • Page Count: 16
  • Language: English