Fiction Literature Sentence Cumbersomeness Statistical Analysis Cover Image

Grožinės literatūros sakinių sudėtingumo analizė
Fiction Literature Sentence Cumbersomeness Statistical Analysis

Author(s): Karolina Piaseckienė
Subject(s): Language and Literature Studies, Syntax
Published by: Panevėžio kolegija
Keywords: of the sentence complexity of the structure;Trigrama;linearly regression model

Summary/Abstract: The goal of this research is to explore the complexity of sentence structures of prose books for children.The following problems: 1) To see, or the algorithms based on trigrams and suitable, for example, for the English language are applicable to the Lithuanian language studies. 2) To make (if possible) the linear regression model, which described the parts of sentence in subject and predicate structures, and 3) to make (if possible) the model to describe the length of the sentence.The complexity of sentence structure can be conceived as depth and width of a sentence according to the words subordinate to the main parts of sentence (the subject and predicate). Also very important is the distance between the adjacent rank parts of the sentence.In this work were analyzed only the simple sentences the structure of which can be represented by a graph-tree. So, it cannot be argued that the trigram statistics are suitable for Lithuanian language. Though, for example, even the 93.13% of the distances from the subject to directly related parts of the sentence are not more than 2, but for predicate – its 71.32%. Was made the linear regression model, which described the parts of sentence in predicate structures. This model correctly predicts 34.34% of data.Was made the linear regression model, which described the length of the sentence, but only predicates and information about its go in this model.

  • Issue Year: 11/2015
  • Issue No: 1
  • Page Range: 134-139
  • Page Count: 6
  • Language: Lithuanian