A corpus of Czech essays from the turn of the 1900s Cover Image

A corpus of Czech essays from the turn of the 1900s
A corpus of Czech essays from the turn of the 1900s

Author(s): Petr Pořízka
Subject(s): Language and Literature Studies, Applied Linguistics
Published by: Jazykovedný ústav Ľudovíta Štúra Slovenskej akadémie vied
Keywords: annotation; corpus; corpus linguistics; quantitative analysis; literary essay; multidimensional analysis; orthography; specificity score; TXM

Summary/Abstract: A literary essay is an interesting unit for language analyses, as its stylistic means often exceed the boundaries of the genre of an artistic essay. The article presents a new corpus of Czech literary essays covering approximately fifty years from 1890 to 1940. Along with the characterisation of the corpus and its annotation, the paper focuses on the TXM corpus tool: In the second part of the study, we use selected texts to conduct an analysis of seven various authors through multidimensional cluster analysis, factorial correspondence analysis and a specificity score. The main parameter of the analyses was usage of parts of speech in texts by individual authors. At present, the Corpus of Czech Essays contains 40 essayist titles written by 15 authors covering various topics (music, visual arts, theatre, literature, etc.).

  • Issue Year: 72/2021
  • Issue No: 2
  • Page Range: 618-630
  • Page Count: 13
  • Language: English