Obsah a značkování diachronního korpusu češtiny
The Content and Annotation of the Diachronic Corpus of Czech

Author(s): Boris Lehečka
Subject(s): Language and Literature Studies, Applied Linguistics
Published by: Univerzita Karlova v Praze - Filozofická fakulta, Vydavatelství
Keywords: diachronic corpus; corpus manager; vertical format; frequency

Summary/Abstract: The paper discusses what content and annotation should be included in the diachronic corpus of Old Czech. Based on his analysis of the current state of DIAKORP and the Old Czech Text Bank, the author suggests how to treat the critical apparatus, foreign words in historical Czech texts, and contemporaneous or later marginal or interlinear notes. He also discusses some aspects of the methodology for computing statistics in the diachronic corpus.

  • Issue Year: 97/2015
  • Issue No: 1
  • Page Range: 70-77
  • Page Count: 8
  • Language: Czech