The Use of Electronic Historical Dictionary Data in Corpus Design Cover Image

The Use of Electronic Historical Dictionary Data in Corpus Design
The Use of Electronic Historical Dictionary Data in Corpus Design

Author(s): Renata Bronikowska, Włodzimierz Gruszczyński, Maciej Ogrodniczuk, Marcin Woliński
Subject(s): Theoretical Linguistics, Applied Linguistics
Published by: Wydawnictwo Uniwersytetu Jagiellońskiego
Keywords: text corpus; text annotation; historical dictionary; historical corpus; Old Polish;inflectional analysis

Summary/Abstract: The History of the 17th and 18th c. Polish Language Laboratory, Institute of Polish Language, Polish Academy of Sciences, is in the process of creating two large databases: The Electronic Dictionary of the 17th−18th c. Polish and The Electronic Corpus of the 17th and 18th c. Polish Texts (up to 1772), the latter in cooperation with the Institute of Computer Science, Polish Academy of Sciences. It is expected that combining these two sets of data will help to achieve the objectives established for both database projects. The present article shows the benefits that the Corpus creators can get from the data gathered in the dictionary, with special emphasis put on the use of grammatical information included in the dictionary entries to design tools for automatic text annotation in the Corpus.

  • Issue Year: 11/2016
  • Issue No: 2
  • Page Range: 47-56
  • Page Count: 10
  • Language: English