Narodowy Korpus Języka Polskiego – co dalej?
National Corpus of Polish: What’s Next?

Author(s): Maciej Ogrodniczuk, Magdalena Derwojedowa, Marek Łaziński, Piotr Pęzik
Subject(s): Theoretical Linguistics, Studies of Literature
Published by: Wydział Polonistyki Uniwersytetu Warszawskiego
Keywords: NKJP; diachronia; odmiany języka; korpusy referencyjne; National Corpus of Polish; NCP; diachrony; varieties of Polish; reference corpora

Summary/Abstract: This article outlines possible directions for future work on the National Corpus of Polish, which is argued to be the basic resource for contemporary linguistic research on Polish. The most urgent objectives identified are extending the body of texts included in the balanced component of the corpus, improving its linguistic representation, and adding diachronic, dialectal and parallel corpora. The paper also considers a number of technical and organizational challenges involved in compiling and maintaining a large reference corpus, such as standardization of annotation formats, aggregation of disparate resources, and development of dedicated corpus processing tools.

  • Issue Year: 2017
  • Issue No: 71
  • Page Range: 237-245
  • Page Count: 9
  • Language: Polish