Nowe wielowarstwowe znakowanie lingwistyczne zrównoważonego Narodowego Korpusu Języka Polskiego
New multilayer linguistic annotation of the balanced National Corpus of Polish
Author(s): Witold Kieraś, Marcin Woliński, Bartłomiej Nitoń
Subject(s): Language and Literature Studies
Published by: Towarzystwo Miłośników Języka Polskiego
Keywords: corpus; natural language processing; morphosyntactic annotation; syntactic annotation
Summary/Abstract: The article describes a new version of the well-known and widely used National Corpus of Polish. The update modifies the annotation scheme of the morphosyntactic layer (especially the parts concerning grammatical gender) and adds two new annotation layers: a syntactic layer and a named-entity layer. All three layers are indexed in the MTAS corpus search engine and can be referenced in CQL corpus queries.
Journal: Język Polski
- Issue Year: 2021
- Issue No: 2
- Page Range: 59-70
- Page Count: 12
- Language: Polish
