Bulgarian Sense-annotated Corpus – between the tradition and novelty Cover Image

Bulgarian Sense-annotated Corpus – between the tradition and novelty
Bulgarian Sense-annotated Corpus – between the tradition and novelty

Author(s): Svetla Koeva
Subject(s): Language and Literature Studies
Published by: Instytut Slawistyki Polskiej Akademii Nauk
Keywords: corpus studies; corpus annotation; annotation principles

Summary/Abstract: The Bulgarian Sense-annotated Corpus (BulSemCor) is compiled according to the general methodology established by the SemCor project. It is a subset of the Brown Corpus of Bulgarian semantically annotated with a corresponding synonym set (synset) in the Bulgarian wordnet. Unlike the bulk of sense-annotated corpora where only (sets of) content words are annotated, in BulSemCor each lexical unit has been assigned a sense. The main contributions achieved in the work on BulSemCor are briefly decides in the presented paper: definition of an annotation schema, compilation of an input corpus, development of a sense-annotated corpus, Bulgarian wordnet enlargement.

  • Issue Year: 2012
  • Issue No: 12
  • Page Range: 181-198
  • Page Count: 18
  • Language: English