Trilingual Aligned Corpus — Current State and New Applications
Trilingual Aligned Corpus — Current State and New Applications
Author(s): Ludmila Dimitrova, Violetta Koseska-Toszewa, Danuta Roszko, Roman RoszkoSubject(s): Theoretical Linguistics, Semantics, Western Slavic Languages, South Slavic Languages, Baltic Languages
Published by: Instytut Slawistyki Polskiej Akademii Nauk
Keywords: aligned trilingual corpus; digital resources; event; Petri net theory; semantic annotation; state
Summary/Abstract: This article describes current state of a trilingual parallel corpus consisted of texts in two Slavic (Bulgarian and Polish) and one Baltic language (Lithuanian). The corpus contains original literary texts (fiction, novels, and short stories) in one of the three languages with translations to the other two, and texts in other languages translated into Bulgarian, Polish, and Lithuanian. A part of the texts are aligned at the sentence level. The authors propose a semantic annotation of verbs appearing in these aligned texts that will facilitate contrastive studies of natural languages. A theoretical background for the proposed semantic annotation is briefly also discussed.
Journal: Cognitive Studies | Études cognitives
- Issue Year: 2014
- Issue No: 14
- Page Range: 13-20
- Page Count: 8
- Language: English