Daugiakalbių tekstynų naudojimas ir taikymas
The usage and application of multilingual corpora

Author(s): Erika Rimkutė, Jolanta Kovalevskaitė, Vidas Daudaravičius
Subject(s): Language and Literature Studies
Published by: Vytauto Didžiojo Universitetas
Keywords: usage; application; multilingual; corpora

Summary/Abstract: The main point of the paper is to introduce multilingual corpora as useful and powerful tools for translation research. The paper describes the main types of multilingual corpora, the principles of their compilation, automatic alignment programs, and applications of corpora. Many researchers point out the advantages of multilingual corpora for various monolingual and multilingual linguistic analyses, yet the use of multilingual corpora in Lithuanian translation studies is still a new field of research. This paper introduces the most important steps in compiling the multilingual corpus at the Centre of Computational Linguistics at Vytautas Magnus University. The online corpus consists of three parts: an English-Lithuanian parallel corpus (6.768 parallel sentences), a Czech-Lithuanian parallel corpus (4.1881 parallel sentences), and a Lithuanian-Czech parallel corpus (693 parallel sentences). We introduce the software developed for this corpus and its search options. The article illustrates the usage of the multilingual corpus and contrasts it with the usage of traditional bilingual dictionaries. The corpus is still being developed, and the most recent extensions, such as adding more languages and other text types, are presented in this paper.

  • Issue Year: 2006
  • Issue No: 45
  • Page Range: 41-62
  • Page Count: 22
  • Language: Lithuanian