Automatyczne wydobywanie terminologii dziedzinowej z korpusów tekstowych
Automatic extraction of domain terminology from text corpora
Author(s): Małgorzata Marciniak, Agnieszka Mykowiecka, Piotr Rychlik
Subject(s): Language and Literature Studies, Theoretical Linguistics, Applied Linguistics
Published by: Towarzystwo Miłośników Języka Polskiego
Keywords: domain terminology; text corpora
Summary/Abstract: Every knowledge domain or form of communication has its own characteristic vocabulary. In the traditional approach, dictionaries containing words and multi-word terms identifying important concepts and their lexical equivalents were created by specialists in a subject area. This method, however, is very time-consuming and therefore inadequate, especially for rapidly changing domains. In this paper we present a computer program allowing for automatic identification of noun phrases being potential domain terms and their ranking according to some measure of significance.
Journal: Język Polski
- Issue Year: 2017
- Issue No: 1
- Page Range: 64-74
- Page Count: 11
- Language: Polish