Automatic extraction of domain terminology from text corpora Cover Image

Automatyczne wydobywanie terminologii dziedzinowej z korpusów tekstowych
Automatic extraction of domain terminology from text corpora

Author(s): Małgorzata Marciniak, Agnieszka Mykowiecka, Piotr Rychlik
Subject(s): Language and Literature Studies, Theoretical Linguistics, Applied Linguistics
Published by: Towarzystwo Miłośników Języka Polskiego
Keywords: domain terminology; text corpora

Summary/Abstract: Every knowledge domain or form of communication has its own characteristic vocabulary. In the traditionalapproach, dictionaries containing words and multi-word terms identifying important concepts and theirlexical equivalents were created by specialists in a subject area. This method, however, is very time-consumingand therefore inadequate, especially for rapidly changing domains. In this paper we present a computer program allowing for automatic identification of noun phrases being potential domain terms and their rankingaccording to some measure of significance.

  • Issue Year: 2017
  • Issue No: 1
  • Page Range: 64-74
  • Page Count: 11
  • Language: Polish