Building of Broadcast News Database for Evaluation of the Automated Subtitling Service

Author(s): Matúš Pleva, Jozef Juhar
Subject(s): Media studies, ICT Information and Communications Technologies
Published by: Žilinská univerzita v Žilině
Keywords: broadcast news; segmentation; speech recognition; transcriber

Summary/Abstract: This paper describes the recording, annotation, correction and evaluation of a new Broadcast News (BN) speech database named KEMT-BN2, an extension of our older KEMT-BN1 and COST-278 databases, which are used to develop automatic continuous speech recognition for Slovak. The use of the database and its statistics are presented. The database was prepared for evaluating the automated BN transcription system developed in our laboratory, which is used mainly to generate subtitles for recorded BN shows. The speech database is a key resource for training acoustic models for specific domains and for creating speaker- and anchor-adapted models.

  • Issue Year: 15/2013
  • Issue No: 2A
  • Page Range: 124-128
  • Page Count: 5
  • Language: English