Development of a Database and Models for Children’s Speech in the Slovak Language for Speech-Oriented Applications

Author(s): Ján Staš, Stanislav Ondáš, Matúš Pleva, Matej Horváth, Richard Ševc, Patrik Michalanský
Subject(s): Language studies, Applied Linguistics, Language acquisition, Psycholinguistics, Computational linguistics, Western Slavic Languages, ICT Information and Communications Technologies
Published by: SAV - Slovenská akadémia vied - Jazykovedný ústav Ľudovíta Štúra Slovenskej akadémie vied
Keywords: acoustic model; automatic speech recognition; data augmentation; children’s speech; speech database

Summary/Abstract: Children’s speech differs significantly from adult speech due to physiological and cognitive developmental factors. Key differences include higher pitch, a shorter vocal tract, higher formant frequencies, slower speaking rates, and greater variability in pronunciation and articulation. These differences produce an acoustic mismatch between children’s and adult speech, which makes traditional automatic speech recognition models trained on adult speech less effective for children. Linguistic differences, such as a limited vocabulary and evolving grammar, add to this challenge. This paper focuses on the creation of a children’s speech database for the low-resource Slovak language. The database was used to train acoustic models for the automatic recognition of spontaneous children’s speech in Slovak. We compared three approaches to speech recognition; the self-supervised learning approach achieved results comparable to those of similar studies in this area, despite the relatively small amount of training data.

  • Issue Year: 76/2025
  • Issue No: 1
  • Page Range: 223-233
  • Page Count: 11
  • Language: English