Similarity and dissimilarity indices for the speakers of the Croatian language in filtered and non–filtered conditions Cover Image

Indeksi sličnosti i različitosti kod govornika hrvatskoga jezika u nefiltriranim i filtriranim uvjetima
Similarity and dissimilarity indices for the speakers of the Croatian language in filtered and non–filtered conditions

Author(s): Gordana Varošanec-Škarić, Iva Pavić, Gabrijela Kišiček
Subject(s): Language and Literature Studies
Published by: Hrvatsko filološko društvo
Keywords: speaker identification; similarity index (R); dissimilarity index (SDDD); long term average spectra (LTASS); filtered speech; non–filtered speech; spontaneous speech; reading speech

Summary/Abstract: The main goal of this research was to compare the acoustic–statistical measurements of similarity index (R) and dissimilarity index (SDDD) on the basis of long term average spectra (LTASS). The collected speech samples consisted of both spontaneous speech and reading of a standardized text which were recorded in controlled conditions. The recorded speakers (N = 86) originated from 8 larger and dialectically different Croatian towns. The cities chosen for this research represented Sthokavian dialect (Zadar, Šibenik, Dubrovnik), Chakavian dialect (Rijeka, Pula), Kaikavian dialect (Zagreb, Čakovec) and Chakavian–Sthokavian variety (Split). Speech samples (duration 60 s) were edited for each speaker in both conditions (filtered and non–filtered speech) and compared on the basis of LTASS – non filtered conditions (0 – 10 kHz) and filtered (0.8 – 4 kHz). Using index R and index SDDD intraspeaker variations and interspeaker variations were compared respectively for male and female speakers. Results of intraspeaker variations showed that average values of similarity index (R) in non–filtered conditions were between 0.94 for male speakers in reading texts to 0.98 for female speakers in reading text and spontaneous speech. Results of interspeaker variations showed lower values of index R in the non–filtered conditions: from 0.86 in spontaneous speech to 0.94 in reading text for female speakers. Average values of R in filtered conditions for intraspeaker variations were between 0.83 for both female and male speakers in spontaneous speech to 0.95 in reading texts. Average values of R index in filtered conditions for interspeaker variations were significantly lower; from 0.57 for male spontaneous speech to 0.9 for female reading texts. Average values of index SDDD in non–filtered conditions for intraspeaker variations were generally lower – from 2.27 for female speakers to 3 for male speakers in reading. SDDD index showed higher values in non–filtered conditions for interspeaker variations; from 4.75 in female reading speech and male spontaneous speech to 5.12 for male reading speech. In filtered conditions intraspeaker variations resulted with SDDD index between 2.14 for male reading speech to 3.01 for female spontaneous speech. As expected, results in filtered conditions for interspeaker variations showed higher values of SDDD index, from 3.06 for female to 4.71 for male reading speech. The differences between similarity index (R) in intraspeaker variations were statistically significant for female speakers (p < 0.0001) and for male speakers (p < 0.05) in both spontaneous speech and reading.

  • Issue Year: 40/2014
  • Issue No: 77
  • Page Range: 61-76
  • Page Count: 16
  • Language: Croatian