Confronting representative spectral structures of vocal tract pulse responses

Downloads

Authors

  • Z. Wojan Gdańsk University of Technology, Faculty of Electronics, Telecommunications and Informatics, Department of Marine Electronics Systems, Gabriela Narutowicza 11/12, 80-952 Gdańsk, Poland
  • W. Lis Gdańsk University of Technology, Faculty of Electronics, Telecommunications and Informatics, Department of Marine Electronics Systems, Gabriela Narutowicza 11/12, 80-952 Gdańsk, Poland
  • K. Wojan University of Gdańsk, Institute of East Slavonic Studies, Russian Language Chair, Wita Stwosza 55, 80-952 Gdańsk, Poland

Abstract

Before the resources of a number of ethnic language systems can be confronted, multiple linguistic procedures must be applied to identify the possible common acoustic features in the utterances of the language users. There is a clear difference in the articulatory habits characteristic of the particular languages, which is why significant similarities must be sought in the oblique fragments of speech that convey the information code (sense) rather than in the acoustic representation of free speech. The paper discusses methods and results of automatic classification of selected lexemes of three language systems by confronting their digital representations. Digital representation includes sets of acoustic parameters as discussed in the previous OSA papers, which the authors termed as: representative spectral structures of vocal tract pulse responses. Representative structures... were produced by cepstral smoothing of averaged acoustic parameters taken from multiple utterances by speakers of different genders and ages. The paper includes spectrograms of the material used for confrontation and graphic illustrations of the results. Automatic classification of speech sounds of a pair of lexemes of two different languages is in fact the process of identifying the percentage proportions of convergent and divergent energies of lines in a set of parallel elements of both spectral matrixes being confronted, with time and frequency coordinates.

Keywords:

speech sound, information code, vocal tract, homomorphic analysis, homonymy.