Automatic speech signal segmentation with chosen parametrization method

Downloads

Authors

  • Cz. BASZTURA Institute of Telecommunication and Acoustics, Wrocław Technical University, Poland
  • T. SAWCZYN Institute of Telecommunication and Acoustics, Wrocław Technical University, Poland

Abstract

This paper is dedicated to the problem of automatic segmentation of a speech signal into so-called phonetic segments, i.e. speech signal segment with homogeneous physical structure which can be ascribed with adequate phonetic mean. This is the second trend in segmentation, as opposed to speech signal segmentation into short fixed segments. A segmentation algorithm is presented. It is based on calculations of the phonetic function at speech, which makes it possible to find the boundaries of these phonetic segments. The usability of three different parametrization methods - based on the analysis of zero-crossings, spectral analysis and linear prediction coding - was analyzed. No significant differences were observed in the efficiency of investigated parameters.

References

[1] R. ANDRE-OBRECHT, A new statistical approach for the automatic segmentation of continuous speech signals, IEEE Trans. on Acoustics, Speech and Signal Processing, 36, 1, 29-39 (1988).

[2] Cz. BASZTURA, J. JURKIEWICZ, E. TRYBURCY, Phonetic function of speech F.F.M. Applied in continuous speech signal segmentation (in Polish) Archiwum Akustyki, 4, 4, 121-130 (1979).

[3] Cz. BASZTURA, Acoustic sources signals and images (in Polish), WKiŁ., Warszawa 1988.