Image similarity functions in non-parametric algorithms of voice identification

Downloads

Authors

  • Cz. BASZTURA Institute of Telecommunication and Acoustics of the Wrocław Technical University, Poland
  • J. ZUK Institute of Telecommunication and Acoustics of the Wrocław Technical University, Poland

Abstract

This paper is dedicated to the question of the choice of a function of similarity between images in non-parametric alogorithms of voice recognition. The usefulness of 10 similarity functions (8 distances and 2 nearness'es) in three non-parametric identification algorithms – NN (nearest neighbour), k-NN (k-nearest neighbours) and NM (nearest mean) – was investigated for three sets of parameters (1 natural and 2 normalized). Results obtained for a population of speakers from a closed set with size M = 20 (after 10 repetitions of the learning and test sequences) have proved that the Camberr distance function prevails in all types of parameters and algorithms. Other functions ensure a differentiated discrimination force strongly dependent on the algorithm and form of parameters. Limited usefulness of the square of Mahalonobis distance in comparison to other similarity functions was proved, as well as generally worse results for the NM algorithm.

References

[1] Cz. BASZTURA, Sources, signals and acoustic images (in Polish), WKiŁ, Warszawa 1988.

[2] Cz. BASZTURA, J. JURKIEWICZ, Analysis of zero-crossings of a speech signal in a short-term model of automatic speaker identification (in Polish) Arch. Akustyki 13, 3, 203-214 (1978).

[3] Cz. BASZTURA, Similarity functions of acoustic images as indicators of objective evaluation of speech quality transmission (in Polish) Arch. Akustyki 22, 3, 217-233 (1987).

[4] A. J. GRAY, J. D. MARKEL, Distance measures for speech processing, IEEE ASSP-24, 5, 380-391 (1976).