A mathematical model for the determination of speech signal parameters with the aid of instant memory
Abstract
In this paper the concept is presented of a system for the determination of basic parameters of speech signals, including formants and their transients. These parameters are described in stages by means of characteristic functions of the sound features. Values of the characteristic functions for the j-th stage of the sound features are arguments for the characteristic functions of other features of the (j+1)-st stage. If the values of these functions equal unity, they are automatically recorded in the memory. The length of the time of storage of these values is determined by physical properties of the sound. Physical properties at the (j+1)-st stage are described by means of relations that establish the sequence of appearance of the sound features as a function of time at the output of the j-th stage. The system enables the analysis of sounds in real time, as well as the automatic segmentation of sound as a function of time. The accuracy of the identification of speech does not depend on the frequency of the larynx tone or on the speech rate, etc. With the system it is possible to use bandpass filters with comparatively broad transmission band widths. A formal description of the system by means of characteristic functions of the features allows direct design and construct of the system by means of generally available integrated circuits.References
[1] J. L. FLANGAN, Speech analysis, synthesis and perception, Springer-Verlag, Berlin 1971.
[2] R. JACOBSON, C. FANT, M. HALLE, Preliminaries to speech analysis. The distinctive features and their correlates, MIT Press, Cambridge 1964
.
[3] J. L. KULIKOWSKI, Cybernetic identification systems, PWN, Warszawa 1972 [in Polish].
[4] H. KUBZELA, Automatic extraction of the frequency of larynx tone as well as of first formants of speech signal, IFTR Reports, PAN, Warszawa 1973.
[6] M. A. SAPOŻKOV, Speech signal in telecommunications and cybernetics, PWN, Warszawa 1965 [in Polish].
[6] Z. M. WÓJCIK, Conception of instant memory in the identification of sounds, Works by IBIB-PAN, Warszawa 1975 [in Polish].
[2] R. JACOBSON, C. FANT, M. HALLE, Preliminaries to speech analysis. The distinctive features and their correlates, MIT Press, Cambridge 1964
.
[3] J. L. KULIKOWSKI, Cybernetic identification systems, PWN, Warszawa 1972 [in Polish].
[4] H. KUBZELA, Automatic extraction of the frequency of larynx tone as well as of first formants of speech signal, IFTR Reports, PAN, Warszawa 1973.
[6] M. A. SAPOŻKOV, Speech signal in telecommunications and cybernetics, PWN, Warszawa 1965 [in Polish].
[6] Z. M. WÓJCIK, Conception of instant memory in the identification of sounds, Works by IBIB-PAN, Warszawa 1975 [in Polish].