Simplified system for isolated word recognition

Authors

  • R. GUBRYNOWICZ Institute of Fundamental Technological Research, Polish Academy of Sciences, Poland
  • K. MARASEK Institute of Fundamental Technological Research, Polish Academy of Sciences, Poland
  • W. MIKIEL Institute of Fundamental Technological Research, Polish Academy of Sciences, Poland
  • W. WIĘŹLAK Institute of Fundamental Technological Research, Polish Academy of Sciences, Poland

Abstract

This paper presents a general-purpose system for recognizing a limited set of words uttered in isolation. The system is intended for voice control of a robot's movements. To minimize the number of operations performed during recognition and to limit memory requirements, frequency analysis of the signal was performed in suitably selected bands. The filter output signals undergo detection and are fed through an A/D converter into a computer, where they are processed further; this processing includes, among other steps, logarithmic conversion and linear time standardization, which reduces the numerical range of the values used in subsequent calculations. The DTW algorithm was used in the recognition process, with the templates of individual words entered once, in principle separately for each operator. The developed system, speaker-dependent in principle, was verified experimentally for various vocabularies (containing 20 to 60 words) uttered by 11 voices (including 1 female voice). The average recognition accuracy for a 60-word vocabulary exceeded 98% for individual voices, while for recognition without adapting the system to a given voice the average recognition error increased by about 10%.
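The processing chain outlined in the abstract (logarithmic conversion of band outputs, linear time standardization, DTW matching against stored word templates) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the number of bands, the frame count after standardization, the local distance, or the DTW step pattern, so the choices below (city-block distance, symmetric steps, 16 frames, a floor of 1.0 before the logarithm) and all function names are assumptions made for illustration only.

```python
import math

def log_compress(frame, floor=1.0):
    # Logarithmic conversion of one frame of band outputs.
    # The floor (an assumed value) avoids log(0) for silent bands.
    return [math.log(max(v, floor)) for v in frame]

def linear_time_normalize(frames, target_len):
    # Resample a sequence of frames to target_len frames
    # by linear interpolation along the time axis.
    n = len(frames)
    if n == 1 or target_len == 1:
        return [list(frames[0]) for _ in range(target_len)]
    out = []
    for i in range(target_len):
        pos = i * (n - 1) / (target_len - 1)
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        w = pos - lo
        out.append([(1 - w) * a + w * b
                    for a, b in zip(frames[lo], frames[hi])])
    return out

def dtw_distance(a, b):
    # Basic DTW with symmetric steps; the local cost is the
    # city-block distance between frames (an assumed choice).
    inf = float("inf")
    n, m = len(a), len(b)
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = sum(abs(x - y) for x, y in zip(a[i - 1], b[j - 1]))
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]

def preprocess(frames, target_len=16):
    # Log conversion followed by linear time standardization.
    return linear_time_normalize([log_compress(f) for f in frames],
                                 target_len)

def recognize(utterance, templates, target_len=16):
    # templates maps each word to its preprocessed frame sequence;
    # the word with the smallest DTW distance is returned.
    feats = preprocess(utterance, target_len)
    return min(templates, key=lambda w: dtw_distance(feats, templates[w]))
```

In a speaker-dependent setup of the kind described, the `templates` dictionary would be built once per operator by calling `preprocess` on one recording of each vocabulary word, and `recognize` would then be applied to each new utterance.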
