Previous [ 1] [ 2] [ 3] [ 4] [ 5] [ 6] [ 7] [ 8] [ 9] [ 10] [ 11] [ 12] [ 13] [ 14] [ 15] [ 16] [ 17] [ 18]

@

Journal of Information Science and Engineering, Vol. 22 No. 5, pp. 999-1013 (September 2006)

MAP-Based Perceptual Modeling for Noisy Speech Recognition

Yung-Ji Sher1,3, Yeou-Jiunn Chen4, Yu-Hsien Chiu5 Kao-Chi Chung1 and Chung-Hsien Wu2
1Institute of Biomedical Engineering
2Department of Computer Science and Information Engineering
National Cheng Kung University
Tainan, 701 Taiwan
3Department of Physical Therapy
Shu Zen College of Medicine and Management
Kaohsiung, 821 Taiwan
4Department of Electrical Engineering
Southern Taiwan University of Technology
Tainan, 710 Taiwan
5Computer and Communications Research Laboratories
Industrial Technology Research Institute
Hsinchu, 310 Taiwan

This study presents a maximum a posteriori (MAP) based perceptual modeling approach to deal with the issue of recognition degradation in noisy environment. In this approach, MAP-based noise detection is first applied to identify the noise segment in an utterance. Subtractive-type enhancement algorithm with masking properties of the human auditory system is then used to reduce the noise effect. Finally, MAP-based incremental noise model adaptation is developed to overcome the model inconsistencies between training and testing environments. For performance evaluation of the proposed approach, a Mandarin keyword recognition system was constructed. The experimental results show that the proposed approach achieves a better recognition rate compared to the audible noise suppression (ANS) and parallel model combination (PMC) methods.

Keywords: noisy speech recognition, speech enhancement, audible noise suppression, MAP-based perceptual modeling, noise detection, incremental model adaptation

Full Text () Retrieve PDF document (200609_01.pdf)

Received August 16, 2005; accepted January 17, 2006.
Communicated by Jhing-Fa Wang, Pau-Choo Chung and Mark Billinghurst.