Optimized Wavelet-based Speech Enhancement for Speech Recognition in Noisy and Reverberant Conditions
We present an improved speech enhancement method based on Wiener filtering in the wavelet domain for automatic speech recognition (ASR). The wavelet coefficients that are contaminated by the effects of late reflection and background noise are filtered using a Wiener gain. We optimize the wavelet parameters for speech, background noise and late reflection to achieve a better estimate of the Wiener gain for effective filtering. Wiener gains to compensate for the effects of late reflection and background noise are independently estimated and then combined. Moreover, we introduce the noise profile and reverberation time identification to cope with different noise and reverberant conditions. Experimental results in large vocabulary continuous speech recognition (LVCSR) show that the proposed method outperforms the conventional methods.
Randy Gomez Tatsuya Kawahara
Kyoto University, Academic Center for Computing and Media Studies (ACCMS), Sakyo-ku, Kyoto 606-8501, Kyoto University, Academic Center for Computing and Media Studies (ACCMS),Sakyo-ku, Kyoto 606-8501,
国际会议
2011亚太信号与信息处理协会年度峰会(APSIPAASC 2011)
西安
英文
1-4
2011-10-18(万方平台首次上网日期,不代表论文的发表时间)