Optimized Wavelet-based Speech Enhancement for Speech Recognition in Noisy and Reverberant Conditions

摘要：

We present an improved speech enhancement method based on Wiener filtering in the wavelet domain for automatic speech recognition (ASR). The wavelet coefficients that are contaminated by the effects of late reflection and background noise are filtered using a Wiener gain. We optimize the wavelet parameters for speech, background noise and late reflection to achieve a better estimate of the Wiener gain for effective filtering. Wiener gains to compensate for the effects of late reflection and background noise are independently estimated and then combined. Moreover, we introduce the noise profile and reverberation time identification to cope with different noise and reverberant conditions. Experimental results in large vocabulary continuous speech recognition (LVCSR) show that the proposed method outperforms the conventional methods.

作者: Randy Gomez Tatsuya Kawahara

作者单位: Kyoto University, Academic Center for Computing and Media Studies (ACCMS), Sakyo-ku, Kyoto 606-8501, Kyoto University, Academic Center for Computing and Media Studies (ACCMS),Sakyo-ku, Kyoto 606-8501,

会议类型: 国际会议

会议名称: 2011亚太信号与信息处理协会年度峰会(APSIPAASC 2011)

会议地点: 西安

会议语种:英文

页码: 1-4

在线出版日期: 2011-10-18（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Optimized Wavelet-based Speech Enhancement for Speech Recognition in Noisy and Reverberant Conditions