AR Model-based Bayesian Speech Enhancement for Nonstationary Environment

Abstract:

This paper presents a new technique for enhancing an audio signal recorded in a noisy, nonstationary environment. An autoregressive (AR) model is used to efficiently exploit the temporal correlation of the audio and noise signals within a short stationary frame. The temporal models of the signal and the noise process are combined to construct a state space, which describes the observed noisy signal as generated by two underlying sources that evolve with Markovian dynamics across successive time steps. In this state space, the clean speech and the noise are the two hidden source signals. The recovery of the clean speech and the estimation of all model parameters are carried out within the variational Bayesian framework, and the original speech is estimated as a state using a variational Kalman smoother. Experimental results show that the approach achieves better performance in terms of the signal-to-noise ratio (SNR) measure.
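The state-space construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it assumes the AR coefficients and innovation variances of both sources are already known and runs an ordinary Kalman filter followed by a Rauch-Tung-Striebel smoother, whereas the paper estimates the parameters within a variational Bayesian framework. All function names, the AR orders, the coefficient values and the small observation-noise variance `r` are hypothetical choices made for illustration.

```python
import numpy as np

def companion(coeffs):
    """Companion matrix of x_t = sum_k coeffs[k] * x_{t-1-k} + e_t."""
    p = len(coeffs)
    F = np.zeros((p, p))
    F[0, :] = coeffs
    F[1:, :-1] = np.eye(p - 1)
    return F

def build_state_space(a, b, var_s, var_n):
    """Stack the speech AR(p) and noise AR(q) models into one state space."""
    p, q = len(a), len(b)
    F = np.zeros((p + q, p + q))
    F[:p, :p] = companion(a)          # speech dynamics
    F[p:, p:] = companion(b)          # noise dynamics
    Q = np.zeros((p + q, p + q))
    Q[0, 0], Q[p, p] = var_s, var_n   # innovations drive s_t and n_t only
    H = np.zeros(p + q)
    H[0] = H[p] = 1.0                 # observation: y_t = s_t + n_t
    return F, Q, H

def kalman_smoother(y, F, Q, H, r=1e-4):
    """Kalman filter plus RTS smoother; returns smoothed state means."""
    T, d = len(y), F.shape[0]
    m_pred = np.zeros((T, d)); P_pred = np.zeros((T, d, d))
    m_filt = np.zeros((T, d)); P_filt = np.zeros((T, d, d))
    m, P = np.zeros(d), np.eye(d)     # assumed prior on the initial state
    for t in range(T):
        m, P = F @ m, F @ P @ F.T + Q             # predict
        m_pred[t], P_pred[t] = m, P
        S = H @ P @ H + r                         # innovation variance
        K = P @ H / S                             # Kalman gain
        m = m + K * (y[t] - H @ m)                # update
        P = P - np.outer(K, H @ P)
        m_filt[t], P_filt[t] = m, P
    m_smooth = m_filt.copy()
    for t in range(T - 2, -1, -1):                # backward RTS pass
        G = P_filt[t] @ F.T @ np.linalg.inv(P_pred[t + 1])
        m_smooth[t] = m_filt[t] + G @ (m_smooth[t + 1] - m_pred[t + 1])
    return m_smooth

def snr_db(clean, estimate):
    """SNR in dB of an estimate relative to the clean signal."""
    return 10 * np.log10(np.sum(clean ** 2) / np.sum((clean - estimate) ** 2))

# Synthetic demo: AR(2) "speech" plus AR(1) "noise" (illustrative values).
a, b, var_s, var_n = [1.6, -0.8], [-0.7], 1.0, 4.0
rng = np.random.default_rng(0)
T = 2000
s, n = np.zeros(T), np.zeros(T)
for t in range(2, T):
    s[t] = a[0] * s[t - 1] + a[1] * s[t - 2] + rng.normal(scale=np.sqrt(var_s))
    n[t] = b[0] * n[t - 1] + rng.normal(scale=np.sqrt(var_n))
y = s + n

F, Q, H = build_state_space(a, b, var_s, var_n)
s_hat = kalman_smoother(y, F, Q, H)[:, 0]   # first state component is s_t
snr_in, snr_out = snr_db(s, y), snr_db(s, s_hat)
print(f"input SNR: {snr_in:.2f} dB, output SNR: {snr_out:.2f} dB")
```

The first component of the smoothed state is the speech estimate, so the enhancement step reduces to reading off one coordinate of the hidden state. In a nonstationary setting the parameters would have to be re-estimated frame by frame, which this sketch leaves out.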

Authors: Qinghua Huang, Kai Liu

Affiliation: School of Communication and Information Engineering, Shanghai University, Shanghai, P.R. China

Conference type: International conference

Conference name: The Second International Joint Conference on Computational Science and Optimization (CSO 2009)

Conference location: Sanya

Conference language: English

Pages: 918-921

Online publication date: 2009-04-24 (the date the record first went online on the Wanfang platform; not the paper's publication date)
