Statistical Model-Based Voice Activity Detection Algorithm in DCT Transformation Domain
A statistical model-based voice activity detection (VAD) algorithm in discrete cosine transformed (DCT) domain is proposed. This algorithm uses a Hidden Markov Model (HMM) with two states to estimate the probability of voice activity and employs the decision-directed (DD) parameter estimation method for the likelihood ratio test. Tiicu develops a smoothed likelihood ratio (SLR) instead of conventional likelihood ratio (LR) in order to alleviate the delayed term of LR. Simulation results show that the proposed VAD outperforms the G.729 B VAD and the traditional discrete Fourier transformation (DFT) based VAD in various noise environments.
voice activity detection discrete cosine transform decision-directed method smoothed likelihood ratio Hidden Markov model
Ou Shifeng Gao Ying
Institute of Science and Technology for Opto-electronic Information, Yantai University, Yantai, China
国际会议
武汉
英文
673-677
2010-06-06(万方平台首次上网日期,不代表论文的发表时间)