Voice Activity Detection with Decision Trees in Noisy Environments
An improved project based on double thresholds method in noisy environments is proposed for robust endpoints detection. Firstly, in this method, the distribution of zero crossing rate (ZCR) on the preprocessed signal is taken into account, and then the speech signal is divided into different parts to obtain appropriate thresholds with decision trees on the basis of the ZCR distribution. Finally, the double thresholds method, focusing on different importance of the energy and ZCR, is taken in the corresponding situation to determine the input segment is speech or non-speech. Simulation results indicate that the proposed method with decision trees obtains more accurate data than the traditional double thresholds method.
voice activity detection (VAD) zero crossing rate (ZCR) distribution density thresholds
Hu da-li Yi liangzhong Pei zheng Luo Bing
School of Mathematics & Computer Engineering, Xihua University, Chengdu 610039 Sichuan Police College, Chengdu 646000
国际会议
三亚
英文
749-752
2012-01-06(万方平台首次上网日期,不代表论文的发表时间)