Kernel Fitting for Speech Detection and Enhancement
A kernel fitting algorithm is proposed for speech denoising to improve the precision of voice activity detection (VAD) and the performance of speech enhancement, of some popular algorithms. In the algorithm, a noisy speech frame is filtered by kernel fitting, and then its power spectral density is estimated and weighted by a gain factor constructed from frame energy and zero-crossing rate, so that a speech signal is obviously discriminated from a nonspeech one. By incorporation of the VAD outputs and the noise effect into the kernel fitting process, a speech frame is enhanced with better performance than the spectra subtraction algorithm. Experiments are taken on a real life speech signal plus simulated noises, and the results show the potentiality of the proposed algorithms in speech detection and enhancement.
Speech detection speech enhancement kernel fitting cepstral coefficients power spectral density spectra subtraction
Benyong Liu Jing Zhang Xiang Liao
Institute of Intelligent Information Processing / College of Computer Science and Information Techno Key Lab of Audio-Visual Material Examination Guizhou Public Security Department Guiyang 550001, Chin
国际会议
2010 IEEE 10th International Conference on Signal Processing(第十届信号处理国际会议 ICSP 2010)
北京
英文
534-537
2010-08-24(万方平台首次上网日期,不代表论文的发表时间)