会议专题

Modified MFCCs for robust speaker recognition

Mel-scale frequency cepstrum coefficients (MFCCs) arc commonly used featues in speaker recognition systems, but MFCC values are not very robust in the presence of noise, thus, the modified MFCCs (named as SMN-CMN- MFCC) based on the general noisy speech model is proposed in this paper, which uses spectrum mean normalization (SMN) to suppress the additive noise, and uses cepstral mean normalization (CMN) to remove the effect of convolutional noise. Theoretical analyses show that the combination of SMN and CMN can inhibit additive and convolutional noise at the same time. To verify the performance of the SMN-CMN-MFCC, we have conducted some speaker recognition tests. With the same convolutional noise component, the additive white noise experiments and the additive factory noise experiments showthat SMN-CMN-MFCC provides 10.5% and 9.6% relative improvement than the conventional MFCC and AM FCC features, respectively.

Mel-scale Frequency Cepstral Coefficients feature extraction speaker recognition

Wang Hong Pan Jingui Wang Hong

State Key Laboratory for Novel Software Technology,Nanjing University Nanjing, China Institute of Computer Application and Research, Changji University Changji, China

国际会议

2010 IEEE International Conference on Intelligent Computing and Intelligent Systems(2010 IEEE 智能计算与智能系统国际会议 ICIS 2010)

厦门

英文

276-279

2010-10-29(万方平台首次上网日期,不代表论文的发表时间)