Speaker Recognition Based on Dynamic MFCC Parameters
Mel-frequency cepstral coefficients (MFCCs) are the most widely used features in speech and speaker recognition. However, the traditional MFCC is very sensitive to noise, which tends to degrade recognition performance drastically because of the mismatch between training and testing conditions. In this paper, we propose a new speaker recognition algorithm based on dynamic MFCC parameters. Because the human auditory system is sensitive to pitch changes in speech, the algorithm combines the speaker information carried by the MFCC with the pitch: it dynamically constructs a set of Mel filters according to the results of pitch detection. These Mel filters are then used to extract the dynamic MFCC parameters, which represent the speaker's identity characteristics and improve the accuracy of speaker recognition. Experimental results show that the method performs well in a real environment and is considerably more robust in noisy environments. Across different signal-to-noise ratio conditions, its recognition rate is on average 5 to 6 percentage points higher than that of the traditional MFCC.
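The idea of a pitch-dependent Mel filter bank can be illustrated with a minimal sketch. The abstract does not say how the detected pitch shapes the filters, so everything specific here is an assumption made for illustration: the autocorrelation pitch detector, the choice to place the lower edge of the filter bank at the detected pitch, the filter and coefficient counts, and all function names (`estimate_pitch`, `mel_filterbank`, `dynamic_mfcc`). It should not be read as the authors' algorithm.

```python
import numpy as np

def hz_to_mel(f):
    # Common Mel-scale formula (O'Shaughnessy form).
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)

def estimate_pitch(frame, sr, fmin=60.0, fmax=400.0):
    """Autocorrelation pitch estimate in Hz (a stand-in pitch detector)."""
    frame = frame - np.mean(frame)
    # Keep non-negative lags only.
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lo, hi = int(sr / fmax), int(sr / fmin)
    lag = lo + int(np.argmax(ac[lo:hi + 1]))
    return sr / lag

def mel_filterbank(n_filters, n_fft, sr, f_low, f_high):
    """Triangular filters equally spaced on the Mel scale in [f_low, f_high]."""
    mel_pts = np.linspace(hz_to_mel(f_low), hz_to_mel(f_high), n_filters + 2)
    hz_pts = mel_to_hz(mel_pts)
    freqs = np.arange(n_fft // 2 + 1) * sr / n_fft
    fb = np.zeros((n_filters, len(freqs)))
    for i in range(n_filters):
        left, center, right = hz_pts[i], hz_pts[i + 1], hz_pts[i + 2]
        rising = (freqs - left) / (center - left)
        falling = (right - freqs) / (right - center)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def dynamic_mfcc(frame, sr, n_filters=24, n_ceps=12):
    """Cepstral coefficients from a filter bank rebuilt for each frame.

    ASSUMPTION: the detected pitch sets the lower edge of the filter bank.
    The source only states that the filters depend on pitch detection.
    """
    pitch = estimate_pitch(frame, sr)
    n_fft = len(frame)
    power = np.abs(np.fft.rfft(frame * np.hamming(n_fft))) ** 2
    fb = mel_filterbank(n_filters, n_fft, sr, f_low=pitch, f_high=sr / 2)
    log_energy = np.log(fb @ power + 1e-10)
    # DCT-II of the log filter-bank energies, skipping the 0th coefficient.
    n = np.arange(n_filters)
    k = np.arange(1, n_ceps + 1)[:, None]
    dct = np.cos(np.pi * k * (2 * n + 1) / (2 * n_filters))
    return pitch, dct @ log_energy
```

In this sketch, calling `dynamic_mfcc(frame, sr)` on one voiced frame returns the estimated pitch and a vector of `n_ceps` coefficients. A practical system would also need a voiced/unvoiced decision and a fallback filter bank for frames without a reliable pitch, which the sketch leaves out.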
speaker recognition; Mel-frequency cepstral coefficients; pitch detection; feature extraction
Wang Yutai, Li Bo, Jiang Xiaoqing, Liu Feng, Wang Lihao
School of Information Science and Engineering, University of Jinan, Jinan 250022, China
International conference
2009 International Conference on Image Analysis and Signal Processing
Taizhou, Zhejiang
English
406-409
2009-04-11 (date first posted online on the Wanfang platform; not the paper's publication date)