Speaker Recognition Based on Dynamic MFCC Parameters

摘要：

The Mel-frequency cepstral coefficient is the most widely used feature in speech and speaker recognition. However, the traditional MFCC is very sensitive to noise interference, which tends to drastically degrade the performance of recognition systems because of the mismatches between training and testing. In this paper, we proposed a new speaker recognition algorithm based on the dynamic MFCC parameters. As the human auditory system can sensitively perceive the pitch changes in the speech, the algorithm, which combines the speaker information obtained by the MFCC with the pitch, can dynamically construct a set of Mel-filters according to the results of pitch detection. The Mel-filters are then used to extract the dynamic MFCC parameter, which represents the speakers identity characteristics, and enhance accuracy of speaker recognition. The experimental results show that the method can perform well in a real environment and improve much on robustness in a noisy environment. The recognition rate in different signal-to-noise ratio conditions is obviously excelled to that of traditional MFCC with 5 to 6 percentage points higher on average.

关键词： speaker recognition Mel Frequency Cepstral Coefficients pitch detection feature eztraction

作者: Wang Yutai Li Bo Jiang Xiaoqing Liu Feng Wang Lihao

作者单位: School of Information Science and Engineering, University of Jinan Jinan 250022, China

会议类型: 国际会议

会议名称: 2009图像分析与信号处理国际会议(2009 International Conference on Image Analysis and Signal Processing)

会议地点: 浙江台州

会议语种:英文

页码: 406-409

在线出版日期: 2009-04-11（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Speaker Recognition Based on Dynamic MFCC Parameters