A Speech Recognition Approach with MFCC and Fractal Dimension

Abstract:

The modified approach presented here combines Mel-frequency cepstral coefficients (MFCC) and fractal dimension into a mixed feature parameter for speech recognition. The two features have complementary strengths in representing the speech signal: fractal dimension captures its self-similarity, periodicity and randomness, while the MFCC feature parameter describes speech nonlinearity. In addition, a BP (back-propagation) neural network is used throughout the speech recognition training procedure. Experimental results demonstrate that fractal dimension reflects speech features to some extent and makes up for the shortcomings of traditional speech features. Recognition performance is thereby improved by introducing the fractal dimension feature.
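
The abstract does not say which fractal dimension estimator the authors used, nor how the mixed parameter is assembled. As a rough illustration only, the sketch below assumes the Higuchi estimator and assumes the mixed feature is formed by appending one fractal dimension value to a per-frame MFCC vector computed elsewhere. The names `higuchi_fd`, `mixed_feature` and the parameter `kmax` are hypothetical and do not come from the paper.

```python
import math


def higuchi_fd(x, kmax=8):
    """Estimate the Higuchi fractal dimension of a 1-D sequence.

    Assumption: the paper's estimator is not named in the abstract;
    Higuchi's method is used here purely as an example.
    """
    n = len(x)
    if kmax < 2 or n < 2 * kmax:
        raise ValueError("need kmax >= 2 and at least 2 * kmax samples")
    log_inv_k, log_len = [], []
    for k in range(1, kmax + 1):
        lengths = []
        for m in range(k):
            # Sub-series x[m], x[m + k], x[m + 2k], ...
            starts = range(m, n - k, k)
            dist = sum(abs(x[i + k] - x[i]) for i in starts)
            # Normalised curve length for this offset and scale.
            lengths.append(dist * (n - 1) / (len(starts) * k) / k)
        mean_len = sum(lengths) / k
        if mean_len <= 0.0:
            # Flat (e.g. silent) frame: treated as a smooth curve here.
            return 1.0
        log_inv_k.append(math.log(1.0 / k))
        log_len.append(math.log(mean_len))
    # Least-squares slope of log L(k) against log(1/k).
    mx = sum(log_inv_k) / kmax
    my = sum(log_len) / kmax
    num = sum((a - mx) * (b - my) for a, b in zip(log_inv_k, log_len))
    den = sum((a - mx) ** 2 for a in log_inv_k)
    return num / den


def mixed_feature(mfcc, frame, kmax=8):
    """Append the frame's fractal dimension to its MFCC vector.

    Assumption: `mfcc` is a sequence of coefficients already computed
    for `frame` by some other routine; simple concatenation is a guess
    at what "mixed feature parameter" means.
    """
    return list(mfcc) + [higuchi_fd(frame, kmax)]
```

Under these assumptions, a smooth ramp yields an estimate close to 1 and a noise-like frame yields an estimate close to 2, so the extra component gives a classifier such as a BP network a single number describing how irregular the frame is.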

Keywords: Speech Recognition; MFCC; Fractal Dimension; neural network; mixed parameter

Authors: Minghai Yao, Jing Hu

Affiliation: College of Information Engineering, Zhejiang University of Technology, Hangzhou, 310032, China

Conference type: International conference

Conference name: 2006 International Symposium on Distributed Computing and Applications to Business, Engineering and Science

Conference location: Hangzhou

Conference language: English

Pages: 349-351

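Online publication date: 2006-10-12 (the date the record first appeared on the Wanfang platform, not necessarily the paper's publication date)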
