会议专题

A Speech Recognition Approach with MFCC and Fractal Dimension

A modified approach present here combines Mel-frequency cepstral coefficients (MFCC) and fractal dimension as mixed feature parameter to carry out the speech recognition.Due to the respective advantages on expressing speech signal of MFCC and fractal dimension, fractal dimension denotes the self-similarity, periodicity and randomness of speech signal; meanwhile MFCC feature parameter describes speech nonlinearity. Besides, BP neural network is introduced into the whole speech recognition training procedure. Experimental results demonstrate fractal dimension is able to reflect the speech feature to some extent,makeup the shortcoming of traditional speech feature. The recognition performance is then improved by introducing fractal dimension feature.

Speech Recognition MFCC Fractal Dimension neural network mixed parameter

Minghai Yao Jing Hu

College of Information Engineering, Zhejiang University of Technology Hangzhou, 310032, China

国际会议

2006 International Symposium on Distributed Computing and Applications to Business,Engineering and Science(2006年国际电子、工程及科学领域的分布式计算应用学术研讨会)

杭州

英文

349-351

2006-10-12(万方平台首次上网日期,不代表论文的发表时间)