Vowel Recognition Based on Frequency Ranges Determined by Bandwidth Approach
Automatic speech recognition (ASR) has made great strides with the development of digital signal processing hardware and software, especially for English. This paper presents a new feature extraction method for identifying vowels recorded from 80 Malaysian speakers. The features are obtained from a vocal tract model using a Bandwidth (BW) approach, which identifies frequency bands based on the first peak of the vowel frequency responses. Mean and maximum energies are calculated within these frequency bands. Classification results from the BW approach are compared with those from the first three formant features obtained with the Linear Predictive method. A Multi-Layer Perceptron (MLP) and Multinomial Logistic Regression (MLR) are used to classify the vowels. MLR and MLP show comparable classification rates for the BW approach, 96.40% and 96.59% respectively. With MLP, the BW approach achieves a classification rate 5.49% higher than the 3-formant features.
M.P. Paulraj, S. Yaacob, S.A. Mohd Yusof
Universiti Malaysia Perlis, Malaysia; Universiti Utara Malaysia, Malaysia
International conference
2008 International Conference on Audio, Language and Image Processing
Zhenjiang
English
75-79
2008-07-07 (date first posted online on the Wanfang platform; not the paper's publication date)
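The energy-feature step mentioned in the abstract (mean and maximum energy per frequency band) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say how the band edges are derived from the first spectral peak, so the sketch takes the bands as input. The function names `power_spectrum` and `band_energy_features`, the synthetic two-tone frame, and the band edges are all hypothetical. A naive DFT from the standard library keeps the example self-contained; a real system would likely use an FFT.

```python
import cmath
import math


def power_spectrum(frame, sample_rate):
    """Return (freqs_hz, power) for one frame via a naive O(n^2) DFT."""
    n = len(frame)
    freqs, power = [], []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        freqs.append(k * sample_rate / n)
        power.append(abs(acc) ** 2 / n)
    return freqs, power


def band_energy_features(freqs, power, bands):
    """For each (low_hz, high_hz) band, return (mean_energy, max_energy)."""
    features = []
    for low_hz, high_hz in bands:
        in_band = [p for f, p in zip(freqs, power) if low_hz <= f <= high_hz]
        if not in_band:
            raise ValueError(f"no spectral bins in band {low_hz}-{high_hz} Hz")
        features.append((sum(in_band) / len(in_band), max(in_band)))
    return features


# Synthetic two-tone frame standing in for a vowel recording (assumption).
SAMPLE_RATE = 6400
frame = [math.sin(2 * math.pi * 500 * t / SAMPLE_RATE)
         + 0.5 * math.sin(2 * math.pi * 1500 * t / SAMPLE_RATE)
         for t in range(64)]

# Hypothetical band edges; the paper derives its bands from the first peak.
bands = [(300, 700), (1300, 1700)]

freqs, power = power_spectrum(frame, SAMPLE_RATE)
features = band_energy_features(freqs, power, bands)
```

Flattening `features` gives one vector per frame, which could then be passed to a classifier such as the MLP or MLR named in the abstract.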