Conference Topic

A Novel Spectro-Temporal Feature Extraction Method for Phoneme Classification

In this paper, we propose a new feature extraction method inspired by a model of auditory cortical processing. The output of the cortical model is a 4-D spectro-temporal representation of the sound, in which each point indicates the amount of energy at the corresponding time, frequency, rate and scale. In the proposed model, one suitable rate and one suitable scale are selected from the available rates and scales, which reduces the output of the cortical model from a 4-D space to a 2-D space. In most ASR systems, an HMM classifier is used to handle the variable length of phonemes after a framing procedure; this framing affects the feature extraction stage and degrades the temporal information of the phoneme signal at the feature level. In the proposed model, this problem is handled in the feature extraction stage: fixed-length features are obtained for each phoneme by analyzing the spectro-temporal space. Since the resulting feature has a fixed dimension, we use a classical classifier, a support vector machine (SVM), for the phoneme classification task. To evaluate the proposed model, we performed phoneme classification on seven subsets of the TIMIT corpus. The results on consonants and vowels showed average performance improvements of 5.15% and 9.65%, respectively, relative to the HMM-MFCC + MFCC approach. In addition, the average improvements are 8.7% and 2.68%, respectively, relative to the SVM-MFCC approach.
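The reduction from a 4-D cortical output to a fixed-length feature vector can be illustrated with a minimal NumPy sketch. The abstract does not say how the rate and scale are chosen or how the fixed-length features are computed, so both rules below are assumptions made only for illustration: the (rate, scale) pair with the highest total energy is selected, and the time axis is averaged over a fixed number of segments. The function names, array layout and `n_segments` parameter are hypothetical.

```python
import numpy as np

def select_rate_scale(cortical):
    """Pick one (rate, scale) index pair from a 4-D cortical output.

    cortical: array of shape (time, freq, rate, scale) holding energy values.
    Assumption: the pair with the largest total energy is used; the paper's
    actual selection criterion is not given in the abstract.
    """
    energy = cortical.sum(axis=(0, 1))  # total energy per (rate, scale)
    r, s = np.unravel_index(np.argmax(energy), energy.shape)
    return int(r), int(s)

def fixed_length_features(cortical, n_segments=5):
    """Map a variable-length phoneme to a fixed-length vector.

    Assumption: the time axis is split into n_segments nearly equal parts
    and each part is averaged over time, giving n_segments * n_freq values.
    """
    if cortical.shape[0] < n_segments:
        raise ValueError("phoneme has fewer frames than n_segments")
    r, s = select_rate_scale(cortical)
    tf = cortical[:, :, r, s]  # 2-D (time, freq) slice of the 4-D space
    chunks = np.array_split(tf, n_segments, axis=0)
    return np.concatenate([c.mean(axis=0) for c in chunks])

# Two phonemes of different durations yield vectors of the same length,
# so a fixed-dimension classifier such as an SVM could be applied directly.
rng = np.random.default_rng(0)
short_phoneme = rng.random((12, 16, 4, 3))
long_phoneme = rng.random((30, 16, 4, 3))
print(fixed_length_features(short_phoneme).shape)
print(fixed_length_features(long_phoneme).shape)
```

Under these assumptions, segment averaging keeps a coarse temporal ordering within each phoneme, which is the kind of information the abstract says is lost when framing is followed by an HMM.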

auditory model; spectro-temporal analysis; feature extraction; phoneme classification

Mehdi Fartash; Saeed Setayeshi; Farbod Razzazi

Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran; Department of Medical Radiation, Amirkabir University of Technology, Tehran, Iran; Department of Electrical Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran

International conference

2010 IEEE 10th International Conference on Signal Processing (ICSP 2010)

Beijing

English

569-572

2010-08-24 (date first posted online on the Wanfang platform; not the paper's publication date)