AUTOMATIC EMOTION RECOGNITION IN SPEECH SIGNAL USING TEAGER ENERGY OPERATOR AND MFCC FEATURES

摘要：

A new approach to feature extraction for the automatic emotion classification in speech signals was described and tested in this work.The method was based on the Teager energy operator combined with mel-frequency cepstral coefficients (TEO-MFCC).The proposed TEO-MFCC method was tested using speech recordings collected from the Speech Under Simulated and Actual Stress (SUSAS) database with three simulated emotions (angry, neutral and soft).The Gaussian mixture model (GMM) was used as classifier. The average classification accuracy for three emotions reached up to 73％, much higher than purely guess (33％ for three emotions).Especially, the system showed good performance on the classification of emotion angry.The correct recognition rate for emotion angry was 83％, while it was only 59％ for emotion neutral and 77％ for emotion soft.

关键词： Automatic emotion recognition Teager energy operator Mel-frequency cesptral coefficient Gaussian mixture models

作者: LING HE MARGARET LECH NICHOLAS ALLEN

作者单位: Department of Medical Informatics and Engineering,Sichuan University,China School of Electrical and Computer Engineering,RMIT University,Australia Faculty of Medicine,Dentistry and Health Sciences,The University of Melbourne,Australia

会议类型: 国际会议

会议名称: 2011 3rd International Conference on Computer Technology and Development(2011第三届计算机技术与发展国际会议 ICCTD2011)

会议地点: 成都

会议语种:英文

页码: 2313-2317

在线出版日期: 2011-11-25（万方平台首次上网日期，不代表论文的发表时间）

会议专题

AUTOMATIC EMOTION RECOGNITION IN SPEECH SIGNAL USING TEAGER ENERGY OPERATOR AND MFCC FEATURES