Teager_Mel and PLP Fusion Feature Based Speech Emotion Recognition

摘要：

　　Although a number of features derived from linear speech production theory have been investigated as speech emotion indicators,the recognition accuracy still stays unsatisfactory for realistic applications.In this paper,Teager_Mel,a novel speech emotion feature is proposed based on Teager Energy Operator (TEO) and the Mel perception characteristics.Due to such advantages as nonlinear and simple,TEO appears to be appropriate for speech emotion description.From the auditory psychophysical point of view,Perceptual Linear Predictive (PLP) features are also investigated as an extension to Teager_Mel.A Support Vector Machine (SVM) classifier is then adopted to the fusion of Teager_Mel and PLP features on a Chinese discrete emotional speech corpus (Dis-EC) that includes four emotions: happiness,anger,sorrow and surprise.Comparing with the previous studies based on prosodic features,the application of Teager_Mel features can achieve a recognition accuracy improvement of 10.4%,and similarly 8.2% for PLP features.The recognition accuracy reaches79.7% while using the fusion features,which appears to be the most attractive in relative researches.

关键词： Teager Energy Operator (TEO) Perceptual Linear Predictive (PLP) Feature Fusion Speech Emotion Recognition

作者: Xiao Chen Haifeng Li Lin Ma Xinlei Liu Jing Chen

作者单位: School of Computer Science and Technology Harbin Institute of Technology Harbin, Heilongjiang 150001, China

会议类型: 国际会议

会议名称: 2015 Fifth International Conference on Instrumentation and Measurement,Computer,Communication and Control (IMCCC2015)(第五届仪器测量、计算机通信与控制国际会议)

会议地点: 秦皇岛

会议语种:英文

页码: 1109-1114

在线出版日期: 2015-09-18（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Teager_Mel and PLP Fusion Feature Based Speech Emotion Recognition