Audio-Visual Emotion Recognition Based on a DBN Model with Constrained Asynchrony

Abstract:

This paper presents an audio-visual multi-stream DBN model (Asy_DBN) for emotion recognition with constrained asynchrony, in which the audio and visual states transition independently within their own streams, but the transitions are constrained by a maximum allowed audio-visual asynchrony. Emotion recognition experiments with Asy_DBN under different asynchrony constraints are carried out on an audio-visual speech database of four emotions and compared with the single-stream HMM, the state-synchronous HMM (Syn_HMM), the state-synchronous DBN model, and the state-asynchronous DBN model without an asynchrony constraint. Results show that, with an appropriate maximum asynchrony constraint between the audio and visual streams, the proposed audio-visual asynchronous DBN model achieves the highest emotion recognition performance, with an improvement of 15% over Syn_HMM.

Keywords: audio visual multi-stream asynchronous DBN model

Authors: Danqi Chen, Dongmei Jiang, Ilse Ravyse, Hichem Sahli

Affiliations: VUB-NPU Joint Research Group on AVSP, Northwestern Polytechnic University, Xian, China, Shaanxi Provi; VUB-NPU Joint Research Group on AVSP, Vrije Universiteit Brussel (VUB) - Interdisciplinary Institute

Conference type: International conference

Conference name: The Fifth International Conference on Image and Graphics (ICIG 2009)

Conference location: Xi'an

Conference language: English

Pages: 912-916

Online publication date: 2009-09-20 (date first posted on the Wanfang platform; not the paper's publication date)
