Video Based Emotion Recognition Using CNN and BRNN
Video-based Emotion recognition is a rather challenging computer vision task.It not only needs to model spatial information of each image frame,but also requires considering temporal contextual correlations among sequential frames.For this purpose,we propose a hierarchical deep network architecture to extract high-level spatial-temporal features.In this architecture,two classic deep neural networks,convolutional neutral networks(CNN)and bi-directional recurrent neutral networks(BRNN),are employed to respectively capture facial textural characteristics in spatial domain and dynamic emotion changes in temporal domain.We endeavor to coordinate the two networks by optimizing each of them,so as to boost the performance of the emotion recognition.In the challenging competition,our method achieves a promising performance compared with the baselines.
Convolutional neutral networks (CNN) Bi-directional recurrent neutral networks (BRNN) Emotion recognition
Youyi Cai Wenming Zheng Tong Zhang Qiang Li Zhen Cui Jiayin Ye
Key Laboratory of Child Development and Learning Science,Ministry of Education,Research Center for Learning Science,Southeast University,Nanjing 210096,China
国际会议
第七届全国模式识别学术会议(The 7th Chinese Conference on Pattern Recognition,CCPR2016)
成都
英文
679-691
2016-11-03(万方平台首次上网日期,不代表论文的发表时间)