会议专题

Hierarchical Multimodal Transformer with Localness and Speaker Aware Attention for Emotion Recognition in Conversations

  Emotion Recognition in Conversations(ERC)aims to pre-dict the emotion of each utterance in a given conversation.Existing approaches for the ERC task mainly suffer from two drawbacks:(1)fail-ing to pay enough attention to the emotional impact of the local context;(2)ignoring the effect of the emotional inertia of speakers.To tackle these limitations,we first propose a Hierarchical Multimodal Transformer as our base model,followed by carefully designing a localness-aware atten-tion mechanism and a speaker-aware attention mechanism to respectively capture the impact of the local context and the emotional inertia.Exten-sive evaluations on a benchmark dataset demonstrate the superiority of our proposed model over existing multimodal methods for ERC.

Multimodal emotion recognition Hierarchical multimodal transformer Local context modeling Emotional inertia

Xiao Jin Jianfei Yu Zixiang Ding Rui Xia Xiangsheng Zhou Yaofeng Tu

Nanjing University of Science and Technology,Nanjing,China ZTE Corporation,Shenzhen,China

国际会议

9th CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC 2020)

郑州

英文

892-904

2020-10-14(万方平台首次上网日期,不代表论文的发表时间)