Conference special topic

Building Corpus with Emoticons for Sentiment Analysis

  A corpus is an essential resource for data-driven natural language processing systems, especially for sentiment analysis. In recent years, people have increasingly used emoticons on social media to express their emotions, attitudes, or preferences. We believe that emoticons are a non-negligible feature for sentiment analysis tasks. However, few existing works have focused on sentiment analysis with emoticons, and few related corpora contain emoticons. In this paper, we create a large-scale Chinese Emoticon Sentiment Corpus of Movies (CESCM). Unlike other corpora, this corpus contains a wide variety of emoticons. In addition, we carry out baseline sentiment analysis experiments on CESCM. Experimental results show that emoticons do play an important role in sentiment analysis. Our goal is to make the corpus widely available, and we believe that it will offer great support to sentiment analysis research and emoticon research.

Emoticon; Sentiment analysis; Corpus

Changliang Li, Yongguan Wang, Changsong Li, Ji Qi, Pengyuan Liu

Kingsoft AI Laboratory, 33, Xiaoying West Road, Beijing 100085, China; Beijing Language and Culture University, 15, Xueyuan Road, Beijing 100083, China; Peking University, 5, Yiheyuan Road, Beijing 100871, China

International conference

2018 International Conference on Natural Language Processing and Chinese Computing (NLPCC 2018)

Hohhot

English

309-318

2018-08-26 (date first made available online on the Wanfang platform; not the paper's publication date)