Chinese Sentiment Analysis Exploiting Heterogeneous Segmentations
Chinese is a character-based language with no explicit separators between words, unlike English. Traditionally, word segmentation is performed to convert Chinese sentences into word sequences, so that the same framework used for English sentiment analysis can be applied to Chinese. Such work uses a specific word segmentor as a prerequisite step, yet ignores the fact that different segmentation styles exist in Chinese word segmentation, such as CTB, PKU, and MSR. In this paper, we study the influence of these heterogeneous segmentations on Chinese sentiment analysis, and then integrate them using both discrete and neural models. Experimental results show that different segmentations do affect final performance, and that the integrated models achieve better performance.
Sentiment Analysis; Heterogeneous Segmentations; Neural Network
Da Pan, Meishan Zhang, Guohong Fu
School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
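Domestic conference
The 15th China National Conference on Computational Linguistics (CCL2016) and the 4th International Symposium on Natural Language Processing Based on Naturally Annotated Big Data (NLP-NABD-2016)
Yantai
English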
1-12
2016-10-14 (date first posted online on the Wanfang platform; this is not the paper's publication date)