Integrating Topic Model and Heterogeneous Information Network for Aspect Mining with Rating Bias
Recently,there is a surge of research on aspect mining,where the goal is to predict aspect ratings of shops with reviews and overall ratings.Traditional methods assumed that aspect ratings in a specific review text are of the same level,which equal to the corresponding overall rating.However,recent research reveals a different phenomenon: there is an obvious rating bias between aspect ratings and overall ratings.Moreover,these methods usually analyze aspect ratings of reviews with topic models at textual level,while totally ignore potentially structural information among multiple entities(users,shops,reviews),which can be captured by a Heterogeneous Information Network(HIN).In this paper,we present a novel model integrating Topic model and HIN for Aspect Mining with rating bias(called THAM).Firstly,a phrase-level LDA model is designed to extract topic distributions of reviews by using textual information.Secondly,making full use of structural information,we constructs a topic propagation network,and propagate topic distributions in this heterogeneous network.Finally,by setting review as the sharing factor,the two parts are integrated into a uniform optimization framework.Experimental results on two real datasets demonstrate that THAM achieves significant performance improvement,compared to the state of the arts.
Aspect mining Rating bias Topic model Topic propagation network Heterogeneous information network
Yugang Ji Chuan Shi Fuzhen Zhuang Philip S.Yu
Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia,Beijing University Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences(CAS),Institute o University of Illinois at Chicago,Chicago,USA
国际会议
澳门
英文
160-171
2019-04-14(万方平台首次上网日期,不代表论文的发表时间)