Topic Detection Based on Semantics,Time and Social Relationship
Short text sparsity,oral language,and polysemy are the main problems when dealing with social network data,which make the traditional methods hard to obtain the true meaning of social network data.Due to the above issues,topic detection for social network data is not that easy.And to solve the above problems,we propose an original Clustering Algorithm based on Semantics,Time,and Social relationship(CASTS)for topic detection.Firstly,to overcome short text sparsity and polysemy problems,the CASTS leverages the Bidirectional Encoder Representations from Transformers(BERT),which can pre-train on large-scale social network short text data to obtain concise text representation with rich semantics.Secondly,by combining the short text representation,time,and social relationship,the CASTS can efficiently detect topics.Finally,we conduct experiments on Weibo dataset to verify the correctness and effectiveness of CASTS.
Topic detection Semantics Time Social relationship
Pengchao Cheng Junping Du Feifei Kou Zhe Xue Peihua Chen
Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia,School of Computer S Xiaoi Research,Shanghai Xiaoi Robot Technology Co.,Ltd.,Shanghai 201803,China
国际会议
江苏镇江
英文
691-698
2019-09-20(万方平台首次上网日期,不代表论文的发表时间)