Short Text Similarity Measurement Based on Coupled Semantic Relation and Strong Classification Features
Measuring the similarity between short texts is made difficult by the fact that two texts that are semantically related may not contain any words in common.In this paper,we propose a novel short text similarity measure which aggregates coupled semantic relation(CSR)and strong classification features(SCF)to provide a richer semantic context.On the one hand,CSR considers both intra-relation(i.e.cooccurrence of terms based on the modified weighting strategy)and interrelation(i.e.dependency of terms via paths that connect linking terms)between a pair of terms.On the other hand,Based on SCF for similarity measure is established based on the idea that the more similar two texts are,the more features of strong classification they share.Finally,we combine the above two techniques to address the semantic sparseness of short text.We carry out extensive experiments on real world short texts.The results demonstrate that our method significantly outperforms baseline methods on several evaluation metrics.
Short text Coupled semantic relation Strong classification feature Short text similarity
Huifang Ma Wen Liu Zhixin Li Xianghong Lin
College of Computer Science and Technology,Northwest Normal University,Lanzhou 730000,China;Guangxi College of Computer Science and Technology,Northwest Normal University,Lanzhou 730000,China Guangxi Key Laboratory of Multi-source Information Mining and Security,Guangxi Normal University,Gui
国际会议
澳门
英文
135-147
2019-04-14(万方平台首次上网日期,不代表论文的发表时间)