An Unsupervised Approach for Constructing Word Similarity Network

摘要：

　　To evaluate how much a pair of entities or documents are similar is a common problem for current applications.Most approaches for this problem are based on the co-occurrence.However, different terms or words may represent the same entity or similar semantic in the real world since a concept often has more than one way of expression.Existing works always focus on computing semantic relatedness of words.But relatedness cannot reflect the similarity most of the time; on the other hand, most of their corpus are from common data sources such as Wikipedia and are not useful for the specialized vocabulary.In this paper, we propose a novel unsupervised approach for evaluating the semantic similarity between words by mapping texts to vector space and computing prior information.In our approach, we construct a model that can identify the words representing the same entity in special context even though they dont belong to the same concept.At last, we construct a network of words in which paths between words can reflect the evolution process of concepts.Our experimental results show that that our approach gives an effective solution to discover the semantic relationship between words, especially for words in specialty domains.

关键词： semantic similarity text processing text mapping similarity network

作者: Yu Hu Tiezheng Nie Derong Shen Yue Kou

作者单位: School of Information Science and Engineering Northeastern University Shenyang, China

会议类型: 国际会议

会议名称: The 12th Web Information System and Application Conference第十二届全国Web信息系统及其应用学术会议(WISA2015)、全国第十次语义Web 与本体论学术研讨会(SWON2015)、全国第九次电子政务技术及应用学术研讨会(EGTA2015)

会议地点: 济南

会议语种:英文

页码: 265-268

在线出版日期: 2015-09-11（万方平台首次上网日期，不代表论文的发表时间）

会议专题

An Unsupervised Approach for Constructing Word Similarity Network