Chinese Word Similarity Computing Based on Combination Strategy
Chinese word similarity computing is a fundamental task for natural language processing.This paper presents a method to calculate the similarity between Chinese words based on combination strategy.We apply Baidubaike to train Word2Vector model,and then integrate different methods,Dictionary-based method,Word2Vector-based method and Chinese FrameNet(CFN)-based method,to calculate the semantic similarity between Chinese words.The semantic Dictionary-based method includes dictionaries such as HowNet,DaCilin,Tongyici Cilin(Extended)and Antonym.The experiments are performed on 500 pairs of words and the Spearman correlation coefficient of test data is 0.524,which shows that the proposed method is feasible and effective.
Chinese Word Similarity Computing Combination Strategy Semantic Dictionary Word2Vector Chinese FrameNet
Shaoru GUO Yong GUAN Ru LI Qi ZHANG
School of Computer and Information Technology,Shanxi University,Taiyuan,China School of Computer and Information Technology,Shanxi University,Taiyuan,China;Key Laboratory of Mini College of Mathematics and Computer Science,Fuzhou University,Fujian
国际会议
第五届自然语言处理与中文计算会议(NLPCC-ICCPOL2016)
昆明
英文
1-9
2016-12-02(万方平台首次上网日期,不代表论文的发表时间)