Research on Cross Language Text Keyword Extraction Based on Information Entropy and TextRank

Abstract:

To extract keywords from cross-language documents as accurately as possible, especially for languages whose keyword extraction technology is not yet mature, a text keyword extraction method based on information entropy and TextRank is proposed for extracting accurate keywords from translated Chinese documents. The method first determines the basic importance of each word from its information entropy, and then uses that entropy when words vote iteratively in the TextRank algorithm. This addresses TextRank's tendency to select frequent non-keywords as keywords. Experimental results show that the proposed method extracts keywords more accurately than TextRank when processing cross-lingual, bilingually translated documents.
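The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that a word's base importance is one minus the normalized entropy of its distribution over a reference corpus, and that each TextRank vote is split among a word's neighbors in proportion to that base importance. The function names `entropy_weights` and `entropy_textrank`, the co-occurrence window, and the damping factor are all hypothetical choices made for illustration.

```python
import math
from collections import Counter, defaultdict


def entropy_weights(corpus):
    """Assumed base importance: 1 - normalized entropy of a word's spread over documents."""
    n_docs = len(corpus)
    counts = defaultdict(Counter)  # word -> {document index: frequency}
    for idx, doc in enumerate(corpus):
        for word in doc:
            counts[word][idx] += 1
    max_h = math.log(n_docs) if n_docs > 1 else 1.0
    weights = {}
    for word, per_doc in counts.items():
        total = sum(per_doc.values())
        h = -sum((c / total) * math.log(c / total) for c in per_doc.values())
        # Words spread evenly over all documents get a weight near 0.
        weights[word] = 1.0 - h / max_h
    return weights


def entropy_textrank(tokens, base, window=2, damping=0.85, iterations=50):
    """Rank words of one document; votes are biased by the entropy-based weights."""
    # Undirected co-occurrence graph over a sliding window.
    neighbors = defaultdict(set)
    for i, w in enumerate(tokens):
        for v in tokens[i + 1:i + 1 + window]:
            if v != w:
                neighbors[w].add(v)
                neighbors[v].add(w)
    eps = 1e-6  # keeps every prior strictly positive
    prior = {w: base.get(w, 0.0) + eps for w in neighbors}
    prior_sum = sum(prior.values())
    scores = {w: 1.0 / len(neighbors) for w in neighbors}
    for _ in range(iterations):
        updated = {}
        for w in neighbors:
            vote = 0.0
            for v in neighbors[w]:
                # v splits its score among its neighbors in proportion to their priors.
                out = sum(prior[u] for u in neighbors[v])
                vote += scores[v] * prior[w] / out
            updated[w] = (1 - damping) * prior[w] / prior_sum + damping * vote
        scores = updated
    return sorted(scores, key=scores.get, reverse=True)


if __name__ == "__main__":
    # Toy data, invented purely for illustration.
    docs = [
        ["the", "entropy", "the", "textrank", "the", "keyword", "the"],
        ["the", "cat", "the", "dog"],
        ["the", "sun", "the", "moon"],
    ]
    print(entropy_textrank(docs[0], entropy_weights(docs)))
```

In this sketch a frequent word that occurs throughout the corpus receives a small prior, so it collects few votes even when it co-occurs with many other words, which is the failure mode of plain TextRank that the abstract describes.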

Keywords: component; information entropy; TextRank; keyword extraction; cross-language keyword extraction

Authors: Xiaoyu Zhang, Yongbin Wang, Lin Wu

Affiliation: Internet Information Research Institute, Communication University of China, Beijing, China

Conference type: International conference

Conference name: The 12th International Symposium on Antennas, Propagation, and EM Theory (ISAPE 2018)

Conference location: Hangzhou

Conference language: English

Pages: 1-4

Online publication date: 2018-12-03 (date first made available online on the Wanfang platform; not the paper's publication date)
