会议专题

Research on Cross Language Text Keyword Extraction Based on Information Entropy and TextRank

  In order to extract keywords from crosslanguage documents as accurately as possible,especially for the language whose keyword extraction technology is not mature,a text keyword extraction method based on information entropy and TextRank is proposed to extract the accurate keywords from the translated Chinese documents.This method determines the basic importance of words according to the information entropy of words,and then uses the information entropy of words to vote iteratively through the TextRank algorithm.This method solves the shortcoming that TextRank algorithm easily extracts frequent non key words into keywords.The experimental results show that the proposed method can extract keywords more accurately than TextRank in the processing of cross-lingual bilingual translated documents.

component information entropy TextRank keyword extraction Cross language keyword extraction

Xiaoyu Zhang Yongbin Wang Lin WuName

Internet Information Research Institute Communication University of China Beijing,China

国际会议

The 12th International Symposium Antennas, Propagation, and EM Theory(ISAPE 2018)?(第十二届天线、传播与电磁理论国际学术会议)

杭州

英文

1-4

2018-12-03(万方平台首次上网日期,不代表论文的发表时间)