Research on Cross Language Text Keyword Extraction Based on Information Entropy and TextRank
To extract keywords from cross-language documents as accurately as possible, especially for languages whose keyword extraction technology is not yet mature, a text keyword extraction method based on information entropy and TextRank is proposed to extract accurate keywords from the translated Chinese documents. The method first determines the basic importance of each word from its information entropy, and then uses these entropy values to vote iteratively through the TextRank algorithm. This addresses a shortcoming of TextRank, which tends to extract frequent non-key words as keywords. The experimental results show that the proposed method extracts keywords more accurately than TextRank when processing cross-lingual bilingual translated documents.
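The abstract does not give the exact formulas, so the following is only a minimal sketch of one way an entropy-seeded TextRank could look. Everything specific here is an assumption rather than the authors' definition: the function names (`entropy_weights`, `cooccurrence_graph`, `entropy_textrank`), the entropy measure (the entropy of a word's occurrences across sentences, turned into a weight `1 - H/Hmax`), the co-occurrence window, and the use of the entropy weights as the teleport term of the TextRank iteration. Input is assumed to be already word-segmented.

```python
import math
from collections import Counter, defaultdict


def entropy_weights(sentences):
    """Assumed weighting: 1 - H/Hmax, where H is the entropy of a word's
    occurrence distribution over sentences. Evenly spread words get ~0."""
    per_sentence = [Counter(s) for s in sentences]
    totals = Counter(w for s in sentences for w in s)
    h_max = math.log(len(sentences)) if len(sentences) > 1 else 1.0
    weights = {}
    for word, total in totals.items():
        h = 0.0
        for counts in per_sentence:
            if counts[word]:
                p = counts[word] / total
                h -= p * math.log(p)
        weights[word] = max(1.0 - h / h_max, 0.0)
    return weights


def cooccurrence_graph(sentences, window=2):
    """Undirected weighted graph linking words that occur within `window`
    positions of each other inside the same sentence."""
    graph = defaultdict(Counter)
    for s in sentences:
        for i, w in enumerate(s):
            for v in s[i + 1 : i + 1 + window]:
                if v != w:
                    graph[w][v] += 1
                    graph[v][w] += 1
    return dict(graph)


def entropy_textrank(sentences, damping=0.85, iterations=50, window=2):
    """TextRank-style iteration whose starting scores and teleport term
    come from the entropy weights (an assumption, not the paper's formula)."""
    weights = entropy_weights(sentences)
    graph = cooccurrence_graph(sentences, window)
    out_sum = {u: sum(nbrs.values()) for u, nbrs in graph.items()}
    total = sum(weights.values())
    # Fall back to a uniform base if every entropy weight is zero.
    base = {w: (x / total if total > 0 else 1.0 / len(weights))
            for w, x in weights.items()}
    scores = dict(base)
    for _ in range(iterations):
        new_scores = {}
        for w in weights:
            # Each neighbour u passes on a share of its score to w.
            vote = sum(scores[u] * graph[u][w] / out_sum[u]
                       for u in graph.get(w, {}))
            new_scores[w] = (1 - damping) * base[w] + damping * vote
        scores = new_scores
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

Calling `entropy_textrank(sentences)` on a list of token lists returns `(word, score)` pairs in descending score order. How strongly the entropy term suppresses frequent non-key words depends on `damping` and on the chosen entropy definition, both of which would need tuning against the paper's actual formulation.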
information entropy; TextRank; keyword extraction; cross-language keyword extraction
Xiaoyu Zhang, Yongbin Wang, Lin Wu
Internet Information Research Institute, Communication University of China, Beijing, China
International conference
Hangzhou
English
1-4
2018-12-03 (date first posted online on the Wanfang platform; does not represent the paper's publication date)