A Novel Automatic Summarization Method from Chinese Document
With the rapid development of the Web, automatic summarization has become increasingly important for handling the huge amount of text information on the Web. This paper proposes an automatic summarization method based on compound-word recognition and keyword extraction, called CASKE. CASKE first recognizes the compound words in a document, labels parts of speech (POS), and revises the word segmentation. It then extracts keywords and calculates sentence weights from the keyword weights. Finally, it selects a given proportion of the highest-weighted sentences to construct the summary. The generated summary has good continuity and is readable. Experimental results show that the generated summaries are similar to manual reference summaries, achieving 68.31% precision and 66.72% recall on average.
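The last two stages described in the abstract (sentence weighting from keyword weights, then proportional selection) can be sketched as follows. This is only a minimal illustration, not the CASKE implementation: the abstract does not give the weighting formulas, so the term-frequency keyword weights, the length-normalized sentence weight, and the names `keyword_weights`, `sentence_weight`, `summarize`, `ratio`, and `top_k` are all assumptions. The input is assumed to be already segmented into sentences and tokens; compound-word recognition and POS labeling are not modeled.

```python
import math
from collections import Counter


def keyword_weights(sentences, stopwords=frozenset(), top_k=10):
    """Assumed weighting: raw term frequency of the top_k non-stopword tokens."""
    counts = Counter(
        tok for sent in sentences for tok in sent if tok not in stopwords
    )
    return dict(counts.most_common(top_k))


def sentence_weight(sentence, weights):
    """Assumed formula: sum of keyword weights, normalized by sentence length."""
    if not sentence:
        return 0.0
    return sum(weights.get(tok, 0) for tok in sentence) / len(sentence)


def summarize(sentences, ratio=0.3, stopwords=frozenset(), top_k=10):
    """Keep the given proportion of highest-weighted sentences, in document order."""
    if not sentences:
        return []
    weights = keyword_weights(sentences, stopwords, top_k)
    n_keep = max(1, math.ceil(len(sentences) * ratio))
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: sentence_weight(sentences[i], weights),
        reverse=True,
    )
    # Restore original order so the summary reads continuously.
    return [sentences[i] for i in sorted(ranked[:n_keep])]


# Hypothetical pre-segmented document (tokens are illustrative only).
doc = [
    ["summary", "keyword", "weight"],
    ["weather", "nice", "today"],
    ["keyword", "weight", "sentence", "summary"],
    ["lunch", "was", "late", "again"],
]
summary = summarize(doc, ratio=0.5, top_k=3)
```

Returning the selected sentences in their original order, rather than in rank order, is one simple way to keep the continuity the abstract mentions.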
automatic summarization; compound-word; keyword extraction; sentence weight; natural language processing
Xing-lin LIU, Qian-li MA, Qi-lun ZHENG, Gu-li LIN
School of Computer Science and Engineering, South China Univ. of Tech., Guangzhou, China
International conference
Chongqing
English
540-544
2011-01-21 (date first posted online on the Wanfang platform; not the paper's publication date)