会议专题

A Novel Automatic Summarization Method from Chinese Document

As the rapid development of the Web, automatic summarization has become more and more important for handling the huge amount of text information in the Web. This paper proposes an automatic summarization method based on compound-word recognition and keyword extraction, called CASKE. CASKE firstly recognizes the compound-words in a document, labels P.O.S. And revises word segmentation. Then, it extracts keywords, and calculates sentence weights by keyword weights. Finally it selects with proportion the sentences with large weights to construct summary. The generated summary has good continuity and is readable. Experiment results show that the generated summaries are similar with manual reference summaries, achieving 68.31% Precision and 66.72% Recall in average.

automatic summarization compound-word keyword extraction sentence weight natural languange processing

Xing-lin LIU Qian-li MA Qi-lun ZHENG Gu-li LIN

School of Computer Science and Engineering South China Univ. of Tech.Guangzhou, China School of Comp School of Computer Science and Engineering South China Univ. of Tech. Guangzhou, China

国际会议

2011 3rd International Conference on Computer and Automation Engineering(ICCAE 2011)(2011年第三届IEEE计算机与自动化工程国际会议)

重庆

英文

540-544

2011-01-21(万方平台首次上网日期,不代表论文的发表时间)