会议专题

PARSUMIST: A Persian Tezt Summarizer

The rapid growth of online information services causes the problem of information explosion. Automatic text summarization techniques are essential for dealing with this problem. The process of compacting a source document to reduce complexity and length, retaining the most important information is called text summarization. This paper introduces PARSUMIST; a text summarization system for Persian documents. It can generate generic or topic /query-driven extract summaries for single or multiple Persian documents, using a combination of statistical, semantic and heuristic improved methods. In this paper we will first review the related works in this field and especially in Persian text summarization. Then we will present the architecture of PARSUMIST, its components and its features. The last section will evaluate the system and compare it to other existing ones.

Automatic tezt summarization multi document summarization eztraction lezical chains Persian

Mehrnoush SHAMSFARD Tara AKHAVAN Mona ERFANI JOURABCHI

Computer Engineering Dept. Shahid Beheshti University, Tehran, Iran Computer Engineering Dept., Shahid Behehti University, Tehran, Iran

国际会议

International Conference on Natural Language Processing and Knowledge Engineering(IEEE自然语言处理与知识工程国际会议 IEEE NLP-KE 2009)

大连

英文

1-7

2009-09-24(万方平台首次上网日期,不代表论文的发表时间)