会议专题

Constructing Corpus for Query-oriented XML Text Summarization

XML Retrieval is becoming the focus study of the field of Information Retrieval and Database. Summarization of the results which come from the XML search engines will alleviate the read burden of users. However, as the basis of this study, the construction of the query-oriented XML text summarization corpus has not yet received enough attention. In this paper, we introduce our works on constructing this kind of corpus, including the selection of topics and XML elements/documents, construction process and the feature of the constructed corpus. Up to now, the corpus has 25 English query topics, including 422 elements for summarization, and 32 Chinese topics which including 402 elements. For each topic, 4 pieces of extracted summaries and 4 pieces of generated summaries are made manually by 4 experts.

Query-oriented XML Automatic summarization Corpus

Shihan WU Dexi LIU Xianpei JIAO

Jiangxi Key Laboratory of Data and Knowledge Engineering School of Information Technology, Jiangxi University of Finance & Economics Nanchang Jiangxi, China

国际会议

2010 International Conference on Management of e-Commerce and e-Government(第四届电子商务与电子政务管理国际会议 ICMeCG 2010)

成都

英文

45-49

2010-10-23(万方平台首次上网日期,不代表论文的发表时间)