会议专题

MULTI-DOCUMENT SUMMARIZATION SYSTEMS COMPARISON

  This paper compared two multi-document summarization systems we developed.One system used hierarchical sentence clustering algorithm to find the important information,while the other system mainly adopted hierarchical Latent Dirichlet Allocation (hLDA) topic model to obtain the sub-topics of multi-document data.Both of the two systems are evaluated and compared on TAC 2010/TAC 2011 data using the ROUGE testing method with same parameters setting.The results have shown that the hLDA system has got some improvement compared with the clustering system.And normally in ROUGE testing,results from non-stopwords are better than those from stopwords.

Multi-document summarization System comparison Hierarchical sentence clustering HLDA

Lei Li Wei Heng Pingan Liu

School of Computer Science and Technology,Beijing University of Posts and Telecommunications,Beijing 100876,China

国际会议

2012 2nd IEEE International Conference on Cloud Computing and Intelligence Systems (2012年第2届IEEE云计算与智能系统国际会议(IEEE CCIS2012))

杭州

英文

1891-1895

2012-10-30(万方平台首次上网日期,不代表论文的发表时间)