MULTI-DOCUMENT SUMMARIZATION SYSTEMS COMPARISON
This paper compared two multi-document summarization systems we developed.One system used hierarchical sentence clustering algorithm to find the important information,while the other system mainly adopted hierarchical Latent Dirichlet Allocation (hLDA) topic model to obtain the sub-topics of multi-document data.Both of the two systems are evaluated and compared on TAC 2010/TAC 2011 data using the ROUGE testing method with same parameters setting.The results have shown that the hLDA system has got some improvement compared with the clustering system.And normally in ROUGE testing,results from non-stopwords are better than those from stopwords.
Multi-document summarization System comparison Hierarchical sentence clustering HLDA
Lei Li Wei Heng Pingan Liu
School of Computer Science and Technology,Beijing University of Posts and Telecommunications,Beijing 100876,China
国际会议
杭州
英文
1891-1895
2012-10-30(万方平台首次上网日期,不代表论文的发表时间)