会议专题

Improved Query Language Model Generation Using Mixture Models

This paper presents a novel method of using mixture models with document segmentation to improve the performance of pseudo relevance feedback for document retrieval.Pseudo-relevance feedback in information retrieval traditionally selects a number of topranked documents,from which terms highly related to the initial query are selected to form a new query model for query expansion.However,the query expansion often causes query shift since not all the contents within the document are coherently relevant.In this paper,we propose to build mixture models by combining the original query information with the expanded query language model using aspect model.The experimental results have shown that our proposed methods significantly outperform a number of strong baselines on several large scale TREC collections.

Qiang Huang Yong Huang HongYan Yi

School of Computing Sciences University of East Anglia Norwich,UK School of Information Engineering HeNan Institute of Science and Technology XinXiang,China

国际会议

2010 4th International Conference on Intelligent Information Techonlogy Application(第四届智能信息技术应用国际学术研讨会 IITA 2010)

秦皇岛

英文

8-11

2010-11-05(万方平台首次上网日期,不代表论文的发表时间)