Completely-Arbitrary Passage Retrieval in Language Modeling Approach
Passage retrieval has been expected to be an alternative method to resolve length-normalization problem,since passages have more uniform lengths and topics,than documents.An important issue in the passage retrieval is to determine the type of the passage.Among several different passage types,the arbitrary passage type which dynamically varies according to query has shown the best performance.However,the previous arbitrary passage type is not fully examined,since it still uses the fixed-length restriction such as n consequent words.This paper proposes a new type of passage,namely completely-arbitrary passages by eliminating all possible restrictions of passage on both lengths and starting positions,and by extremely relaxing the type of the original arbitrary passage.The main advantage using completely-arbitrary passages is that the proximity feature of query terms can be well-supported in the passage retrieval,while the non-completely arbitrary passage cannot clearly support.Experimental result extensively shows that the passage retrieval using the completelyarbitrary passage significantly improves the document retrieval,as well as the passage retrieval using previous non-completely arbitrary passages,on six standard TREC test collections,in the context of language modeling approaches.
passage retrieval complete-arbitrary passage language modeling approach
Seung-Hoon Na In-Su Kang Ye-Ha Lee Jong-Hyeok Lee
Department of Compueter Science,POSTECH,AITrc,Republic of Korea Korea lnstitute of Science and Technology lnformation(KISTI),Republicof Korea
国际会议
4th Asia Information Retrieval Symposium(AIRS 2008)(第四届亚洲信息检索研讨会)
哈尔滨
英文
22-33
2008-01-16(万方平台首次上网日期,不代表论文的发表时间)