A Time-Sensitive Model for Microblog Retrieval
Microblog, as a way of online communication, can generate large amounts of information in a very short period.Therefore, how to retrieve the latest relevant information becomes a hot research area.Different from tra ditional information retrieval (IR), the microblog retrieval emphasizes fresh contents of the information.In order to solve this problem, we extend the tradi tional IR methods by taking into account the posting time.We propose a time sensitive retrieval model, which takes the time factor as a prior probability.In the retrieval model, we introduce the pseudo relevance feedback technology as a query expansion approach to improve retrieval performance.Furthermore, we introduce a strategy to filter the initial retrieval results, which takes post quality factors into account including entropy and link features.Experiments on Twitter corpus show that our algorithm is effective to improve the retrieval perfor mance, and the retrieval results can meet the real time retrieval need well.
Microblog Time-Sensitive Retrieval Model Entropy
Cunhui Shi Bo Xu Hongfei Lin Qing Guo
School of Computer Science and Technology, Dalian University of Technology, Liaoning, Dalian, 116024
国际会议
Second CCF Conference,NLPCC2013(第二届自然语言处理与中文计算会议)
重庆
英文
402-409
2013-11-15(万方平台首次上网日期,不代表论文的发表时间)