Conference topic

Termset-based Indexing and Query Processing in P2P Search

Multi-term queries are a common issue in information retrieval systems. In large-scale P2P information retrieval, indexing and query processing based on single terms incurs a large bandwidth cost. We take the correlation among terms into account and propose a termset-based indexing and query processing method suited to information retrieval in structured P2P overlays. Using statistics, metadata, and query logs, we construct a dynamic termset corpus, and the index is built on termsets. When processing a query, the peer extracts termsets from the query terms, and each termset is treated as a key. Several methods are applied to reduce bandwidth consumption. We also present a query expansion method as a complement when there are insufficient results. Experiments show that our method performs well and is suitable for large-scale distributed information retrieval.
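The query-processing step in the abstract (extract termsets from the query terms, then treat each termset as a key) can be illustrated with a minimal sketch. This is not the authors' algorithm: the greedy largest-first cover, the `max_size` limit, the SHA-1 key derivation, and the names `extract_termsets` and `termset_key` are all assumptions made for illustration, since the record does not describe these details.

```python
import hashlib
from itertools import combinations


def extract_termsets(query_terms, corpus, max_size=3):
    """Greedily cover the query with the largest termsets found in the corpus.

    `corpus` is a set of frozensets (hypothetical termset corpus).
    The greedy strategy is an illustrative assumption, not the paper's method.
    """
    remaining = set(query_terms)
    chosen = []
    for size in range(min(max_size, len(remaining)), 1, -1):
        # Candidates are enumerated once per size; stale ones are skipped below.
        for combo in combinations(sorted(remaining), size):
            candidate = frozenset(combo)
            if candidate in corpus and candidate <= remaining:
                chosen.append(candidate)
                remaining -= candidate
    # Terms not covered by any known termset fall back to single-term keys.
    chosen.extend(frozenset([t]) for t in sorted(remaining))
    return chosen


def termset_key(termset):
    """Derive an order-independent lookup key for a termset (assumed SHA-1)."""
    canonical = " ".join(sorted(termset))
    return hashlib.sha1(canonical.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    corpus = {frozenset({"p2p", "search"}), frozenset({"query", "expansion"})}
    for ts in extract_termsets(["p2p", "search", "index"], corpus):
        print(sorted(ts), termset_key(ts))
```

Under these assumptions, a three-term query that contains one known two-term termset needs two key lookups instead of three, which is the kind of saving a termset-based index aims for.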

P2P IR; multi-term query; termset; query expansion

Wang Zhenhua, Shen Derong, Yu Ge

College of Information Science and Engineering, Northeastern University (NEU), Shenyang, China

International conference

2009 International Forum on Computer Science-Technology and Applications (IFCSTA 2009)

Chongqing

English

1226-1229

2009-12-25 (date first posted online on the Wanfang platform; not the paper's publication date)