会议专题

Cross-Language Information Retrieval Based on Weight Computation of Query Keywords Translation

In cross-language information retrieval (CLIR), the query sentence is often combined with a series of query keywords, rather than a complete natural sentence. Lack of necessary contextual syntactic information in such a query sentence makes it impossible to achieve a unique translation of the query sentence with acceptable precision. In this paper, we convert the translation of query sentence to the weight computation of the translations of the query keyword based on large-scale bilingual parallel corpora, and thereafter reconstruct the query sentence in target language. The experimental results show that the approach achieves an average retrieval accuracy of 93.4% in the front 10 retrieval results and 89.1% in the front 100 retrieval results, while the retrieval error rate is reduced by 63.62% over the purely dictionary-based baseline.

CLIR weight computation query sentencee translation of query keyword

ZHANG Xiao-fei HUANG He-yan ZHANG Ke-liang

Research Center of Computer and Language Information Engineering,Chinese Academy of Sciences,Beijing Center for Computational Linguistics Luoyang University of Foreign Languages Luoyang,China

国际会议

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems(2009 IEEE 智能计算与智能系统国际会议)

上海

英文

2068-2071

2009-11-20(万方平台首次上网日期,不代表论文的发表时间)