The Effects of High Quality Translations of Named Entities in Cross-Language Information Exploration
Named entities (NEs) are expressions in human languages that explicitly link linguistic notations to entities in the real world. They play an important role in cross-language information retrieval (CLIR) because most user requests have been found to contain NEs, and the majority of out-of-vocabulary terms are NEs. Missing their translations therefore has a significant impact on retrieval effectiveness. In this paper, we examine the effect of high-quality NE translations in event-driven information exploration, where NEs are even more common. Focusing on NE translations obtained with information extraction (IE) techniques, we conducted several experiments using TDT test collections. Our results demonstrate that NEs and their translations play critical roles in improving CLIR effectiveness, and that using high-quality NE translations obtained by IE techniques has a positive impact on CLIR.
Named entity; cross-language information exploration
Dan WU, Daqing HE, Heng JI, Ralph GRISHMAN
School of Information Management, Wuhan University, Wuhan, Hubei, China; School of Information Sciences, University of Pittsburgh, Pittsburgh, PA, USA; Computer Science Department, New York University, New York, NY, USA
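International conference
Beijing
English
2008-10-19 (date first made available online on the Wanfang platform; not the paper's publication date)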