ENGLISH-CHINESE OOV TRANSLATION BASED ON PAT-TREE
In the Cross-Language Information Retrieval (CLIR) process, the translation of Out-Of-Vocabulary (OOV), or unknown, words is a significant and challenging issue. For English-Chinese OOV translation in particular, OOV term detection and translation-pair extraction remain key problems. In this paper, an English-Chinese OOV translation pattern based on PAT-Tree is proposed. Web mining is used as the corpus source for collecting translation pairs, and translation candidates are acquired by PAT-Tree-based Chinese OOV term extraction. The experimental results show that the proposed approach can outperform some current translation engines and is especially efficient for English-Chinese OOV translation.
Cross-Language Information Retrieval (CLIR); English-Chinese OOV translation; Out-of-Vocabulary (OOV) term extraction; PAT-Tree
YANG WANG YUE-JIE ZHANG TAO ZHANG
School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan Uni School of Information Management and Engineering, Shanghai University of Finance and Economics, Shan
International conference
2009 International Conference on Machine Learning and Cybernetics
Baoding
English
1732-1736
2009-07-12 (date first posted online on the Wanfang platform; does not represent the paper's publication date)