Research of Character Level SMT for Patent
This paper introduces character-level patent translation from Japanese to English. The challenge of this task lies in the segmentation errors caused by the overwhelming number of OOV (out-of-vocabulary) words in patent texts. To address this problem, we introduce character-level SMT. We compare our method with a word-level SMT model and find that the two approaches achieve close BLEU scores.
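As a rough illustration of what character-level input looks like, the sketch below splits a sentence into single-character tokens so that no word segmenter (and hence no segmentation error) is involved. The function name `to_char_tokens` and the sample string are hypothetical; this record does not describe the paper's actual preprocessing, so treat this only as an assumption about the general idea.

```python
def to_char_tokens(sentence: str) -> str:
    """Return the sentence as space-separated single characters.

    Illustrative assumption: every non-whitespace character becomes one
    token, which sidesteps word segmentation entirely.
    """
    return " ".join(ch for ch in sentence if not ch.isspace())


if __name__ == "__main__":
    # Hypothetical sample: a Japanese technical term with no word boundaries.
    print(to_char_tokens("半導体装置"))  # prints: 半 導 体 装 置
```

The resulting token stream could then be fed to a standard SMT toolkit in place of word-segmented text; which toolkit and settings the authors used is not stated here.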
Statistical machine translation; patent translation
Liao Xian-Jun; Yang Mu-Yun
Library of Academy of Equipment Command & Technology, Beijing 101416, China; School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
International conference
2010 International Conference on Circuit and Signal Processing (ICCSP 2010)
Shanghai
English
267-269
2010-12-25 (date first posted online on the Wanfang platform; not the publication date of the paper)