A RULE-BASED METHOD FOR COMMAS DISAMBIGUATION IN CHINESE PATENT TEXT
We described a rule-based method for disambiguating Chinese commas in patent text,which will be beneficial to the work on Chinese-English Patent MT.We annotated ten thousand sentences of patent text,and made a number of rules according to the annotated results.Experiments were conducted on 5 intact patent documents containing 1219 commas,and our model achieves an accuracy of over 90% overall.
Rule-based method Commas disambiguation Chinese patent text MT
Qianqian Song Yun Zhu Lixia Wang Yaohong Jin
Institute of Chinese Information Processing;CPIC-BNU Joint Laboratory of Machine Translation Beijing Normal University,Beijing 100875,China
国际会议
杭州
英文
1988-1992
2012-10-30(万方平台首次上网日期,不代表论文的发表时间)