会议专题

ANALYSIS OF HE(和) IN PATENT CHINESE-ENGLISH MACHINE TRANSLATION

  In machine translation,He(和) mainly acts as noun,preposition and conjunction for patent text.Different parts of speech have different ways to translate in machine translatioh system,especially when it acts as conjunction which refers to coordinate structure.In traditional linguistic,there are papers researched He(和)but only concerned on how to distinguish the preposition and conjunction of it.In papers that studied coordinate structure,researches of He(和) are not systemic.In this paper,we worked out rules and strategy using semantic and grammatical information based on Hierarchical Network of Concepts theory to distinguish the noun,preposition and conjunction of He(和) and identify the beginning and end boundaries of coordinate structures connected by He(和).Compared with Google translator and another rule based translation system we got a relatively satisfied enhancement on the identification precision.

He(和) part of speech identification coordinate structure boundary machine translation patent

Yanli Liu Yaohong Jin

Institute of Chinese Information Processing,CPIC-BNU Joint Laboratory of Machine Translation Beijing Normal University,Beijing,100875,China

国际会议

2012 2nd IEEE International Conference on Cloud Computing and Intelligence Systems (2012年第2届IEEE云计算与智能系统国际会议(IEEE CCIS2012))

杭州

英文

1961-1966

2012-10-30(万方平台首次上网日期,不代表论文的发表时间)