会议专题

A Method to Construct Chinese-Japanese Named Entity Translation Equivalents Using Monolingual Corpora

  The traditional method of Named Entity Translation Equivalents extraction is often based on large-scale parallel or comparable corpora.But the practicability of the research results is constrained by the relatively scarce of the bilingual corpus resources.We combined the features of Chinese and Japanese, and proposed a method to automatically extract the Chinese-Japanese NE translation equivalents based on inductive learning from monolingual corpus.This method uses the Chinese Hanzi and Japanese Kanji comparison table to calculate NE instances similarity between Japanese and Chinese.Then, we use inductive learning method to obtain partial translation roles of NEs through extracting the differences between Chinese and Japanese high similarity NE instances.In the end, the feedback process refreshes the Chinese and Japanese NE similarity and translation role sets.Experimental results show that the proposed method is simple and efficient, which overcome the shortcoming that the traditional methods have a dependency on bilingual resource.

named entity translation equivalents Chinese Hanzi and Japanese Kanji comparison table inductive learning method

Kuang Ru Jinan Xu Yujie Zhang Peihao Wu

School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China

国际会议

Second CCF Conference,NLPCC2013(第二届自然语言处理与中文计算会议)

重庆

英文

164-175

2013-11-15(万方平台首次上网日期,不代表论文的发表时间)