A Method to Construct Chinese-Japanese Named Entity Translation Equivalents Using Monolingual Corpora
The traditional method of Named Entity Translation Equivalents extraction is often based on large-scale parallel or comparable corpora.But the practicability of the research results is constrained by the relatively scarce of the bilingual corpus resources.We combined the features of Chinese and Japanese, and proposed a method to automatically extract the Chinese-Japanese NE translation equivalents based on inductive learning from monolingual corpus.This method uses the Chinese Hanzi and Japanese Kanji comparison table to calculate NE instances similarity between Japanese and Chinese.Then, we use inductive learning method to obtain partial translation roles of NEs through extracting the differences between Chinese and Japanese high similarity NE instances.In the end, the feedback process refreshes the Chinese and Japanese NE similarity and translation role sets.Experimental results show that the proposed method is simple and efficient, which overcome the shortcoming that the traditional methods have a dependency on bilingual resource.
named entity translation equivalents Chinese Hanzi and Japanese Kanji comparison table inductive learning method
Kuang Ru Jinan Xu Yujie Zhang Peihao Wu
School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China
国际会议
Second CCF Conference,NLPCC2013(第二届自然语言处理与中文计算会议)
重庆
英文
164-175
2013-11-15(万方平台首次上网日期,不代表论文的发表时间)