Chinese Named Entity Recognition with New Contextual Features
Chinese Named Entity Recognition (NER) is studied in two directions: inner structure and outer surroundings. Inner structural analyses induce constitutions of person, location and organization name from the point of linguistics. However inner structural rules for named entities only provide necessary conditions for a sequence of Chinese characters being an entity name but not sufficient. Whether a string being a proper name or not is also determined by its contextual information or sometimes common sense. We build Chinese NER system based on supervised machine learning using features induced from simple inner structure and contextual information. We compare some NER approaches. The experimental results indicate complicated cases of various NER strategies. Then this paper turns to explore contextual features of named entities on large scale corpus, seeking for contextual evidence for different strategies of NER and mark words giving clues to the occurrence of NE. Finally we apply some conclusions to improve NER system by enriching features in model and enhance the performance distinctly.
Named entity recognition contextual features recognition model conditional random fields
Ying QIN Taozheng ZHANG Xiaojie WANG
National Research Center for Foreign Language Education,Beijing Foreign Studies University,Beijing,C Beijing University of Posts and Telecommunications,Beijing,China
国际会议
北京
英文
2008-10-19(万方平台首次上网日期,不代表论文的发表时间)