Improved Rules-based Algorithm for Identification of Chinese Place Name
This paper presents a new method for identifying Chinese place names that improves search accuracy and efficiency. It first preprocesses the text corpus with the Forward Maximum Matching word segmentation algorithm, and then modifies the intermediate result according to rules compiled from the characteristics of place names. A number of tests are run to compare its actual performance with that of the traditional algorithm. The test corpus consists of title texts from several tourism BBS websites. The results show that both search accuracy and recall rate improve greatly once the modifying module is added.
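The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. The segmentation step below is the standard Forward Maximum Matching procedure. The correction step is only a stand-in: the abstract does not list the paper's rules, so the suffix set `PLACE_SUFFIXES`, the function `merge_place_suffix`, the toy dictionary, and the `max_len` value are all hypothetical choices made for illustration.

```python
def forward_maximum_matching(text, dictionary, max_len=4):
    """Segment text by greedily taking the longest dictionary word at each position."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until a dictionary hit;
        # a single character is always accepted as a fallback.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Hypothetical place-name suffix characters (an assumption, not taken from the paper).
PLACE_SUFFIXES = {"省", "市", "县", "镇", "村", "山", "湖"}


def merge_place_suffix(tokens, suffixes=PLACE_SUFFIXES):
    """Illustrative rule: attach a lone suffix character to the preceding token."""
    merged = []
    for token in tokens:
        if token in suffixes and merged:
            merged[-1] += token
        else:
            merged.append(token)
    return merged


# Toy dictionary (an assumption): it contains "太原" but not "太原市".
dictionary = {"我们", "去", "太原", "旅游"}
tokens = forward_maximum_matching("我们去太原市旅游", dictionary)
print(tokens)                      # ['我们', '去', '太原', '市', '旅游']
print(merge_place_suffix(tokens))  # ['我们', '去', '太原市', '旅游']
```

Segmentation alone splits the place name because the dictionary lacks the full form; the rule-based pass then repairs the intermediate result. A single suffix rule like this one would over-merge on real text, which is presumably why the paper compiles a larger rule set from the characteristics of place names.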
Chinese information processing; Identification of Chinese place name; Chinese word segmentation; forward maximum matching algorithm
Rui Li, Liping Qian, Yu Song, Wei Cai
Beijing University of Civil Engineering and Architecture, Beijing, China; School of Civil Engineering and Architecture, Ningbo University of Technology, Ningbo, China
International conference
Taiyuan
English
218-221
2011-02-26 (date first made available online on the Wanfang platform; not the paper's publication date)