会议专题

An improved maximum match algorithm for Chinese words segmentation

On the basis of careful analysis existing Chinese words segmentation,this paper puts forward an improved maximum match (short for MM) algorithm for Chinese words segmentation. According to the dictionary structure,this algorithm limits match to the scope of some phrases which have the same first word. So it improves the operating efficiency of the algorithm. Whats more,this algorithm changes the subtracting word MM algorithm into adding word MM algorithm. And it avoids the phenomenon of phrase containing phrase and improves the accuracy of words segmentation. From the experimental results,this algorithm is a good improvement in the mechanical words segmentation algorithm.

MM algorithm segmentation words entries improvement accuracy

Hong Zhang Yanhong Ma Pengshou Xie Zhongxian Bao

Lanzhou University of Technology,Lanzhou730050,P.R.China Gansu Electric Power Corporation Wind Power Technology Center,Lanzhou

国际会议

2011 International Conference on Opto-Electronics Engineering and Information Science(2011光电电子工程与信息科学国际会议 ICOEIS 2011)

西安

英文

574-577

2011-12-23(万方平台首次上网日期,不代表论文的发表时间)