Improved Viterbi Algorithm-Based HMM2 for Chinese Words Segmentation
In order to solve problems caused by the individualism of Chinese architecture more and more researchers focus on Hybrid and improved Hidden Markov Model. However, as the foundation of Chinese natural language processing, studies on Chinese words segmentation based on Secondorder Hidden Markov Model (HMM2) are not abundant. A words frequency weighted smoothing method and a Threshold-Viterbi algorithm are proposed and combined to build a Improved Viterbi Algorithm-based HHM2(IV-HMM2) model in this article to overcome the sparse problem and improve the accuracy. Experimental rusults demonstrate that the improved model has better performance and lower overhead than traditional HMM2.
Threshold-Viterbi Algorithm IV-HMM2 Chinese words segmentation
Lei La Qiao Guo Dequan Yang Qimin Cao
School of Automation Beijing Institute of Technology Beijing, China
国际会议
杭州
英文
266-269
2012-03-23(万方平台首次上网日期,不代表论文的发表时间)