New Word Detection Algorithm for Chinese Based on Extraction of Local Context Information
Chinese segmentation is an important issue in Chinese text processing. Traditional segmentation methods that depend on an existing dictionary suffer when they encounter unknown words. This paper proposes a segmentation algorithm for Chinese based on extracting local context information. It adds the context information of the test text to a local PPM statistical model in order to guide the detection of new words. The algorithm, which focuses on online segmentation and new word detection, achieves good results in both closed and open tests, and outperforms some well-known Chinese segmentation systems to a certain extent.
Hua-Lin Zeng, Chang-Le Zhou, Xiao-Dong Shi, Tang-Qiu Li, Chang Su
Department of Cognitive Science, Xiamen University, Xiamen 361005, China
International conference
Xiamen
English
797-801
2008-11-17 (date first posted online on the Wanfang platform; does not represent the paper's publication date)