Conference Topic

New Word Detection Algorithm for Chinese Based on Extraction of Local Context Information

Chinese segmentation is an important issue in Chinese text processing. Traditional segmentation methods that depend on an existing dictionary perform poorly when they encounter unknown words. This paper proposes a segmentation algorithm for Chinese based on extracting local context information. It adds the context information of the test text to a local PPM statistical model to guide the detection of new words. The algorithm, which focuses on online segmentation and new word detection, performs well in both closed and open tests, and outperforms some well-known Chinese segmentation systems to a certain extent.

Hua-Lin Zeng Chang-Le Zhou Xiao-Dong Shi Tang-Qiu Li Chang Su

Department of Cognitive Science, Xiamen University, Xiamen 361005, China

International conference

2008 3rd International Conference on Intelligent System and Knowledge Engineering (ISKE 2008)

Xiamen

English

797-801

2008-11-17 (date first made available online on the Wanfang platform; not the paper's publication date)