会议专题

Chinese Lettered-words Extraction for Language Monitoring

Lettered-words are frequently used in Chinese.Lettered-word falls into two parts.They are lettered-word with Chinese characters and letteredword without Chinese characters.Chinese characters in lettered-word have no specialization.When letteredword with Chinese characters are scattered in Chinese texts,it is difficult to recognize the boundaries.As a result,lettered-word with Chinese characters becomes a difficulty for lettered-words identification and extraction.In this paper,a method to extract lettered-words with Chinese characters and lettered-words without Chinese characters separately is proposed for the first time.An experiment on language monitoring of lettered-words using shows that the proposed method achieves a high recall and precision.

Chinese lettered-word lettered-word with Chinese characters lettered-word without Chinese characters extraction

Qiuping WANG

Literature School.Communication University of China Beijing,China

国际会议

2010 4th International Conference on Intelligent Information Techonlogy Application(第四届智能信息技术应用国际学术研讨会 IITA 2010)

秦皇岛

英文

344-347

2010-11-05(万方平台首次上网日期,不代表论文的发表时间)