会议专题

STUDY OF SEGMENT DICTIONARY BASED ON TWO-DIMENSIONAL ARRAY

Chinese word automatic segmentation is the foundation of Chinese Information Processing, and it has widely application in many fields. In this paper, a new dictionary mechanism is presented: According to the Chinese characteristic of the high frequency of one word and two words we put forward such an idea that we can build up index table by the first two words as the keywords, and this index table is a two-dimensional array. This algorithm directly locates data by establishing a corresponding relationship between the first two Chinese characters’ internal code. In this way, we can directly find out the two-word words by using the two-dimensional array. This approach can significantly reduce the times of queries, so as to further accelerate the speed of segmentation.

Segmentation Dictionary Dictionary Mechanism two-dimensional array

Chengcheng Li Hong Wu

School of Computer & Information Engineering, Inner Mongolia Normal University, Hohhot, China

国际会议

2010 3rd IEEE International Conference on Broadband Network & Multimedia Technology(2010年第三届IEEE宽带网络与多媒体国际会议 IC-BNMT 2010)

北京

英文

674-676

2010-10-26(万方平台首次上网日期,不代表论文的发表时间)