Conference topic

Segmentation and Tagging of Oracle Inscriptions Based on Lucene and Dictionary

Segmentation and part-of-speech tagging of Oracle inscriptions are prerequisites for building an Oracle corpus and for computer-aided textual research and explication of Oracle inscriptions. For segmentation, this paper proposes a positive match cut algorithm that uses a Lucene-based language analyzer supplemented with an Oracle dictionary. The words segmented by the algorithm are then tagged. Experiments show an accuracy of more than 90%.
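
The abstract does not spell out the algorithm. The sketch below is one plausible reading, assuming that "positive match cut" refers to forward maximum matching: scan left to right and, at each position, take the longest dictionary entry. It is plain Python and does not use Lucene. The function names, the toy dictionary, the tag labels, and the "UNK" fallback are all hypothetical and are not taken from the paper.

```python
def forward_max_match(text, dictionary, max_len=None):
    """Greedy left-to-right segmentation (assumed reading of 'positive match cut').

    At each position the longest substring found in `dictionary` is taken;
    if nothing matches, a single character is emitted as its own token.
    """
    if max_len is None:
        # Longest dictionary entry bounds the window; 1 if the dictionary is empty.
        max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the widest window first, then shrink it one character at a time.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


def tag(tokens, pos_lexicon, unknown="UNK"):
    """Look up each token's tag in a lexicon; unseen tokens get `unknown`."""
    return [(token, pos_lexicon.get(token, unknown)) for token in tokens]


# Toy data in Latin letters, purely illustrative (not real Oracle entries).
toy_dictionary = {"ab", "abc", "cd"}
toy_lexicon = {"abc": "N"}

tokens = forward_max_match("abcd", toy_dictionary)
tagged = tag(tokens, toy_lexicon)
```

In this toy run the longest match "abc" is preferred over "ab", so the trailing "d" is left as a single-character token and receives the fallback tag. This illustrates the known weakness of greedy matching that a dictionary-supplemented analyzer has to cope with.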

Oracle inscriptions; segmentation; Lucene

KAI Jin-yu, LI Na, LIU Yong-ge

School of Computer and Information Engineering, Anyang Normal University; Oracle Information Processing Key Laboratory, Anyang, China

International conference

2010 IEEE International Conference on Multimedia Information Networking and Security

Nanjing

English

608-610

2010-11-01 (date first posted online on the Wanfang platform; not necessarily the paper's publication date)