会议专题

Research of Web Information Mining by using Crawler Techniques

As the Internet rapidly becomes one of the most important information medium,web information mining has been the focus of several recent research projects and papers. There are massive documents in certain formats on the Internet while web crawlers building up with millions of computers scratch the Internet pages every second.Why not combine these two efficiently?This paper describe a new thought that mining web information by using crawler techniques.After explain the basic principle of crawler techniques,we present the architecture of the new web information mining system.For the initial test, the system is applied to mine certain standard formatted documents;the experimental data is reported in section IV.By the analysis of the result,we can approve that the system shows high efficiency,flexibility and low cost by using crawler techniques.

Qing-Cheng Li Shan Lin Zhen-hua Dong

Department of Information Technical Science,Nankai University, Tianjin,300072,China

国际会议

2008 IEEE International Conference on Onformation and Automation(IEEE 信息与自动化国际会议)

张家界

英文

1603-1607

2008-06-20(万方平台首次上网日期,不代表论文的发表时间)