Research of Web Information Mining by using Crawler Techniques

摘要：

As the Internet rapidly becomes one of the most important information medium,web information mining has been the focus of several recent research projects and papers. There are massive documents in certain formats on the Internet while web crawlers building up with millions of computers scratch the Internet pages every second.Why not combine these two efficiently?This paper describe a new thought that mining web information by using crawler techniques.After explain the basic principle of crawler techniques,we present the architecture of the new web information mining system.For the initial test, the system is applied to mine certain standard formatted documents;the experimental data is reported in section IV.By the analysis of the result,we can approve that the system shows high efficiency,flexibility and low cost by using crawler techniques.

作者: Qing-Cheng Li Shan Lin Zhen-hua Dong

作者单位: Department of Information Technical Science,Nankai University, Tianjin,300072,China

会议类型: 国际会议

会议名称: 2008 IEEE International Conference on Onformation and Automation(IEEE 信息与自动化国际会议)

会议地点: 张家界

会议语种:英文

页码: 1603-1607

在线出版日期: 2008-06-20（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Research of Web Information Mining by using Crawler Techniques