会议专题

STUDY ON WEB DATA EXTRACTION

A web data extraction system is provided, which adopting web page comparison and analysis within a website. On the basis of treeing and blocking web pages, the data block of web page is retrieved after compared and analyzed, and then the data is extracted via the comparison and judgement of more than one page of the same structure and format so as to actualize in-depth mining of technical information. The systems architecture and composition, and the process of the system tested on the physical property databases of chemistry are elaborated.

web data eztraction database web data gathering web page comparison and analysis

Wensheng Li Lan Shan Ying Zhao Juhong Zhang

School of Computer Science and Technology, Beijing University of Posts & Telecommunications, Beijing School of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100

国际会议

China-Ireland International Conference on Information and Communications Technologies 2008(2008 中国-爱尔兰信息与通信技术国际会议 CIICT 2008)

北京

英文

1-4

2008-09-26(万方平台首次上网日期,不代表论文的发表时间)