Low-Load Server Crawler: Design and Evaluation
This paper proposes a method of crawling Web servers connected to the Internet without imposing a high processing load. We use the crawler for a field survey of the digital divide, including the ability to connect to the network. Rather than employing a normal Web page crawling algorithm, which usually collects all pages found on the target server, we have developed a server crawling algorithm, which collects only a minimum number of pages from any one server, and have thereby achieved low-load, high-speed crawling of servers.
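The contrast between page crawling and server crawling can be illustrated with a minimal sketch. This record does not give the paper's actual algorithm, so everything here is an assumption made for illustration: the function name `server_crawl`, the `get_links` callback (a stand-in for fetching a page and extracting its links), and the per-server budget `max_pages_per_server`. The sketch only shows the general idea of capping the number of pages taken from each host while still following links to discover new hosts.

```python
from collections import deque
from urllib.parse import urlparse


def server_crawl(seeds, get_links, max_pages_per_server=1):
    """Breadth-first crawl that fetches at most `max_pages_per_server`
    pages from each host (hypothetical budget, not from the paper)."""
    fetched = {}              # host -> number of pages fetched so far
    seen = set(seeds)         # URLs already queued, to avoid duplicates
    queue = deque(seeds)
    visited = []              # URLs actually fetched, in order
    while queue:
        url = queue.popleft()
        host = urlparse(url).netloc
        # Skip URLs without a host, and hosts whose budget is spent,
        # so no further load is placed on an already-sampled server.
        if not host or fetched.get(host, 0) >= max_pages_per_server:
            continue
        fetched[host] = fetched.get(host, 0) + 1
        visited.append(url)
        for link in get_links(url):   # assumed: returns absolute URLs
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return visited, sorted(fetched)


# Toy link graph standing in for real HTTP fetches (illustrative only).
toy_graph = {
    "http://a.example/": ["http://a.example/p1", "http://a.example/p2",
                          "http://b.example/"],
    "http://b.example/": ["http://b.example/x", "http://c.example/"],
    "http://c.example/": [],
}
pages, servers = server_crawl(["http://a.example/"],
                              lambda u: toy_graph.get(u, []))
print(pages)    # one page per host
print(servers)  # hosts discovered
```

With a budget of one page per host, the toy run touches each of the three hosts once and never requests the extra pages on `a.example` or `b.example`, which is the kind of load reduction the abstract describes.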
Global Digital Divide; Server crawler
Katsuko T. Nakahira, Tetsuya Hoshino, Yoshiki Mikami
Nagaoka University of Technology, 1603-1 Kamitomiokamachi, Nagaoka, Niigata, Japan
International conference
The 17th International World Wide Web Conference (WWW08)
Beijing
English
2008-04-21 (date first posted online on the Wanfang platform; does not represent the paper's publication date)