会议专题

Low-Load Server Crawler: Design and Evaluation

This paper proposes a method of crawling Web servers connected to the Internet without imposing a high processing load. We are using the crawler for a field survey of the digital divide, including the ability to connect to the network. Rather than employing normal Web page crawling algorithm, which usually collect all pages found on the target server, we have developed server crawling algorithm, which collect only minimum pages from the same server and achieved low-load and high-speed crawling of servers.

Global Digital Divide Server crawler

Katsuko T. Nakahira Tetsuya Hoshino Yoshiki Mikami

Nagaoka University of Technology 16031 Kamitomiokamachi,Nagaoka Niigata, Japan Nagaoka University ofTechnology 16031 Kamitomiokamachi,Nagaoka Niigata, Japan

国际会议

第十七届国际万维网大会(the 17th International World Wide Web Conference)(WWW08)

北京

英文

2008-04-21(万方平台首次上网日期,不代表论文的发表时间)