Research on the Page Replacement Model in Search Engine Collector
The method of repeat URL filtering for the existing search engine collector is analyzed, and some shortcomings are pointed out.On the base of virtual memory page replacement algorithm in operation system,a page replacement model is introduced the search engine collector.And then the disk page structure and memory page structure are designed respectively.Finally,a fingerprint search algorithm is given.Through the practical application of the models and technologies in our projects,we find they can solve the speed problem for filtering tens of thousands of URL in the small-capacity memory.
search engine collector page-replacement filter finger mark URL
Meiren Zhang Yongfeng Li Yongfeng Li
School of Mathematics and Information engineering,Taizhou University,Linhai,Zhejiang 317000,China School of Computer Science and Techenology,WuHan University of Technology ,Wuhan,Hubei 430070,China
国际会议
大连
英文
1187-1192
2008-07-27(万方平台首次上网日期,不代表论文的发表时间)