Search Engine Design Based on Web Service and Lucene
Web service is selected as the key technology to design a search engine based on B/S architecture. Several spider clients are deployed on many computers to finish crawling web pages in this B/S architecture after all web pages have been analyzed and purified. And some useful web pages are stored into database for index design. These spider clients are controlled by a spider server. Finally some indexes are created according to these web pages by Lucene so that the search engine could offer search services for any web user. Furthermore, the spider clients communicate with the server by Microsoft Message Queue and lots of important data-operated functions are designed as many web methods in web service. Final experiments show that distributed spider system of the search engine is more efficient than only one spider system.
Search engine web service remoting b/s architecture spider lucene
Hongbin Zhang Juefu Liu
School of Information Engineering East China Jiaotong University Nanchang, China
国际会议
2009 WASE International Conference on Information Engineering(2009年国际信息工程会议)(ICIE 2009)
太原
英文
1117-1120
2009-07-10(万方平台首次上网日期,不代表论文的发表时间)