An information update method towards internal search engine
To enterprises or other organizations, how to efficiently manage unstructured and semi-structured data on the web becomes an important problem.Internal search engine is well-used to deal with it, but how to efficiently find the latest updates of web sources is still a research issue.In this paper, we proposed a graph-based method to efficiently locate the updated information of an organizations web resources, which is based on modeling an organizations information resources with a graph and marking each web page with a parameter update cycle that represents the possibility of a web page to be updated and is taken as a factor to tune the algorithm of update identification.By this method, the latest updated information can be located in time.The experiments results show the effectiveness of our method.
internal search engine crawling strategies update method
Zhifan Bian Yukun Li Tinghai Yue Pengfei Lei Dexin Zhao Yingyuan Xiao
Tianjin University of Technology, 300384 Key Laboratory of Intelligence Computing and Novel Software Technology, Tianjin, China Key Laboratory of Computer Vision and System, Ministry of Education Tianjin, China
国际会议
济南
英文
211-216
2015-09-11(万方平台首次上网日期,不代表论文的发表时间)