SUB-COMPONENT OPTIMIZATION FOR WEB-DIST MEASUREMENT
In recent years, machine learning techniques are taken into account in more and more web-based systems in order to design intelligent mechanisms for organizing, indexing, and retrieving web content, and it is necessary for researches and applications to calculate persuasive distance of web pages.General methodologies are fit for extracting the differences between HTML documents of web pages; however, it cannot be used to tell the actual distance, between the content of web pages and the facade displayed in internet explorers.Previously, content distance, style distance, and hybrid distance have been proposed to make measurement result more practical. In this paper, in order to make more effective description on web-dist functions, a sub-component based optimization methodology is proposed, and the efficiency will be proved through some practical applications.
Web mining optimization distance function web page
Q.P.ZHANG L.L.LAI
Energy Systems Group, City University, London, United Kingdom
国际会议
2006 International Conference on Machine Learning and Cybernetics(IEEE第五届机器学习与控制论坛)
大连
英文
4069-4074
2006-08-13(万方平台首次上网日期,不代表论文的发表时间)