Data Preprocessing in Web Text Mining
the development of highly efficient and effective search engines is accelerated by the abundant WWW information and peoples need for high quality information.Web text mining is one of the key techniques for search engines.But Web data is much complex which enlarges the difficulty in web text mining.To get good mining results, Web page preprocessing is necessary before any text mining starting.Here given the pages set collected from the Robot of search engines, we discussed some essential work to present pages in vectors, such as the term selection, weights presentation, etc.The purpose is to make preparation for the following Web text mining task.
data preprocessing Web text mining search engine
Jiang Yongbo Zhang Ruili
Business School Qingdao Technological University Qingdao,China Teaching and Research Section of Computational Engineering Navy Aircraft Engineering Institute Qingd
国际会议
成都
英文
643-647
2011-07-15(万方平台首次上网日期,不代表论文的发表时间)