会议专题

Multi-factor Matching Method for Basic Information of Science and Technology Experts Based on Web Mining

The accuracy rate of information extracting by Web mining is not high because of the diversity and complexity of Web page. In order to increase the accuracy rate of information extracting by Web mining for building the science and technology basic information system, a novel multi-factor matching is proposed in this paper. The proposed method integrates the position of every word among the keywords corpus in normalized text and the multifactor matching method between keywords corpus and normalized text which extracted from Web page by URL. The extracted results include the name, sex, birth, hometown and professional title of science and technology experts respectively. Experiments show that the accuracy rates obtain 95.64 percent and the recall rates achieve 99.69 percent respectively. The results show as by proposed method can satisfied the application requirements.

science and technology experts keywords corpus normalized text multi-factor matching Web mining

Pei Zhou Quanyin Zhu

Faculty of Computer Engineering Huaiyin Institute of Technology Huaian, Jiangsu Province, China

国际会议

2012 IEEE 3rd International Conference on Software Engineering and Service Science(第三届IEEE软件工程与服务科学国际会议 ICSESS2010)

北京

英文

718-720

2012-06-22(万方平台首次上网日期,不代表论文的发表时间)