会议专题

Association Analysis and Case Study Framework Based on the Name Distinction

The research of distinction of name ambiguity in the field of information retrieval could enhance searching effect. Therefore, it plays an important role to mine the data of name ambiguity in order to obtain useful knowledge. In this paper, we focus on the problem of traditional evaluation and ranking method used in the clustering. Traditional evaluation and ranking method ignores the association among the subinformation and simply considers that pieces of subinformation are mutual independent. We present an effective data mining method framework based on the case study and association analysis. The method framework is evaluated on the dataset of name ambiguity from the database of CDBLP. The dataset includes the information of coauthor name, workplace, publication, years and ranking of the author of papers. The experimental results show that one piece of main sub-information assisted by some minors could form a stronger rule very useful for the distinction of name ambiguity. Also some combinations of pieces of minor sub-information could produce a stronger rule. The association rules selected by the experiment could be easily explained and commonsensible. Considering the association rules coming from the objective data and data mining method, they are more reliable.

name distinction case study association anaylsis feature extraction data mining

Bo Wu Wandong Cai Yongjun Li

Department of Computer Science Northwest Polytechnic University Xi an, Shaanxi, 710129, China

国际会议

The 2010 International Conference on Computer Application and System Modeling(2010计算机应用与系统建模国际会议 ICCASM 2010)

太原

英文

285-289

2010-10-22(万方平台首次上网日期,不代表论文的发表时间)