Association Analysis and Case Study Framework Based on the Name Distinction

摘要：

The research of distinction of name ambiguity in the field of information retrieval could enhance searching effect. Therefore, it plays an important role to mine the data of name ambiguity in order to obtain useful knowledge. In this paper, we focus on the problem of traditional evaluation and ranking method used in the clustering. Traditional evaluation and ranking method ignores the association among the subinformation and simply considers that pieces of subinformation are mutual independent. We present an effective data mining method framework based on the case study and association analysis. The method framework is evaluated on the dataset of name ambiguity from the database of CDBLP. The dataset includes the information of coauthor name, workplace, publication, years and ranking of the author of papers. The experimental results show that one piece of main sub-information assisted by some minors could form a stronger rule very useful for the distinction of name ambiguity. Also some combinations of pieces of minor sub-information could produce a stronger rule. The association rules selected by the experiment could be easily explained and commonsensible. Considering the association rules coming from the objective data and data mining method, they are more reliable.

关键词： name distinction case study association anaylsis feature extraction data mining

作者: Bo Wu Wandong Cai Yongjun Li

作者单位: Department of Computer Science Northwest Polytechnic University Xi an, Shaanxi, 710129, China

会议类型: 国际会议

会议名称: The 2010 International Conference on Computer Application and System Modeling(2010计算机应用与系统建模国际会议 ICCASM 2010)

会议地点: 太原

会议语种:英文

页码: 285-289

在线出版日期: 2010-10-22（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Association Analysis and Case Study Framework Based on the Name Distinction