Association Analysis and Case Study Framework Based on the Name Distinction
The research of distinction of name ambiguity in the field of information retrieval could enhance searching effect. Therefore, it plays an important role to mine the data of name ambiguity in order to obtain useful knowledge. In this paper, we focus on the problem of traditional evaluation and ranking method used in the clustering. Traditional evaluation and ranking method ignores the association among the subinformation and simply considers that pieces of subinformation are mutual independent. We present an effective data mining method framework based on the case study and association analysis. The method framework is evaluated on the dataset of name ambiguity from the database of CDBLP. The dataset includes the information of coauthor name, workplace, publication, years and ranking of the author of papers. The experimental results show that one piece of main sub-information assisted by some minors could form a stronger rule very useful for the distinction of name ambiguity. Also some combinations of pieces of minor sub-information could produce a stronger rule. The association rules selected by the experiment could be easily explained and commonsensible. Considering the association rules coming from the objective data and data mining method, they are more reliable.
name distinction case study association anaylsis feature extraction data mining
Bo Wu Wandong Cai Yongjun Li
Department of Computer Science Northwest Polytechnic University Xi an, Shaanxi, 710129, China
国际会议
太原
英文
285-289
2010-10-22(万方平台首次上网日期,不代表论文的发表时间)