会议专题

Integrating Frequent Itemsets Mining with Relational Database

Frequent itemsets mining is becoming increasingly important since the size of databases grows even larger. Currently database systems are dominated by relational database. However the performance of SQL based data mining is known to fall behind specialized implementation and expensive mining tools being on sale. In this paper we analyzed a famous frequent itemsets discovery algorithms FP-growth,and propose a new implementation approach called DBFP-Growth to create disk-based FP-tree based on ORACLE PL/SQL, it can execute faster than using SQL directly. Also presents a novel SQL-based method DRRW, which can remove duplicate records from database without temp table generation.

Database Frequent itemset Data mining

Qiu Yong

Shandong Institute of Business and Technology,YanTai 264005 China

国际会议

第八届国际电子测量与仪器学术会议(Proceedings of 2007 8th International Conference on Electronic Measurement & Instruments)

西安

英文

2007-08-16(万方平台首次上网日期,不代表论文的发表时间)