Conference Special Topic

MRData: a MapReduce-Based Tool for Heterogeneous Data Integration

As data volumes grow sharply and the relationships among data sources become increasingly intricate, integrating massive data sources and uncovering latent information in the integrated data have become urgent problems. At present, industry tends to adopt distributed computing models to integrate massive data. Visualization is a critical step in data analysis and data mining for obtaining valuable, in-depth information. We design MRData, a tool for heterogeneous data integration with two features: 1) parallel data processing based on Hadoop, a distributed platform; and 2) visual analysis. Experiments verify the efficiency of MRData.

data integration; Hadoop; MapReduce; visualization

Liutong Xu, Kai Jin, Hongqiao Tian

Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876, China

International conference

2010 International Conference of Information Science and Management Engineering (ISME 2010)

Xi'an

English

849-852

2010-08-07 (date first made available online on the Wanfang platform; not the paper's publication date)