MRData: a MapReduce-Based Tool for Heterogeneous Data Integration
As the volume of data increasing sharply and the relationship among different data sources becoming intricately, how to integrate mass data sources and how to find latent information from the integrated data is a matter of urgency. At present, industry tends to adopt distributed computing model to solve the integration of massive data. Aiming at getting the valuable and in-depth information, visualization is a critical step in data analysis and data mining. We design a tool called MRData for heterogeneous data integration which has two features: 1) parallel data processing based on Hadoop which is a distributed platform; 2) visual analysis. And at last, experiments verify the efficiency of MRData.
data integration hadoop mapreduce visualization
Liutong Xu Kai Jin Hongqiao Tian
Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia Beijing University of Posts and Telecommunications Beijing, China 100876
国际会议
西安
英文
849-852
2010-08-07(万方平台首次上网日期,不代表论文的发表时间)