会议专题

Efficient Top-k Skyline Computation in MapReduce

  Skyline is widely used in multi-objective decisionmaking, data visualization and other fields.With the rapid increasing of data volume, skyline of big data has also attracted more and more attention.However, skyline of big data has its own shortcomings.When the dimension increases, skyline results will be numerous, and we would like to select k points from the result sets.In this paper, we propose the top-k skyline of big data.It is a Distributed Top-k Skyline Method in MapReduce, called MR-DTKS.Firstly, we convert the multidimensional data to a single value to determine the dominance relationship of two data points.Secondly, we calculate the score by using the converted values to filter out most of unwanted data objects.Finally, we choose k data objects having the strongest dominating capacity.A large number of experiments show that our method is effective,and has good flexibility and scalability on real data sets as well as synthetic data sets.

skyline top-k skyline big data MapReduce

Baoyan Song Aili Liu Linlin Ding

School of Information, Liaoning University,Shenyang, China

国际会议

The 12th Web Information System and Application Conference第十二届全国Web信息系统及其应用学术会议(WISA2015)、全国第十次语义Web 与本体论学术研讨会(SWON2015)、全国第九次电子政务技术及应用学术研讨会(EGTA2015)

济南

英文

67-70

2015-09-11(万方平台首次上网日期,不代表论文的发表时间)