Cocktail: A Hybrid System Combining Hadoop and Storm
Hadoop and Storm both play a significant role in cloud computing, and each has its own area of applicability. Cocktail is a new hybrid system that combines Hadoop and Storm into a single system, leveraging the capabilities of both computing frameworks. The design and implementation of Cocktail include a SQL-like query language that makes implementation details transparent to users, an intelligent framework selector that uses a cost model to choose the appropriate framework automatically, and an efficient resource scheduling and task execution framework. Cocktail covers a wide range of application scenarios, from batch processing to stream computing, using Storm to process real-time data and Hadoop to process large-scale data. We compare the performance, throughput, and scalability of Cocktail with those of Summingbird to demonstrate its practicability and capability. According to the benchmark, for small-scale data, the performance of Cocktail is close to that of Summingbird on Storm and 20%~40% faster than Summingbird on Hadoop. For large-scale data, Cocktail's throughput is 40% higher than that of Summingbird on Storm.
Hadoop Storm Hybrid System
Yong Zhao Ying Zhang Yiting Yao Youfu Li Peng Liu
School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
International conference
Chongqing
English
20-25
2015-12-19 (date first posted online on the Wanfang platform; not the paper's publication date)