DDGrid: A Grid Computing Environment with Massive Concurrency and Fault-tolerance Support
Grid computing is an effective computing paradigm widely used to solve complex problems. A variety of grid middleware systems support the operation of grid infrastructures, including CNGrid GOS, EGEE gLite, Globus Toolkit, and OSG Condor. These grid infrastructures focus on encapsulating the underlying computing and storage resources and on providing basic services such as batch job service, information service, scheduling service, and cross-domain security. Other features, such as fault tolerance and massive concurrency support, are vital to the success of real applications, especially complex and long-running ones, but they have not been the focus of current grid systems. DDGrid, a key project supported by CNGrid (China National Grid), aims to establish a grid computing environment that uses computing resources scattered over the Internet to carry out virtual-screening operations, which require computing power that a single institute or company cannot afford. In our design and implementation of DDGrid, we propose a master/worker mode that effectively utilizes the computing resources provided by the underlying grid infrastructure and aims to provide the additional fault-tolerance and massive concurrency support that are essential to real applications.
Master/Worker; Fault-tolerance; Massive Concurrency
Yongjian Wang, Zhongzhi Luan, Depei Qian, Yuanqiang Huang, Ting Chen, Biao Han, Yinan Ren, Kunqian Yu, Hualiang Jiang
Sino-German Joint Software Institute, Beihang University, Beijing, China; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences
International conference
Seventh International Conference on Grid and Cooperative Computing (GCC 2008)
Shenzhen
English
5-14
2008-10-24 (date first posted online on the Wanfang platform; not the paper's publication date)