A Parallelization Cost Model for GPU
Using GPU for general computing has become an important research direction in high performance computing technology. However, this is not a lossless optimization method. Due to the impact of device initialization cost, data transmission delay, specific characteristics of programs, and other factors, the general computing on GPU may not always achieve the desired speedup, and sometimes results in program execution performance degradation. On the basis of in-depth analysis of GPU internal processing mechanisms, the main factors affecting GPU implementation performance are pointed out, and a parallel cost model for GPU based on static program analysis is proposed to provide judgement basis for using GPU in general computing.
GPU parallel cost model warp
Zhang Dan Zhao Rongcai Han Lin Wang Tao Qu jin
China National Digital Switching System Engineering and Technology Research Center Zhengzhou, Henan Province, China
国际会议
成都
英文
515-519
2010-06-12(万方平台首次上网日期,不代表论文的发表时间)