A Parallelization Cost Model for GPU

Abstract:

Using GPUs for general-purpose computing has become an important research direction in high-performance computing. However, offloading work to a GPU is not a guaranteed gain. Because of device initialization cost, data transfer latency, program-specific characteristics, and other factors, general-purpose computing on a GPU does not always achieve the desired speedup and sometimes degrades program performance. Based on an in-depth analysis of the GPU's internal processing mechanisms, this paper identifies the main factors that affect the performance of GPU implementations and proposes a parallelization cost model for GPUs, built on static program analysis, to provide a basis for deciding whether to use a GPU for general-purpose computing.

Keywords: GPU; parallel cost model; warp

Authors: Zhang Dan, Zhao Rongcai, Han Lin, Wang Tao, Qu Jin

Affiliation: China National Digital Switching System Engineering and Technology Research Center, Zhengzhou, Henan Province, China

Conference type: International conference

Conference name: 2010 International Conference on Computer and Communication Technologies in Agriculture Engineering (CCTAE 2010)

Conference location: Chengdu

Conference language: English

Pages: 515-519

Online publication date: 2010-06-12 (the date the record first went online on the Wanfang platform, not the paper's publication date)
