Scalable Multi-GPU Decoupled Parallel Rendering Approach in Shared Memory Architecture
As the performance-price ratio of the GPU becomes higher, lots of systems are able to accommodate more than one GPU in node. Each GPU in node can afford powerful rendering ability. It is very important to effectively organize parallel rendering pipeline to fully exploit the compute units of the system. But lots of parallel rendering systems usually join hardware rendering stage with composition stage in the display thread and this frequently leads to GPU stall. In this paper, we describe a decoupled parallel rendering approach and enable the two stages to execute in parallel. With the frame buffer in the main memory, the full image rendering time is totally decided by the GPU rendering ability when the rendering task is large enough. Theoretical analysis and experiment results both evidence that the performance of our method is much better than the coupled parallel rendering method. We also test the scalability of the approach and get a linear performance speedup with the GPU number when the rendering task is large enough. The approach is easy to be implemented and any parallel rendering application can benefit from it.
Scalable Rendering Multi-GPU Composition Parallel Rendering
Huahai Liu Pan Wang Kewen Wang Xun Cai Liang Zeng Sikun Li
School of Computer National University of Defense Technology Changsha, China
国内会议
北京
英文
1-7
2011-11-04(万方平台首次上网日期,不代表论文的发表时间)