Conference topic

A Parallel Algebraic Multigrid Solver on Graphics Processing Units

The paper presents a multi-GPU implementation of the preconditioned conjugate gradient algorithm with an algebraic multigrid preconditioner (PCG-AMG) for an elliptic model problem on a 3D unstructured grid. An efficient parallel sparse matrix-vector multiplication scheme underlying the PCG-AMG algorithm is presented for the many-core GPU architecture. A performance comparison of the parallel solver shows that a single Nvidia Tesla C1060 GPU board delivers the performance of a sixteen-node InfiniBand cluster, and that a multi-GPU configuration with eight GPUs is about 100 times faster than a typical server CPU core.

Gundolf Haase, Manfred Liebmann, Craig C. Douglas, Gernot Plank

Institute for Mathematics and Scientific Computing, University of Graz; Institute for Mathematics and Scientific Computing, University of Graz; Department of Mathematics, University of Wyoming; Computing Laboratory, Oxford University

International conference

The Second International Conference on High Performance Computing and Applications

Shanghai

English

38-47

2009-08-10 (date first made available online on the Wanfang platform; not the paper's publication date)