Performance-Based Parallel Loop Self-scheduling on Heterogeneous Multicore PC Clusters

摘要：

In recent years, Multicore computers have been widely included in cluster systems. They adopt shared memory architectures. However, previous researches on parallel loop self-scheduling did not consider the feature of multicore computers. It is more suitable for shared-memory multiprocessors to adopt OpenMP for parallel programming. In this paper, we propose a performance-based approach that partitions loop iterations according to the performance weighting of cluster nodes. Because the iterations assigned to one MPI process will be processed in parallel by OpenMP threads running by the processor cores in the same computational node, the number of loop iterations to be allocated to one computational node at each scheduling step also depends on the number of processor cores in that node. Experimental results show that the proposed approach performs better than previous schemes.

关键词： Self-scheduling Parallel loop scheduling Multicore Cluster OpenMP MPI

作者: Chao-Tung Yang Jen-Hsiang Chang Chao-Chin Wu

作者单位: High-Performance Computing Laboratory Department of Computer Science, Tunghai University Taichung, 4 Department of Computer Science and Information Engineering National Changhua University of Education

会议类型: 国际会议

会议名称: The Second International Conference on High Performance Computing and Applications(第二届高性能计算及应用国际会议)

会议地点: 上海

会议语种:英文

页码: 509-514

在线出版日期: 2009-08-10（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Performance-Based Parallel Loop Self-scheduling on Heterogeneous Multicore PC Clusters