Auto-tuning Mapping Strategy for Parallel CFD Program
We present an auto-tuning mapping strategy for mapping grid blocks to multi-processors and multinodes in a parallel CFD program. We first calculate the communication matrices from the topology construction of the grids, then use two heuristics in tuning the possible partitions, which largely shrinks the searching space and obtains the optimal mapping under these searching constraints, making out a compromise between tuning cost and system performance. Experiments are carried out for a CFD calculation case on a high performance computing platform. Compared with general block mapping, cycle mapping and random mapping strategies, our strategy has an extraordinary advantage over the others in load balance and communication overhead.
auto-tuning mapping strategy heuristic
Liu Fang Wang Zhenghua Che Yonggang
School of Computer, National University of Defense and Technology Changsha, China
国际会议
杭州
英文
222-226
2012-10-28(万方平台首次上网日期,不代表论文的发表时间)