Automatic Computation and Data Decomposition Algorithm with No Communication
On distributed-memory machines, a processor accesses its local memory much faster than it accesses remote memory. How to decompose data and computation so as to achieve maximum parallelism with minimum communication is therefore a key issue in automatic parallel compilation. In this paper, the authors present an automatic decomposition algorithm based on constraint equations. The algorithm yields a data and computation decomposition that requires no communication. By relaxing these constraint equations, the authors obtain decompositions with more parallelism. Because such decompositions may require communication, an improved method that eliminates some of that communication through data replication is also presented.
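As an illustration only, the idea of a communication-free decomposition can be sketched as a check that every loop iteration touches only data owned by the processor that executes it. The abstract does not state the paper's constraint equations, so the sketch below is not the authors' algorithm: the function name is_communication_free, the example loop, and the row-wise and column-wise mappings are all assumptions chosen for the example, and the check is a brute-force enumeration rather than a symbolic solution.

```python
from itertools import product


def is_communication_free(comp_map, refs, bounds):
    """Return True if every iteration touches only data owned by its own processor.

    comp_map: maps an iteration tuple to a (virtual) processor id.
    refs:     list of (access, data_map) pairs; access maps an iteration to
              the array index it touches, data_map maps that index to its owner.
    bounds:   loop trip counts, one per loop level.
    All names here are hypothetical and exist only for this sketch.
    """
    for it in product(*(range(n) for n in bounds)):
        owner = comp_map(it)
        for access, data_map in refs:
            if data_map(access(it)) != owner:
                return False  # this iteration would need a remote element
    return True


# Hypothetical loop nest:  for i: for j: A[i][j] = B[i][j + 1]

# Row-wise decomposition: iteration (i, j) and row i of A and B go to processor i.
refs_by_row = [
    (lambda it: (it[0], it[1]), lambda idx: idx[0]),      # A[i][j]
    (lambda it: (it[0], it[1] + 1), lambda idx: idx[0]),  # B[i][j + 1]
]
row_ok = is_communication_free(lambda it: it[0], refs_by_row, [4, 4])

# Column-wise decomposition: iteration (i, j) and column j go to processor j.
refs_by_col = [
    (lambda it: (it[0], it[1]), lambda idx: idx[1]),      # A[i][j]
    (lambda it: (it[0], it[1] + 1), lambda idx: idx[1]),  # B[i][j + 1]
]
col_ok = is_communication_free(lambda it: it[1], refs_by_col, [4, 4])

print(row_ok, col_ok)
```

In this example the row-wise mapping satisfies the ownership condition for both references, while the column-wise mapping fails because B[i][j + 1] lives on a neighbouring processor. Replicating the needed column of B on each processor is the kind of remedy the abstract describes as data replication.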
parallel recognition compiler; computation partition; data distribution
Han Lin, Zhao Rongcai, Pang Jianmin, Dong Chunli
National Digital Switching System Engineering & Technological R&D Center, Zhengzhou, Henan, China
International conference
Hangzhou
English
270-273
2006-10-12 (date first posted on the Wanfang platform; not the paper's publication date)