会议专题

Automatic Computation and Data Decomposition Algorithm with No Communication

For distributed memory machines the speed that processor accesses local memories is much faster than its speed to access remote memories. Thus how to decompose data and computation properly to achieve maximum parallelism and minimum communication is a key issue of automatic parallel compilation. In this paper, the authors present an automatic decomposition algorithm based on constraint equations. Using the algorithm, a data and computation decomposition result with no communication can be achieved. By releasing these constraint equations, the authors get decomposition result with more parallelism. As the result may have communications, so an improved method to eliminate some communications by data replication is also presented.

parallel recognition compiler computation partition data distribution

Han Lin Zhao Rongcai Pang Jianmin Dong Chunli

National Digital Switching System Engineering & Technological R&D Center Zhengzhou, Henan China

国际会议

2006 International Symposium on Distributed Computing and Applications to Business,Engineering and Science(2006年国际电子、工程及科学领域的分布式计算应用学术研讨会)

杭州

英文

270-273

2006-10-12(万方平台首次上网日期,不代表论文的发表时间)