Exploring Storage Optimizations to Accelerate Parallel Out-of-core Matrix Product
The separate development between the storage system and scientific application has hidden most of internal implementation strategies of the storage system from the scientific application designers. This would worsen the existing I/O bottleneck problem of most scientific applications to some extent To address this problem, this study chooses the matrix and its parallel out-of-core product algorithm to study the interactions between the applications and the parallel storage system. Especially, the data distribution and access interfaces of the matrix are analyzed and optimized firstly. Then, the communication among processes are incorporated into the parallel out-of-core matrix multiplication algorithm to reduce its disk access times. Experiments show that the proposed optimizations can reduce the time spent in accessing data from disk and accelerate the parallel out-of-core matrix product.
Parallel out-of-core matrix product Parallel storage system Acceleration
Bin Dong Xiuqiao Li Limin Xiao Li Ruan
State Key Laboratory of Software Development Environment, School of Computer Science and Engineering, Beihang University, Beijing 100191, P.R.China
国际会议
昆明、丽江
英文
1-2
2011-04-15(万方平台首次上网日期,不代表论文的发表时间)