会议专题

GPU-Meta-Storms:Computing the similarities among massive microbial communities using GPU

  With the development of next-generation sequencing and metagenomic technologies,the number of metagenomic samples of microbial communities is increasing with exponential speed.The comparison among metagenomic samples could facilitate the data mining of the valuable yet hidden biological information held in the massive metagenomic data.However,current methods for metagenomic comparison are limited by their ability to process very large number of samples each with large data size.In this work,we have developed an optimized GPU-based metagenomic comparison algorithm,GPU-Meta-Storms,to evaluate the quantitative phylogenetic similarity among massive metagenomic samples,and implemented it using CUDA (Compute Unified Device Architecture) and C++ programming.The GPU-Meta-Storms program is optimized for CUDA with non-recursive transform,register recycle,memory alignment and so on.Our results have shown that with the optimization of the phylogenetic comparison algorithm,memory accessing strategy and parallelization mechanism on many-core hardware architecture,GPU-Meta-Storms could compute the pair-wise similarity matrix for 1920 metagenomic samples in 4 minutes,which gained a speed-up of more than 1000 times compared to CPU version Meta-Storms on single-core CPU,and more than 100 times on 16-core CPU.Therefore,the high-performance of GPU-Meta-Storms in comparison with massive metagenomic samples could thus enable in-depth data mining from massive metagenomic data,and make the real-time analysis and monitoring of constantly-changing metagenomic samples possible.

Metagenome Phylogenetic Comparison GPU High Performance Computing

Xiaoquan Su Xuetao Wang JianXu Kang Ning

Shandong Key Laboratory of Energy Genetics, CAS Key Laboratory of Biofuels and BioEnergy Genome Center, QingdaoInstitute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao 266101, Shandong Province,People’s Republic of China

国际会议

7th International Conference on Systems Biology(第7届计算系统生物学国际研讨会)(ISB2013)

安徽黄山

英文

69-74

2013-08-22(万方平台首次上网日期,不代表论文的发表时间)