Screening Data for Phylogenetic Analysis of Land Plants: A Parallel Approach
Screening data for phylogenetic analysis from large datasets is a known computational problem in data-intensive applications. In this paper, we implement an approach to screening sequence data for the Platform for Phylogenetic Analysis of Land Plants (PALPP). The approach uses the MapReduce paradigm to parallelize the Basic Local Alignment Search Tool (BLAST) and to manage its execution, and uses machine virtualization to encapsulate its execution environment and commonly used data sets in flexibly deployable virtual machines. Two methods of running BLAST on Hadoop are implemented, and an evaluation of the approach is presented.
Data screening; BLAST; Hadoop; MapReduce; PALPP
Liu Yong; Gao Yanping; Zhou Yuanchun; Li Jianhui; Meng Zhen; Liu Qi
Scientific Data Center, Computer Network Information Center, CAS, Beijing, China
International conference
Hangzhou
English
305-308
2010-10-21 (date first posted online on the Wanfang platform; not the paper's publication date)