Conference topic

A graph theoretic algorithm for removing redundant protein sequences

Many biological sequence databases contain redundant sequences, which do not help statistical analysis and cost additional computational time and resources to process. This led us to design a new, fast program for generating a non-redundant sequence set. A graph-theoretic algorithm was designed to process BLAST output and remove redundant proteins from a protein sequence database. We have developed a program named BlastCuller, which can be used to generate a non-redundant protein database. BlastCuller is a flexible program with a sequence similarity cutoff parameter, which can be any decimal from 0.0 to 1.0. The program can be downloaded from http://pcal.biosino.org/BlastCuller.html.
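The abstract does not spell out the algorithm itself, so the following is only a minimal sketch of one common way to frame the problem, not BlastCuller's actual method. It assumes that sequences are graph vertices, that two sequences are joined by an edge when their pairwise similarity (for example, a BLAST identity rescaled to the range 0.0 to 1.0) meets the cutoff, and that a non-redundant set is an independent set chosen by a greedy minimum-degree heuristic. The function name `cull_redundant` and the `(id_a, id_b, similarity)` hit format are hypothetical.

```python
def cull_redundant(ids, hits, cutoff=0.9):
    """Return a subset of ids in which no two sequences are linked by a hit
    with similarity >= cutoff (a maximal independent set, chosen greedily).

    ids  : iterable of sequence identifiers (hypothetical input format)
    hits : iterable of (id_a, id_b, similarity) with similarity in [0.0, 1.0]
    """
    # Build an undirected graph: one vertex per sequence,
    # one edge per pair considered redundant at this cutoff.
    neighbors = {seq_id: set() for seq_id in ids}
    for a, b, similarity in hits:
        if a != b and similarity >= cutoff and a in neighbors and b in neighbors:
            neighbors[a].add(b)
            neighbors[b].add(a)

    remaining = set(neighbors)
    kept = []
    while remaining:
        # Greedy heuristic (an assumption, not taken from the paper):
        # keep the vertex with the fewest remaining neighbors,
        # breaking ties by identifier so the result is deterministic.
        best = min(remaining, key=lambda v: (len(neighbors[v] & remaining), v))
        kept.append(best)
        # Drop the chosen vertex and every sequence redundant with it.
        remaining -= neighbors[best] | {best}
    return sorted(kept)


if __name__ == "__main__":
    ids = ["P1", "P2", "P3", "P4"]
    hits = [("P1", "P2", 0.95), ("P2", "P3", 0.92), ("P3", "P4", 0.40)]
    print(cull_redundant(ids, hits, cutoff=0.9))
```

With a cutoff of 0.9 in this toy input, P2 is redundant with both P1 and P3, so dropping P2 alone leaves a non-redundant set. A greedy choice like this yields a maximal independent set but not necessarily the largest possible one.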

redundant proteins; independent set; BLAST; graph theory

Pengfei Liu, Zhenbing Zeng, Ziliang Qian, KaiYan Feng, Yudong Cai

Software Engineering Institute of East China Normal University, 3663 North Zhongshan Road, Shanghai 20; Software Engineering Institute of East China Normal University, 3663 North Zhongshan Road, Shanghai 20; Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese; Division of Imaging Science & Biomedical Engineering, The University of Manchester, Room G424 Stopford; CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chin

International conference

The 3rd International Conference on Bioinformatics and Biomedical Engineering (iCBBE 2009)

Beijing

English

1-3

2009-06-11 (date first posted online on the Wanfang platform; not the paper's publication date)