Network-based support vector machine for classification of microarray samples

摘要：

Background: The importance of network-based approach to identifying biological markers for diagnostic classification and prognostic assessment in the context of microarray data has been increasingly recognized. To our knowledge, there have been few, if any, statistical tools that explicitly incorporate the prior information of gene networks into classifier building. The main idea of this paper is to take full advantage of the biological observation that neighboring genes in a network tend to function together in biological processes and to embed this information into a formal statistical framework.Results: We propose a network-based support vector machine for binary classification problems by constructing a penalty term from the F ∞-norm being applied to pairwise gene neighbors with the hope to improve predictive performance and gene selection. Simulation studies in both low-and high-dimensional data settings as well as two real microarray applications indicate that the proposed method is able to identify more clinically relevant genes while maintaining a sparse model with either similar or higher prediction accuracy compared with the standard and the L1 penalized support vector machines.Conclusions: The proposed network-based support vector machine has the potential to be a practically useful classification tool for microarrays and other high-dimensional data.

作者: Yanni Zhu Xiaotong Shen Wei Pan

作者单位: Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota School of Statistics, University of Minnesota, Minneapolis, Minnesota 55455, USA

会议类型: 国际会议

会议名称: The 7th Asia-Pacific Bioinformatics Conference(第七届亚太生物信息学大会)

会议地点: 北京

会议语种:英文

页码: 220-230

在线出版日期: 2009-01-01（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Network-based support vector machine for classification of microarray samples