会议专题

Shrunken Dissimilarity Measure for Genome-wide SNP Data Classification

Recent development of high-resolution single-nucleotide polymorphism (SNP) arrays allows detailed assessment of genome-wide human genome variations. However, SNP data typically has a large number of SNPs (e,g., 400 thousand SNPs in genome-wide Parkinson disease SNP data) and a few hundred of samples. Conventional classification methods may not be effective when applied to such genome-wide SNP data. In this paper, we propose to develop and use shrunken dissimilarity measure to analyze and select relevant SNPs for classification problems. Examples for HapMap data and Parkinson data are given to demonstrate the effectiveness of the proposed method and illustrate it has the potential to become a useful analysis tool for SNP data sets. In particular, we find some SNPs in chromosome 2 that they contain in some genes which is relevant to Parkinson disease.

Shrinkage Dissimilarity measure Categorical centroids Single nucleotide polymorphism Genome-wide Classification

Haiyong Liao Yang Liu Michael K.Ng

Department of Mathematics,Hong Kong Baptist University,Kowloon Tong,Hong Kong

国际会议

The 3rd International Symposium on Optimization and System Biology(第三届最优化与系统生物学国际会议 OSB09)

张家界

英文

73-80

2009-09-20(万方平台首次上网日期,不代表论文的发表时间)