会议专题

Prediction of O-linked Glycosylation Sites in Protein Sequence by PCA-LDA

O-glycosylation is one of the main types of the mammalian protein glycosylation, it occurs on the particular site of serine and threonine. In this paper, a new method of PCA-LDA is used for the prediction of O-glycosylation site under all kinds of window size (5,7,9,11,21,31,41,51). The new method of PCA-LDA is the combination of PCA and LDA, we also call it hybrid discriminant analysis(HDA). The test protein sequence which is encoded by the sparse coding is projected to the one-dimensional subspace and then by calculating the Mahanalobis distance between the projection and each class center, the test protein sequence is assigned into the “nearest class, so it can be known that whether a particular site of serine and threonine is glycosylated. The result of experiments shows that the proposed method of HDA is more effective and accurate. The prediction accuracy is about 75%-92.5%.

prediction glycosylation protein sparse coding HDA classification

Xue-mei Yang

College of Mathematics and Information Science Xianyang Normal Univ. Xianyang, 712000,China

国际会议

2009 Ninth International Conference on Hybrid Intelligent Systems(第九届混合智能系统国际会议 HIS 2009)

沈阳

英文

1-4

2009-08-12(万方平台首次上网日期,不代表论文的发表时间)