Conference Topic

EPBC: Enhanced Possibilistic Biclustering With Application to Gene Expression Analysis

Biclustering is an important data mining technique for analyzing gene expression data from microarray technology: it identifies groups of genes that behave similarly under a subset of conditions. Because a gene may play more than one biological role in conjunction with distinct groups of genes, possibilistic biclustering algorithms can give insight into the different biological processes in which each gene might participate, along with a degree of participation and the conditions under which its participation is most effective. This paper proposes modifications to the possibilistic biclustering algorithm (PBC) introduced by Maurizio Filippone et al. in 2004, in which the mean square residue is minimized and, at the same time, the size of a bicluster is maximized by computing the zeros of the derivatives of the objective function with respect to the row and column memberships. That algorithm suffers from serious drawbacks. First, in computing the derivatives of the objective function, it treats the residue as a constant, even though changing the membership of a row or a column affects the residue of each entry in the bicluster, since it changes the average of the whole bicluster and the average of each column or row, respectively. Furthermore, the algorithm is strongly sensitive to its two input parameters. In this paper, the derivatives are computed accurately, and the objective function is modified so that only a single parameter is needed, which allows us to develop a procedure for approximating a range of suitable values for this parameter.
Although the accurate computation of the derivatives slightly increases the runtime of the proposed algorithm, an experimental study on Yeast and several artificial datasets with embedded constant and additive modules at different noise levels shows that our algorithm can offer substantial improvements in the quality of the output biclusters over several previously proposed biclustering algorithms.
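The abstract's point that the residue is not constant with respect to the memberships can be illustrated with a small numerical sketch. This is not the PBC or EPBC objective function, which the abstract does not give. It is a membership-weighted form of the mean square residue, under the assumption that the joint membership of entry (i, j) is the product of its row and column memberships. The function names `weighted_residues` and `weighted_msr` and the toy matrix are hypothetical.

```python
import numpy as np


def weighted_residues(X, a, b):
    """Residue of every entry, with means weighted by memberships.

    X: data matrix (rows = genes, columns = conditions)
    a: row memberships, b: column memberships (positive values)
    Assumption: the joint membership of entry (i, j) is a[i] * b[j].
    """
    w = np.outer(a, b)
    row_mean = (X * b).sum(axis=1) / b.sum()           # mean of each row over columns
    col_mean = (X * a[:, None]).sum(axis=0) / a.sum()  # mean of each column over rows
    all_mean = (w * X).sum() / w.sum()                 # mean of the whole bicluster
    return X - row_mean[:, None] - col_mean[None, :] + all_mean, w


def weighted_msr(X, a, b):
    """Membership-weighted mean square residue."""
    r, w = weighted_residues(X, a, b)
    return float((w * r ** 2).sum() / w.sum())


# Rows 0 and 1 follow an additive pattern; row 2 does not.
X = np.array([[1.0, 2.0, 3.0],
              [2.0, 3.0, 4.0],
              [5.0, 9.0, 0.0]])
b = np.ones(3)

# Lowering only the membership of row 2 changes the residues of row 0,
# because the column means and the overall mean both shift.
r_full, _ = weighted_residues(X, np.array([1.0, 1.0, 1.0]), b)
r_low, _ = weighted_residues(X, np.array([1.0, 1.0, 0.01]), b)
print(r_full[0])
print(r_low[0])
```

With all memberships equal to 1, `weighted_msr` reduces to the ordinary mean square residue of the full matrix. A derivative of such an objective that holds the residues fixed would miss the dependence shown by the two printed rows.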

Possibilistic biclustering; fuzzy clustering; biclustering; bidimensional clustering; microarray data analysis; gene expression data

Mohamed A. Mahfouz, Mohamed A. Ismail

Department of Computers and Systems Engineering, Faculty of Engineering, Alexandria University, Alexandria 21544, Egypt

International conference

The 3rd International Conference on Bioinformatics and Biomedical Engineering (iCBBE 2009)

Beijing

English

1-6

2009-06-11 (date first posted online on the Wanfang platform; not the paper's publication date)