Visualization and Analysis for Multidimensional Gene Expressions Signature of Cigarette Smoking
Biologists often use gene chip to get massive experimental data in the field of bioscience and chemical sciences. Facing a large amount of experimental data, researchers often need to find out a few interesting data or simple regulations. This paper presents a set of methods to visualize and analyze the data for gene expression signatures of people who smoke. We use the latest research data from National Center for Biotechnology Information. Totally, there are more than 400 thousand expressions data. Using these data, we can use parallel coordinates method to visualize the different gene expressions between smokers and nonsmokers and we can distinguish non-smokers, former smokers and current smokers by using the different colors. It can be easy to find out which gene is more important during the lung cancer angiogenesis in the smoking people. In another way, we can use a hierarchical model to visualize the inner relation of different genes. The location of the nodes shows different expression moment and the distance to the root shows the sequence of the expression. We can use the ring layout to represent all the nodes, and connect the different nodes which are related with color lines. Combined with the parallel coordinates method, the visualization result show the important genes and some inner relation obviously, which is useful for examination and prevention of lung cancer.
cigarette gene expressions signature visualization parallel coordinates parallel data
Wang Changbo Xiao Zhao Zhang Tianlun Cui Jin Pang Chenming
Software Engineering Institute, East China Normal University, Shanghai, 200062
国际会议
桂林
英文
1-8
2011-11-01(万方平台首次上网日期,不代表论文的发表时间)