会议专题

A Common-Subsequence-Based Approach for Mining Deep Order Preserving Submatrix

  As an effective biclustering model,order-preserving submatrix(OPSM)has been widely applied to biological gene expression data mining.Recently,biologists hope to find deep OPSMs with long patterns and comparatively few support rows,which are not only useful on the interpretation of gene regulatory networks but also have essential biological significance.Unfortunately,the traditional exact mining algorithms based on Apriori principle could not deal with the deep OPSM problem,since they often take a large minimum support threshold for pattern pruning,and inevitably miss some significant deep OPSMs.Therefore,this paper proposes a new exact algorithm for mining deep OPSMs,which obtain all the deep OPSMs by finding the common subsequences shared by every two rows.Experiments have been done in both real and synthetic data sets,and the results show that our algorithm is suitable for the full mining of deep OPSMs with a small support,which could even find all the deep OPSMs with a minimum support threshold of 2.Compared with the traditional sequential pattern mining algorithms which depend on relatively large support threshold,our algorithm is an effective one to solve the deep OPSM problem.

biclustering deep order preserving submatrix common subsequence

Yun Xue Tiechen Li Zhiwen Liu Zhengling Liao Hua Xiao Hongya Zhao Xiaohui Hu

School of Physics and Telecommunication Engineering South China Normal University Guangzhou,China Industrial Center Shenzhen Polytechnic Shenzhen,China

国际会议

The 2014 10th International Conference on Natural Computation (ICNC 2014) and the 2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2014)(第十届自然计算和第十一届模糊系统与知识发现国际会议)

厦门

英文

342-348

2014-08-19(万方平台首次上网日期,不代表论文的发表时间)