Efficiently Detecting Frequent Patterns in Biological Sequences
Most existing algorithms for mining frequent patterns generate many projected databases and short candidate patterns, which increases the time and memory cost of mining. To overcome this shortcoming, we propose two fast and efficient algorithms, SBPM and MSPM, for mining frequent patterns in single and multiple biological sequences, respectively. We first introduce the concept of a primary pattern and then use a prefix tree to mine frequent primary patterns. We also present a pattern growth approach that mines all frequent patterns without producing a large number of irrelevant patterns. Our experimental results show that our algorithms not only improve performance but also achieve effective mining results.
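The general idea of seeding a search with a prefix tree and then growing patterns can be illustrated with a minimal sketch. This is not the SBPM or MSPM algorithm: the abstract does not define a primary pattern, so the sketch assumes (hypothetically) that seeds are contiguous substrings of a fixed length `seed_len`, that support is the number of possibly overlapping occurrences in a single sequence, and that patterns grow one character to the right. The names `TrieNode`, `build_seed_trie`, `frequent_seeds`, and `grow_patterns` are illustrative only.

```python
class TrieNode:
    """Prefix-tree node; `positions` holds start indices of the substring spelled by the path."""
    def __init__(self):
        self.children = {}
        self.positions = []


def build_seed_trie(sequence, seed_len):
    """Insert every length-`seed_len` substring into a prefix tree (assumed seed definition)."""
    root = TrieNode()
    for start in range(len(sequence) - seed_len + 1):
        node = root
        for ch in sequence[start:start + seed_len]:
            node = node.children.setdefault(ch, TrieNode())
        node.positions.append(start)
    return root


def frequent_seeds(root, min_support):
    """Walk the trie and return {seed: start positions} for seeds meeting min_support."""
    found = {}
    stack = [(root, "")]
    while stack:
        node, prefix = stack.pop()
        if len(node.positions) >= min_support:
            found[prefix] = node.positions
        for ch, child in node.children.items():
            stack.append((child, prefix + ch))
    return found


def grow_patterns(sequence, min_support, seed_len=2):
    """Return {pattern: occurrence count} for all frequent substrings of length >= seed_len."""
    frontier = frequent_seeds(build_seed_trie(sequence, seed_len), min_support)
    result = {}
    while frontier:
        next_frontier = {}
        for pattern, starts in frontier.items():
            result[pattern] = len(starts)
            # Extend each occurrence by one character; only frequent patterns are ever extended,
            # so infrequent candidates are pruned as soon as they appear.
            extended = {}
            for s in starts:
                end = s + len(pattern)
                if end < len(sequence):
                    extended.setdefault(pattern + sequence[end], []).append(s)
            for candidate, cand_starts in extended.items():
                if len(cand_starts) >= min_support:
                    next_frontier[candidate] = cand_starts
        frontier = next_frontier
    return result


if __name__ == "__main__":
    for pattern, count in sorted(grow_patterns("ACGTACGTAC", min_support=2).items()):
        print(pattern, count)
```

Because each frequent pattern carries its own list of start positions, an extension step only rescans those positions rather than the whole sequence, which is the intuition behind avoiding large numbers of irrelevant candidates.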
biological sequence; primary pattern; frequent pattern mining; prefix tree
Wei Liu, Ling Chen
Institute of Information Science and Technology, Yangzhou University, Yangzhou 225127, China
International conference
Chongqing
English
102-107
2011-10-21 (date first posted online on the Wanfang platform; does not represent the paper's publication date)