Efficiently Detecting Frequent Patterns in Biological Sequences

摘要：

Most of the existing algorithms for mining frequent patterns could produce lots of projected databases and short candidate patterns which could increase the time and memory cost of mining. In order to overcome such shortcoming, we propose two fast and efficient algorithms named SBPM and MSPM for mining frequent patterns in single and multiple biological respectively. We first present the concept of primary pattern, and then use prefix tree for mining frequent primary patterns. A pattern growth approach is also presented to mine all the frequent patterns without producing large amount of irrelevant patterns. Our experimental results show that our algorithms not only improve the performance but also achieve effective mining results.

关键词： biological sequence primary pattern frequent pattern mining prefix tree

作者: Wei Liu Ling Chen

作者单位: Institute of Information Science and Technology Yangzhou University Yangzhou 225127 , China

会议类型: 国际会议

会议名称: 第8届全国web信息系统及应用学术会议

会议地点: 重庆

会议语种:英文

页码: 102-107

在线出版日期: 2011-10-21（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Efficiently Detecting Frequent Patterns in Biological Sequences