会议专题

Clustering Sequential Data into Hierarchical Patterns

Sequential data, i.e. text string, is a common yet important data type. Automatically discovering patterns for sequential data is useful but challenging. In this paper, we address this task by clustering strings into hierarchical patterns. Such pattern hierarchy is particularly helpful for users to discover meaningful patterns as well as to interpret the encapsulated knowledge. We present the clustering algorithm in details and evaluate it on a large, real dataset of street addresses. The experiments demonstrate the effectiveness of our approach, making it a useful tool for analyzing and interpreting sequential data.

sequential data pattern hierarchical clustering

Xinying Song Johnson Apacible

School of Computer Science and Technology Harbin Institute of Technology Harbin, China Microsoft Research Redmond, Washington, U.S.A.

国际会议

2011 2nd International Conference on Data Storage and Data Engineering(DSDE 2011)(2011年第二届数据存储与数据工程国际会议)

西安

英文

154-158

2011-05-13(万方平台首次上网日期,不代表论文的发表时间)