Utility-Based Anonymization for Multiple Sensitive Attributes

摘要：

　　Privacy-preserving data publication problem has attracted more and more attention in recent years.A lot of related research works have been done towards dataset with single sensitive attribute.However,usually,original data contains more than one sensitive attribute.In this paper,we apply k-anonymity principle to solve the data publication problem for dataset with multiple sensitive attributes.We first cluster the sensitive attribute values based on a utility matrix.We calculate the utility for both numeric attribute and categorical attribute respectively.Then we use a greedy strategy to partition the tuples into equivalence classes.Our method can guarantee that the size of the equivalence class is k except the last one,which reduces the information loss.And we can guarantee the diversity of the sensitive value in an equivalence class,which can protect the privacy against the homogeneity attack and similarity attack.Experiments on real dataset show that our method performs well on information loss,which indicates that we can guarantee the data utility while protecting the personal privacy.

会议类型: 国内会议

会议名称: 第八届中国可信计算与信息安全学术会议

会议地点: 湖北恩施

会议语种:英文

页码: 1-7

在线出版日期: 2014-09-13（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Utility-Based Anonymization for Multiple Sensitive Attributes