P3S2:Practical Secure Protocol for Speech Data Publishing

摘要：

　　Speech data publishing discloses users'data privacy,and thus entails more privacy risks for users.Existing work sanitized the content,voice,and,voiceprint of speech data without considering the consistence among these three aspects,and therefore cannot protect users'data privacy.To this end,we propose a practical secure protocol for speech data publishing P3S2,the first attempt towards taking the corrections among the three factors into consideration when it sanitizes users'speech data.To concrete,it designs a three-dimension sanitization that utilizes feature learning to capture the set of characteristics in each dimension,and then sanitizes speech data in each dimension using the learned features.As a result,the correlations among the three dimensions of the sanitized speech data are guaranteed.Furthermore,it utilizes two real world datasets,TED talks and LibriSpeech to evaluate the performance of P3S2 in terms of the data privacy preservation.

关键词： Speech data publishing data privacy feature learning and data sanitization

作者: Ping Zhao Jiaxin Sun Anqi Zhang Sifan Ni Guanglin Zhang

作者单位: College of Information Science and Technology Donghua University Shanghai China

会议类型: 国际会议

会议名称: 2019国图灵大会(ACM Turing Celebration conference-China 2019 )

会议地点: 成都

会议语种:英文

页码: 517-521

在线出版日期: 2019-05-17（万方平台首次上网日期，不代表论文的发表时间）

会议专题

P3S2:Practical Secure Protocol for Speech Data Publishing