P3S2:Practical Secure Protocol for Speech Data Publishing
Speech data publishing discloses users'data privacy,and thus entails more privacy risks for users.Existing work sanitized the content,voice,and,voiceprint of speech data without considering the consistence among these three aspects,and therefore cannot protect users'data privacy.To this end,we propose a practical secure protocol for speech data publishing P3S2,the first attempt towards taking the corrections among the three factors into consideration when it sanitizes users'speech data.To concrete,it designs a three-dimension sanitization that utilizes feature learning to capture the set of characteristics in each dimension,and then sanitizes speech data in each dimension using the learned features.As a result,the correlations among the three dimensions of the sanitized speech data are guaranteed.Furthermore,it utilizes two real world datasets,TED talks and LibriSpeech to evaluate the performance of P3S2 in terms of the data privacy preservation.
Speech data publishing data privacy feature learning and data sanitization
Ping Zhao Jiaxin Sun Anqi Zhang Sifan Ni Guanglin Zhang
College of Information Science and Technology Donghua University Shanghai China
国际会议
2019国图灵大会(ACM Turing Celebration conference-China 2019 )
成都
英文
517-521
2019-05-17(万方平台首次上网日期,不代表论文的发表时间)