Protecting gender and identity with disentangled speech representations

被引:6
|
作者
Stoidis, Dimitrios [1 ]
Cavallaro, Andrea [1 ]
机构
[1] Queen Mary Univ London, Ctr Intelligent Sensing, London, England
来源
基金
英国工程与自然科学研究理事会;
关键词
privacy; soft biometrics; disentangled representation learning; variational autoencoder; IDENTIFICATION;
D O I
10.21437/Interspeech.2021-2163
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Besides its linguistic content, our speech is rich in biometric information that can be inferred by classifiers. Learning privacy-preserving representations for speech signals enables downstream tasks without sharing unnecessary, private information about an individual. In this paper, we show that protecting gender information in speech is more effective than modelling speaker-identity information only when generating a nonsensitive representation of speech. Our method relies on reconstructing speech by decoding linguistic content along with gender information using a variational autoencoder. Specifically, we exploit disentangled representation learning to encode information about different attributes into separate subspaces that can be factorised independently. We present a novel way to encode gender information and disentangle two sensitive biometric identifiers, namely gender and identity, in a privacyprotecting setting. Experiments on the LibriSpeech dataset show that gender recognition and speaker verification can be reduced to a random guess, protecting against classification-based attacks.
引用
收藏
页码:1699 / 1703
页数:5
相关论文
共 50 条
  • [41] Social Representations, Gender and Identity: Interactions and Practices in a Context of Vulnerability
    Flores-Palacios, Fatima
    Serrano Oswald, S. Erendira
    [J]. PAPERS ON SOCIAL REPRESENTATIONS, 2019, 28 (02):
  • [42] Animating Face using Disentangled Audio Representations
    Mittal, Gaurav
    Wang, Baoyuan
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 3279 - 3287
  • [43] Learning Disentangled Representations of Video with Missing Data
    Massague, Armand Comas
    Zhang, Chi
    Feric, Zlatan
    Camps, Octavia
    Yu, Rose
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [44] Linear Disentangled Representations and Unsupervised Action Estimation
    Painter, Matthew
    Hare, Jonathon
    Prugel-Bennett, Adam
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [45] Learning Debiased and Disentangled Representations for Semantic Segmentation
    Chu, Sanghyeok
    Kim, Dongwan
    Han, Bohyung
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [46] On Disentangled Representations Learned from Correlated Data
    Traeuble, Frederik
    Creager, Elliot
    Kilbertus, Niki
    Locatello, Francesco
    Dittadi, Andrea
    Goyal, Anirudh
    Schoelkopf, Bernhard
    Bauer, Stefan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7412 - 7422
  • [47] Handwritten Text Generation via Disentangled Representations
    Liu, Xiyan
    Meng, Gaofeng
    Xiang, Shiming
    Pan, Chunhong
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1838 - 1842
  • [48] LEARNING DISENTANGLED FEATURE REPRESENTATIONS FOR ANOMALY DETECTION
    Lee, Wei-Yu
    Wang, Yu-Chiang Frank
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2156 - 2160
  • [49] Protecting Speech
    Abrams, Floyd
    [J]. NATION, 2011, 292 (05) : 22 - 23
  • [50] Learning Disentangled Multimodal Representations for the Fashion Domain
    Saha, Amrita
    Nawhal, Megha
    Khaprat, Mitesh M.
    Raykar, Vikas C.
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 557 - 566