Protecting gender and identity with disentangled speech representations

Cited by: 6
Authors
Stoidis, Dimitrios [1]
Cavallaro, Andrea [1]
Affiliations
[1] Queen Mary Univ London, Ctr Intelligent Sensing, London, England
Source
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
privacy; soft biometrics; disentangled representation learning; variational autoencoder; identification
DOI
10.21437/Interspeech.2021-2163
Chinese Library Classification number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
Besides its linguistic content, our speech is rich in biometric information that can be inferred by classifiers. Learning privacy-preserving representations for speech signals enables downstream tasks without sharing unnecessary, private information about an individual. In this paper, we show that protecting gender information in speech is more effective than modelling speaker-identity information only when generating a nonsensitive representation of speech. Our method relies on reconstructing speech by decoding linguistic content along with gender information using a variational autoencoder. Specifically, we exploit disentangled representation learning to encode information about different attributes into separate subspaces that can be factorised independently. We present a novel way to encode gender information and disentangle two sensitive biometric identifiers, namely gender and identity, in a privacy-protecting setting. Experiments on the LibriSpeech dataset show that gender recognition and speaker verification can be reduced to a random guess, protecting against classification-based attacks.
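The idea of encoding attributes into separate latent subspaces can be illustrated with a minimal sketch. This is not the authors' architecture: the abstract does not specify layer sizes, losses, or how the gender code is built, so every name below (`reparameterize`, `kl_to_standard_normal`, `split_latent`, `protect_gender`), the latent layout, and the choice of overwriting the gender subspace with a constant are assumptions made purely for illustration.

```python
import math
import random


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1), independently per dimension."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, sigma^2) || N(0, 1)) for a diagonal Gaussian, summed over dimensions."""
    return 0.5 * sum(m * m + math.exp(lv) - 1.0 - lv
                     for m, lv in zip(mu, log_var))


def split_latent(z, content_dim):
    """Hypothetical layout: first content_dim entries are content, the rest gender."""
    return z[:content_dim], z[content_dim:]


def protect_gender(z, content_dim, neutral_value=0.0):
    """Keep the content subspace and overwrite the gender subspace with a constant.

    Overwriting with a constant is an illustrative assumption, not the paper's method.
    """
    content, gender = split_latent(z, content_dim)
    return content + [neutral_value] * len(gender)


if __name__ == "__main__":
    rng = random.Random(0)
    mu = [0.5, -1.0, 0.2, 0.8]        # made-up posterior means
    log_var = [0.0, 0.0, 0.0, 0.0]    # unit variance in every dimension
    z = reparameterize(mu, log_var, rng)
    print("KL term:", kl_to_standard_normal(mu, log_var))
    print("protected latent:", protect_gender(z, content_dim=3))
```

In a real system the protected latent would be passed to a trained decoder to resynthesise speech; the sketch stops at the latent manipulation because the decoder is outside what the abstract describes.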
Pages: 1699-1703
Number of pages: 5