Design Choices for X-vector Based Speaker Anonymization

被引:17
|
作者
Srivastava, Brij Mohan Lal [1 ]
Tomashenko, N. [2 ]
Wang, Xin [3 ]
Vincent, Emmanuel [4 ]
Yamagishi, Junichi [3 ]
Maouche, Mohamed [1 ]
Bellet, Aurelien [1 ]
Tommasi, Marc [5 ]
机构
[1] Inria, Le Chesnay, France
[2] Avignon Univ, Lab Informat Avignon LIA, Avignon, France
[3] Natl Inst Informat, Tokyo, Japan
[4] Univ Lorraine, LORIA, Inria, CNRS, Nancy, France
[5] Univ Lille, Lille, France
来源
关键词
speaker anonymization; VoicePrivacy challenge; voice conversion; PLDA; x-vectors;
D O I
10.21437/Interspeech.2020-2692
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
The recently proposed x-vector based anonymization scheme converts any input voice into that of a random pseudo-speaker. In this paper, we present a flexible pseudo-speaker selection technique as a baseline for the first VoicePrivacy Challenge. We explore several design choices for the distance metric between speakers, the region of x-vector space where the pseudo-speaker is picked, and gender selection. To assess the strength of anonymization achieved, we consider attackers using an x-vector based speaker verification system who may use original or anonymized speech for enrollment, depending on their knowledge of the anonymization scheme. The Equal Error Rate (EER) achieved by the attackers and the decoding Word Error Rate (WER) over anonymized data are reported as the measures of privacy and utility. Experiments are performed using datasets derived from LibriSpeech to find the optimal combination of design choices in terms of privacy and utility.
引用
收藏
页码:1713 / 1717
页数:5
相关论文
共 50 条
  • [1] Privacy and Utility of X-Vector Based Speaker Anonymization
    Srivastava, Brij Mohan Lal
    Maouche, Mohamed
    Sahidullah, Md
    Vincent, Emmanuel
    Bellet, Aurelien
    Tommasi, Marc
    Tomashenko, Natalia
    Wang, Xin
    Yamagishi, Junichi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2383 - 2395
  • [2] Speaker anonymization by modifying fundamental frequency and x-vector singular value
    Mawalim, Candy Olivia
    Galajit, Kasorn
    Karnjana, Jessada
    Kidani, Shunsuke
    Unoki, Masashi
    [J]. COMPUTER SPEECH AND LANGUAGE, 2022, 73
  • [3] Voice Privacy Through x-vector and CycleGAN-based Anonymization
    Prajapati, Gauri P.
    Singh, Dipesh K.
    Amin, Preet P.
    Patil, Hemant A.
    [J]. INTERSPEECH 2021, 2021, : 1684 - 1688
  • [4] X-Vector Singular Value Modification and Statistical-Based Decomposition with Ensemble Regression Modeling for Speaker Anonymization System
    Mawalim, Candy Olivia
    Galajit, Kasorn
    Karnjana, Jessada
    Unoki, Masashi
    [J]. INTERSPEECH 2020, 2020, : 1703 - 1707
  • [5] Bayesian HMM based x-vector clustering for Speaker Diarization
    Diez, Mireia
    Burget, Lukas
    Wang, Shuai
    Rohdin, Johan
    Cernocky, Jan
    [J]. INTERSPEECH 2019, 2019, : 346 - 350
  • [6] A Study of X-vector Based Speaker Recognition on Short Utterances
    Kanagasundaram, A.
    Sridharan, S.
    Sriram, G.
    Prachi, S.
    Fookes, C.
    [J]. INTERSPEECH 2019, 2019, : 2943 - 2947
  • [7] Research on x-vector speaker recognition algorithm based on Kaldi
    Zhao, Hong
    Yue, Lupeng
    Wang, Weijie
    Zeng, Xiangyan
    [J]. INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2022, 15 (03) : 199 - 212
  • [8] Multi-task learning for X-vector based speaker recognition
    Zhang Y.
    Liu L.
    [J]. International Journal of Speech Technology, 2023, 26 (04) : 817 - 823
  • [9] X-vector anonymization using autoencoders and adversarial training for preserving speech privacy
    Perero-Codosero, Juan M.
    Espinoza-Cuadros, Fernando M.
    Hernandez-Gomez, Luis A.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2022, 74
  • [10] Review of different robust x-vector extractors for speaker verification
    Rouvier, Mickael
    Dufour, Richard
    Bousquet, Pierre-Michel
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 366 - 370