Speaker Anonymity and Voice Conversion Vulnerability: A Speaker Recognition Analysis

被引:1
|
作者
Saini, Shalini [1 ]
Saxena, Nitesh [1 ]
机构
[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77834 USA
关键词
Voice Anonymity; Voice Conversion; Speaker Recognition; Privacy and Security;
D O I
10.1109/CNS59707.2023.10289030
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Anonymized voice data is essential for maintaining privacy in voice-based exchanges of sensitive information, particularly in healthcare. However, voice conversion methods prioritize target voice identity over completely obscuring the source speaker's voice features. Recent advancements in speaker recognition systems have increased their ability to detect subtle residual voice features of the source speaker in voice-converted samples with greater precision and accuracy, posing potential risks to voice anonymity. Balancing speaker anonymity and recognition accuracy is a persistent challenge in voice-based applications, where maintaining voice anonymity and correctly identifying the speaker are critical but vulnerable. Violating voice anonymity can result in privacy and security threats. In this work, we examine multiple voice conversion and speaker recognition systems to explore the threats to voice anonymity. Our findings demonstrate the significant risk of identifying source speakers from converted voice samples. We discovered that voice anonymity is more vulnerable to breaking with one-to-one conversions compared to many-to-many and any-to-any conversions. The likelihood of identifying the original speaker from anonymized speech data depends on target voice features, voice conversion techniques, and speaker recognition methods.
引用
下载
收藏
页数:9
相关论文
共 50 条
  • [31] Speaker recognition and speaker normalization by projection to speaker subspace
    Ariki, Y
    Tagashira, S
    Nishijima, M
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 319 - 322
  • [32] Android Voice Recognition Application with Multi Speaker Feature
    Frewat, George
    Baroud, Charbel
    Sammour, Roy
    Kassem, Abdallah
    Hamad, Mustapha
    PROCEEDINGS OF THE 18TH MEDITERRANEAN ELECTROTECHNICAL CONFERENCE MELECON 2016, 2016,
  • [33] Speaker Independent Sinhala Speech Recognition for Voice Dialling
    Amarasingh, W. G. T. N.
    Gamini, D. D. A.
    INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER2012), 2012, : 3 - 6
  • [34] HUMAN SPEAKER RECOGNITION PERFORMANCE OF LPC VOICE PROCESSORS
    UZDY, Z
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (03): : 752 - 753
  • [35] ENABLING IMPROVED SPEAKER RECOGNITION BY VOICE QUALITY ESTIMATION
    Bartos, Anthony L.
    Nelson, Douglas J.
    2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 595 - 599
  • [36] DOMAIN ADAPTATION FOR SPEAKER RECOGNITION IN SINGING AND SPOKEN VOICE
    Chowdhury, Anurag
    Cozzo, Austin
    Ross, Arun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7192 - 7196
  • [37] Speaker independent voice recognition with a fuzzy neural network
    Nava, PA
    Taylor, JM
    FUZZ-IEEE '96 - PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 1996, : 2049 - 2052
  • [38] Interaction of face and voice areas during speaker recognition
    von Kriegstein, K
    Kleinschmidt, A
    Sterzer, P
    Giraud, AL
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2005, 17 (03) : 367 - 376
  • [39] ON THE USE OF SPEAKER SUPERFACTORS FOR SPEAKER RECOGNITION
    Scheffer, Nicolas
    Vogt, Robbie
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4410 - 4413
  • [40] Speaker Dependent Coefficients for Speaker Recognition
    Orsag, Filip
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2010, 4 (01): : 31 - 47