Voice Privacy Through Time-Scale and Pitch Modification

Times cited: 0
Authors
Prajapati, Gauri P. [1]
Singh, Dipesh K. [1]
Patil, Hemant A. [1]
Affiliations
[1] Dhirubhai Ambani Inst Informat & Commun Technol, Gandhinagar, Gujarat, India
Keywords
Voice privacy; speech perturbation; anonymization; SPEAKER;
DOI
10.1007/978-3-031-12700-7_8
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
An attacker can fraudulently gain access in place of the genuine user if the user's speech data is left unprotected. It is therefore important to protect users' speech data, and a voice privacy system can be employed for this purpose. A voice privacy system is not designed around any particular kind of attack; instead, it is designed in a generalized way, making it a universal system. This study presents time-scale and pitch modification-based anonymization methods that modify speaker-dependent speech parameters (i.e., F0) for better privacy preservation of speech data. The performance of the proposed voice privacy system is compared with the signal processing-based baseline system of the INTERSPEECH 2020 Voice Privacy Challenge. Among the perturbation methods evaluated, speed perturbation with a factor of 0.8 gives adequate speaker anonymization (38.5% Equal Error Rate (EER) and 91.3% De-Identification (DeID)) with acceptable speech intelligibility (4.86% Word Error Rate (WER)) for female speakers. Speed and pitch perturbation are observed to be two important candidates for anonymization, whereas tempo perturbation is not found to be as useful for speaker anonymization.
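As an illustration only, and not the authors' implementation, speed perturbation can be sketched as plain resampling: reading the waveform at a different rate changes its duration and scales its fundamental frequency together. The sketch below assumes linear interpolation with NumPy; the helper names `speed_perturb` and `dominant_frequency`, the 16 kHz sample rate, and the 200 Hz test tone standing in for F0 are all hypothetical choices made for the example.

```python
import numpy as np


def speed_perturb(signal, factor):
    """Resample `signal` so it plays `factor` times as fast at the same sample rate.

    Assumption: simple linear interpolation; a production system would
    normally use a band-limited resampler instead.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    n_out = int(round(len(signal) / factor))
    # Input positions (in samples) that each output sample reads from.
    positions = np.arange(n_out) * factor
    return np.interp(positions, np.arange(len(signal)), signal)


def dominant_frequency(signal, sample_rate):
    """Return the frequency (Hz) of the largest magnitude-spectrum peak."""
    spectrum = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    return freqs[np.argmax(spectrum)]


sample_rate = 16000                            # hypothetical sample rate
t = np.arange(sample_rate) / sample_rate       # one second of audio
tone = np.sin(2 * np.pi * 200.0 * t)           # 200 Hz tone standing in for F0

slowed = speed_perturb(tone, 0.8)              # factor taken from the abstract
print(len(tone), len(slowed))
print(dominant_frequency(tone, sample_rate), dominant_frequency(slowed, sample_rate))
```

With a factor of 0.8 the output is longer than the input and its spectral peak moves down by the same factor. In the common usage of these terms, tempo perturbation changes only the duration and pitch perturbation changes only F0, which is consistent with the abstract's observation that tempo perturbation contributes less to anonymization.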
Pages: 72-80
Page count: 9