Glottal and Vocal Tract Characteristics of Voice Impersonators

被引:10
|
作者
Bin Amin, Talal [1 ]
Marziliano, Pina [2 ]
German, James Sneed [3 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Div Informat Engn, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Humanities & Social Sci, Div Linguist & Multilingual Studies, Singapore 639798, Singapore
关键词
Acoustic; disguise; formant; glottal; open quotient; speech rate; vocal tract; voice identity; voice impersonator; SPEAKER RECOGNITION; DIALECT; SPEECH; VOWEL; SEX;
D O I
10.1109/TMM.2014.2300071
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Voice impersonators possess a flexible voice which allows them to imitate and create different voice identities. These impersonations present a challenge for forensic analysis and speaker identification systems. To better understand the phenomena underlying successful voice impersonation, we collected a database of synchronous speech and ElectroGlottoGraphic (EGG) signals from three voice impersonators each producing nine distinct voice identities. We analyzed glottal and vocal tract measures including F0, speech rate, vowel formant frequencies, and timing characteristics of the vocal folds. Our analysis confirmed that the impersonators modulated all four parameters in producing the voices, and provides a lower bound on the scale of variability that is available to impersonators. Importantly, vowel formant differences across voices were highly dependent on vowel category, showing that such effects cannot be captured by global transformations that ignore the linguistic parse. We address this issue through the development of a no-reference objective metric based on the vowel-dependent variance of the formants associated with each voice. This metric both ranks the impersonators natural voices highly, and correlates strongly with the results of a subjective listening test. Together, these results demonstrate the utility of voice variability data for the development of voice disguise detection and speaker identification applications.
引用
收藏
页码:668 / 678
页数:11
相关论文
共 50 条
  • [1] Determination of glottal open regions by exploiting changes in the vocal tract system characteristics
    Prasad, Ravi Shankar
    Yegnanarayana, B.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (01): : 666 - 677
  • [2] 'Mixing' the registers: Glottal source or vocal tract?
    Miller, DG
    Schutte, HK
    [J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 2005, 57 (5-6) : 278 - 291
  • [3] GLOTTAL SOURCE VOCAL-TRACT INTERACTION
    KOIZUMI, T
    TANIGUCHI, S
    HIROMITSU, S
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 78 (05): : 1541 - 1547
  • [4] Study of the effects of vocal tract constriction on glottal vibration
    Mittal, Vinay Kumar
    Yegnanarayana, B.
    Bhaskararao, Peri
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 136 (04): : 1932 - 1941
  • [5] An acoustic glottal source for vocal tract physical models
    Hannukainen, Antti
    Kuortti, Juha
    Malinen, Jarmo
    Ojalammi, Antti
    [J]. MEASUREMENT SCIENCE AND TECHNOLOGY, 2017, 28 (11)
  • [6] GLOTTAL SOURCE-VOCAL TRACT ACOUSTIC INTERACTION
    FANT, G
    LIN, QG
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S68 - S68
  • [7] The effect of glottal opening on the acoustic response of the vocal tract
    Barney, A.
    De Stefano, A.
    Henrich, N.
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2007, 93 (06) : 1046 - 1056
  • [8] Measuring variations of voice source and vocal tract characteristics from Korean emotional voice
    Jo, Cheolwoo
    Wang, Jianglin
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 800 - +
  • [9] A Simulation on the Effect of Glottal Boundary Conditions on Vocal Tract Formants
    Uezu, Yasufumi
    Kaburagi, Tokihiko
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2292 - 2296
  • [10] Vocal Tract and Glottal Function During and After Vocal Exercising With Resonance Tube and Straw
    Guzman, Marco
    Laukkanen, Anne-Maria
    Krupa, Petr
    Horacek, Jaromir
    Svec, Jan G.
    Geneid, Ahmed
    [J]. JOURNAL OF VOICE, 2013, 27 (04) : 523.e19 - 523.e34