Glottal and Vocal Tract Characteristics of Voice Impersonators

Cited by: 10
Authors
Bin Amin, Talal [1]
Marziliano, Pina [2]
German, James Sneed [3]
Affiliations
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Div Informat Engn, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Humanities & Social Sci, Div Linguist & Multilingual Studies, Singapore 639798, Singapore
Keywords
Acoustic; disguise; formant; glottal; open quotient; speech rate; vocal tract; voice identity; voice impersonator; SPEAKER RECOGNITION; DIALECT; SPEECH; VOWEL; SEX
DOI
10.1109/TMM.2014.2300071
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Voice impersonators possess a flexible voice which allows them to imitate and create different voice identities. These impersonations present a challenge for forensic analysis and speaker identification systems. To better understand the phenomena underlying successful voice impersonation, we collected a database of synchronous speech and ElectroGlottoGraphic (EGG) signals from three voice impersonators each producing nine distinct voice identities. We analyzed glottal and vocal tract measures including F0, speech rate, vowel formant frequencies, and timing characteristics of the vocal folds. Our analysis confirmed that the impersonators modulated all four parameters in producing the voices, and provides a lower bound on the scale of variability that is available to impersonators. Importantly, vowel formant differences across voices were highly dependent on vowel category, showing that such effects cannot be captured by global transformations that ignore the linguistic parse. We address this issue through the development of a no-reference objective metric based on the vowel-dependent variance of the formants associated with each voice. This metric both ranks the impersonators' natural voices highly and correlates strongly with the results of a subjective listening test. Together, these results demonstrate the utility of voice variability data for the development of voice disguise detection and speaker identification applications.
Pages: 668-678
Number of pages: 11
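
The abstract mentions a no-reference metric based on the vowel-dependent variance of the formants, but this record does not give its definition. The sketch below is only an illustration of the general idea of computing formant variance per vowel category rather than globally. The function name `vowel_dependent_variance`, the token layout `(vowel, F1, F2)`, and the choice to sum F1 and F2 variances and average across vowels are all assumptions made here, not the authors' published formula.

```python
from collections import defaultdict
from statistics import pvariance


def vowel_dependent_variance(tokens):
    """Hypothetical illustration, not the paper's metric.

    tokens: iterable of (vowel_label, f1_hz, f2_hz) measurements for one voice.
    Returns (score, per_vowel), where per_vowel maps each vowel to the sum of
    its F1 and F2 population variances, and score is the mean over vowels.
    """
    by_vowel = defaultdict(list)
    for vowel, f1, f2 in tokens:
        by_vowel[vowel].append((float(f1), float(f2)))

    per_vowel = {}
    for vowel, formants in by_vowel.items():
        if len(formants) < 2:
            # A single token carries no variance information; skip it.
            continue
        f1s, f2s = zip(*formants)
        per_vowel[vowel] = pvariance(f1s) + pvariance(f2s)

    if not per_vowel:
        raise ValueError("need at least two tokens for some vowel")

    score = sum(per_vowel.values()) / len(per_vowel)
    return score, per_vowel


# Made-up formant values (Hz), for demonstration only.
example = [
    ("i", 300, 2200), ("i", 320, 2300),
    ("a", 700, 1200), ("a", 740, 1100),
]
score, per_vowel = vowel_dependent_variance(example)
```

Grouping by vowel label before taking the variance reflects the abstract's point that formant differences depend on vowel category, so a single global variance over all tokens would mix between-vowel and within-vowel spread.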