A preliminary study of voice quality transformation based on modifications to the neutral vocal tract area function

被引:15
|
作者
Story, BH [1 ]
Titze, IR [1 ]
机构
[1] Univ Iowa, Dept Speech Pathol & Audiol, Iowa City, IA 52242 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1006/jpho.2002.0168
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The idea is pursued that voice quality can be partially represented by the underlying shape of a speaker's neutral vocal tract. Using an area function model, which allows direct access to the neutral tract shape, four separate modifications were made to one male speaker's vocal tract. The modifications involve imposing constrictive or expansive effects on the pharyngeal and oral portions of the neutral area function as well as on lip aperture and the epi-laryngeal tube. A single word utterance was first synthesized by superimposing deformation patterns appropriate for the word onto the original neutral tract shape (area function). Then, four additional samples of the word were synthesized using different modified neutral area function each time. The modifications were assessed by comparing F1-F2 formant trajectories of the original utterance with those of the modifications. The formant frequencies were observed to shift within the F1-F2. plane in directions predictable from simple tube acoustics. However, the modified voice qualities did not preserve the shape of the original F1-F2 trajectory. In other words, the modifications did not create a simple linear transformation of formant frequencies even though the "articulatory dynamics" (deformation patterns of the area function) were identical in all cases. These somewhat artificial vocal tract modifications were also compared with formant frequencies extracted from recordings of a speaker attempting to produce the same types of modifications. In general, the speaker's formant trajectories showed some similarities to the synthesized versions. However, the speaker also seemed to grade the "level" of the voice quality that was exerted on the utterance depending on whether the demands of the voice quality were in competition with the linguistic demands of a given phonetic segment. Finally, to demonstrate this type of voice quality modification in a broader context, the same procedures were applied to sentence-level speech and results were again shown as F1-F2 formant trajectories. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:485 / 509
页数:25
相关论文
共 50 条
  • [31] Obtaining the vocal-tract area function from the vowel sound
    Deng, Huiqun
    Beddoes, Michael P.
    Ward, Rabab K.
    Hodgson, Murray
    [J]. Canadian Acoustics - Acoustique Canadienne, 2003, 31 (03): : 40 - 41
  • [32] A parametric model of the vocal tract area function for vowel and consonant simulation
    Story, BH
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (05): : 3231 - 3254
  • [33] DETERMINATION OF VOCAL-TRACT-AREA FUNCTION FROM TRANSFER IMPEDANCE
    STANSFIELD, EV
    BOGNER, RE
    [J]. PROCEEDINGS OF THE INSTITUTION OF ELECTRICAL ENGINEERS-LONDON, 1973, 120 (02): : 153 - 158
  • [34] Vocal Tract and Area Function Estimation with both Lip and Glottal Losses
    Kalgaonkar, Kaustubh
    Clements, Mark
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 997 - 1000
  • [35] Voice Complaints, Vocal and Non-vocal Behaviours Among Beatboxers-A Preliminary Study
    Balasubramanium, Radish K.
    Dsouza, Siona Benita
    Rao, Ananya
    Saldanha, Samantha J. L.
    Jahan, Najiya
    Thomas, Edna
    Gunjawate, Dhanshree R.
    [J]. JOURNAL OF VOICE, 2023, 37 (02) : 293.e1 - 293.e6
  • [36] A Preliminary Study on Vocal Tract System of Chinese Whispered Vowels
    Gong, Chenghui
    Zhao, Heming
    Lue, Gang
    Liu, Jianxin
    [J]. 2007 SECOND INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, 2007, : 177 - +
  • [37] Mixed source model and its adapted vocal tract filter estimate for voice transformation and synthesis
    Degottex, Gilles
    Lanchantin, Pierre
    Roebel, Axel
    Rodet, Xavier
    [J]. SPEECH COMMUNICATION, 2013, 55 (02) : 278 - 294
  • [38] Consequences of voice impairment in daily life for patients following radiotherapy for early glottic cancer: Voice quality, vocal function, and vocal performance
    Verdonck-de Leeuw, IM
    Keus, RB
    Hilgers, FJM
    Koopmans-van Beinum, FJ
    Greven, AJ
    de Jong, JMA
    Vreeburg, G
    Bartelink, H
    [J]. INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 1999, 44 (05): : 1071 - 1078
  • [39] A DEEP LEARNING APPROACH FOR DATA DRIVEN VOCAL TRACT AREA FUNCTION ESTIMATION
    Asadiabadi, Sasan
    Erzin, Engin
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 167 - 173
  • [40] Voice-Related Problems, Vocal and Non-Vocal Habits in Naradiya Kirtankars: A Preliminary Study
    Karulkar, Rasika R.
    Gunjawate, Dhanshree R.
    [J]. JOURNAL OF VOICE, 2023, 37 (06) : 970.e11 - 970.e18