The Role of Voice Quality in the Perception of Prominence in Synthetic Speech

被引:0
|
作者
Murphy, Andy [1 ]
Yanushevskaya, Irena [1 ]
Chasaide, Ailbhe Ni [1 ]
Gobl, Christer [1 ]
机构
[1] Trinity Coll Dublin, Phonet & Speech Lab, Dublin, Ireland
来源
关键词
global waveshape parameter Rd; speech synthesis; voice quality; perception test; prominence; manipulation task; R-D; PARAMETER;
D O I
10.21437/Interspeech.2019-2761
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper explores how prominence can be modelled in speech synthesis through voice quality variation. Synthetic utterances varying in voice quality (breathy, modal, tense) were generated using a glottal source model where the global waveshape parameter R-d was the main control parameter and f(0) was not varied. A manipulation task perception experiment was conducted to establish perceptually salient R-d values in the signalling of focus. The participants were presented with mini-dialogues designed to elicit narrow focus (with different focal syllable locations) and were asked to manipulate an unknown parameter in the synthetic utterances to produce a natural response. The results showed that participants manipulated R-d not only in focal syllables, but also in the pre- and postfocal material. The direction of R-d manipulation in the focal syllables was the same across the three voice qualities - towards decreased Rd values (tenser phonation). The magnitude of the decrease in R-d was significantly less for tense voice compared to breathy and modal voice, but did not vary with the location of the focal syllable in the utterance. Overall, the results suggest that R-d is effective as a control parameter for modelling prominence in synthetic speech.
引用
收藏
页码:2543 / 2547
页数:5
相关论文
共 50 条
  • [1] Cue interaction in the perception of prosodic prominence: the role of voice quality
    Ludusan, Bogdan
    Wagner, Petra
    Wlodarczak, Marcin
    [J]. INTERSPEECH 2021, 2021, : 1006 - 1010
  • [2] The role of intonation and voice quality in the affective speech perception
    Grichkovtsova, Ioulia
    Lacheret, Anne
    Morel, Michel
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2037 - +
  • [3] The role of voice quality and prosodic contour in affective speech perception
    Grichkovtsova, Ioulia
    Morel, Michel
    Lacheret, Anne
    [J]. SPEECH COMMUNICATION, 2012, 54 (03) : 414 - 429
  • [4] Vowel-internal cues to vowel quality and prominence in speech perception
    Steffman, Jeremy
    [J]. PHONETICA, 2023, 80 (05) : 329 - 356
  • [5] Perception or synthesized voice quality in connected speech by Cantonese speakers
    Yiu, EML
    Murdoch, B
    Hird, K
    Lau, P
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 112 (03): : 1091 - 1101
  • [6] SCALING PERCEPTUAL QUALITY OF PATHOLOGICAL VOICE BY USE OF SYNTHETIC SPEECH
    IMAIZUMI, S
    HIKI, S
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S52 - S53
  • [7] The role of voice fundamental frequency in the perception of anger in clear speech
    Ferguson, Sarah H.
    Bennion, Sierra N.
    Smalley, Tara E.
    Young, Elizabeth D.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (03):
  • [8] Voice Source Contribution to Prominence Perception: Rd Implementation
    Murphy, Andy
    Yanushevskaya, Irena
    Ni Chasaide, Ailbhe
    Gobl, Christer
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 217 - 221
  • [9] Integrating Voice Quality Cues in the Pitch Perception of Speech and Non-speech Utterances
    Kuang, Jianjing
    Liberman, Mark
    [J]. FRONTIERS IN PSYCHOLOGY, 2018, 09
  • [10] Multimodal prosody: gestures and speech in the perception of prominence in Spanish
    Jimenez-Bravo, Miguel
    Marrero-Aguiar, Victoria
    [J]. FRONTIERS IN COMMUNICATION, 2024, 9