The Role of Voice Quality in the Perception of Prominence in Synthetic Speech

被引:0
|
作者
Murphy, Andy [1 ]
Yanushevskaya, Irena [1 ]
Chasaide, Ailbhe Ni [1 ]
Gobl, Christer [1 ]
机构
[1] Trinity Coll Dublin, Phonet & Speech Lab, Dublin, Ireland
来源
关键词
global waveshape parameter Rd; speech synthesis; voice quality; perception test; prominence; manipulation task; R-D; PARAMETER;
D O I
10.21437/Interspeech.2019-2761
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper explores how prominence can be modelled in speech synthesis through voice quality variation. Synthetic utterances varying in voice quality (breathy, modal, tense) were generated using a glottal source model where the global waveshape parameter R-d was the main control parameter and f(0) was not varied. A manipulation task perception experiment was conducted to establish perceptually salient R-d values in the signalling of focus. The participants were presented with mini-dialogues designed to elicit narrow focus (with different focal syllable locations) and were asked to manipulate an unknown parameter in the synthetic utterances to produce a natural response. The results showed that participants manipulated R-d not only in focal syllables, but also in the pre- and postfocal material. The direction of R-d manipulation in the focal syllables was the same across the three voice qualities - towards decreased Rd values (tenser phonation). The magnitude of the decrease in R-d was significantly less for tense voice compared to breathy and modal voice, but did not vary with the location of the focal syllable in the utterance. Overall, the results suggest that R-d is effective as a control parameter for modelling prominence in synthetic speech.
引用
收藏
页码:2543 / 2547
页数:5
相关论文
共 50 条
  • [21] The Role of Outer Hair Cell Function in the Perception of Synthetic versus Natural Speech
    Wolters, Maria
    Campbell, Pauline
    DePlacido, Christine
    Liddell, Amy
    Owens, David
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 149 - +
  • [22] Making predictable unpredictable with style - Behavioral and electrophysiological evidence for the critical role of prosodic expectations in the perception of prominence in speech
    Kakouros, Sofoklis
    Salminen, Nelli
    Rasanen, Okko
    [J]. NEUROPSYCHOLOGIA, 2018, 109 : 181 - 199
  • [23] ROLE OF SYNTHETIC SPEECH IN SPEECH RESEARCH
    LAWRENCE, W
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1964, 36 (05): : 1022 - &
  • [24] Auditory discrimination of natural speech and synthetic speech used as voice disguise
    Amino, Kanae
    Makinae, Hisanori
    Kamada, Toshiaki
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2018, 39 (01) : 48 - 50
  • [25] The effect of speech melody on voice quality
    Swerts, M
    Veldhuis, R
    [J]. SPEECH COMMUNICATION, 2001, 33 (04) : 297 - 303
  • [26] The analysis of voice quality in speech processing
    Keller, E
    [J]. NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 54 - 73
  • [27] The emotional quality of speech in voice services
    Maffiolo, V
    Chateau, N
    [J]. ERGONOMICS, 2003, 46 (13-14) : 1375 - 1385
  • [28] EFFECTS OF SPEECH RATE AND PITCH CONTOUR ON THE PERCEPTION OF SYNTHETIC SPEECH
    SLOWIACZEK, LM
    NUSBAUM, HC
    [J]. HUMAN FACTORS, 1985, 27 (06) : 701 - 712
  • [29] LISTENER EXPERIENCE AND PERCEPTION OF VOICE QUALITY
    KREIMAN, J
    GERRATT, BR
    PRECODA, K
    [J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1990, 33 (01): : 103 - 115
  • [30] THE PERCEPTION OF VOICE QUALITY BY BRAZILIAN BILINGUALS
    Petriu Ferreira Engelbert, Ana Paula
    Kluge, Denise Cristina
    [J]. ILHA DO DESTERRO-A JOURNAL OF ENGLISH LANGUAGE LITERATURES IN ENGLISH AND CULTURAL STUDIES, 2018, 71 (03): : 125 - 141