Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise

被引:23
|
作者
Raitio, Tuomo [1 ]
Suni, Antti [2 ]
Vainio, Martti [2 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
[2] Univ Helsinki, Dept Behav Sci, Helsinki, Finland
来源
COMPUTER SPEECH AND LANGUAGE | 2014年 / 28卷 / 02期
基金
芬兰科学院;
关键词
Statistical parametric speech synthesis; Adaptation; Vocal effort; Lombard speech; Breathy speech; Intelligibility; VOCAL EFFORT; ALGORITHM; FEATURES; VOICE;
D O I
10.1016/j.csl.2013.03.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This papers studies the synthesis of speech over a wide vocal effort continuum and its perception in the presence of noise. Three types of speech are recorded and studied along the continuum: breathy, normal, and Lombard speech. Corresponding synthetic voices are created by training and adapting the statistical parametric speech synthesis system GlottHMM. Natural and synthetic speech along the continuum is assessed in listening tests that evaluate the intelligibility, quality, and suitability of speech in three different realistic multichannel noise conditions: silence, moderate street noise, and extreme street noise. The evaluation results show that the synthesized voices with varying vocal effort are rated similarly to their natural counterparts both in terms of intelligibility and suitability. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:648 / 664
页数:17
相关论文
共 50 条
  • [21] Sparseness and speech perception in noise
    Li, Guoping
    Lutman, Mark E.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2162 - 2165
  • [22] Pediatric Speech Perception in Noise
    Bantwal, Anuradha R.
    Hall, James W., III
    CURRENT PEDIATRIC REVIEWS, 2011, 7 (03) : 214 - 226
  • [23] Sensory Inhibition Is Related to Variable Speech Perception in Noise in Adults With Normal Hearing
    Campbell, Julia
    Nielsen, Mashhood
    LaBrec, Alison
    Bean, Connor
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2020, 63 (05): : 1595 - 1607
  • [24] The effect of background noise on speech perception in monolingual and bilingual adults with normal hearing
    Alqattan, Danah
    Turner, Paul
    NOISE & HEALTH, 2021, 23 (110): : 67 - 74
  • [25] Sensory Inhibition and Speech Perception-in-Noise Performance in Children With Normal Hearing
    Campbell, Julia
    Rouse, Rixon
    Nielsen, Mashhood
    Potter, Sheri
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2023, 66 (01): : 382 - 399
  • [26] Analysis of HMM-Based Lombard Speech Synthesis
    Raitio, Tuomo
    Suni, Antti
    Vainio, Martti
    Alku, Paavo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2792 - +
  • [27] Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
    Korvel, Grazina
    Kakol, Krzysztof
    Treigys, Povilas
    Kostek, Bozena
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2022), 2022, 13515 : 399 - 407
  • [28] Online Lombard-adaptation in incremental speech synthesis
    Rottschaefer, Sebastian
    Buschmeier, Hendrik
    van Welbergen, Herwin
    Kopp, Stefan
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 80 - 84
  • [29] Measurement and prediction of speech and noise levels and the Lombard effect in eating establishments
    Hodgson, Murray
    Steininger, Gavin
    Razavi, Zohreh
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (04): : 2023 - 2033
  • [30] Measurement and prediction of speech and noise levels and the Lombard effect in eating establishments
    Hodgson, Murray
    Steininger, Gavin
    Razavi, Zohreh
    Journal of the Acoustical Society of America, 2007, 121 (04): : 2023 - 2033