Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise

Cited by: 23
Authors
Raitio, Tuomo [1]
Suni, Antti [2]
Vainio, Martti [2]
Alku, Paavo [1]
Affiliations
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
[2] Univ Helsinki, Dept Behav Sci, Helsinki, Finland
Source
Computer Speech and Language | 2014, Vol. 28, Issue 02
Funding
Academy of Finland
Keywords
Statistical parametric speech synthesis; Adaptation; Vocal effort; Lombard speech; Breathy speech; Intelligibility; VOCAL EFFORT; ALGORITHM; FEATURES; VOICE;
DOI
10.1016/j.csl.2013.03.003
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper studies the synthesis of speech over a wide vocal effort continuum and its perception in the presence of noise. Three types of speech are recorded and studied along the continuum: breathy, normal, and Lombard speech. Corresponding synthetic voices are created by training and adapting the statistical parametric speech synthesis system GlottHMM. Natural and synthetic speech along the continuum is assessed in listening tests that evaluate the intelligibility, quality, and suitability of speech in three different realistic multichannel noise conditions: silence, moderate street noise, and extreme street noise. The evaluation results show that the synthesized voices with varying vocal effort are rated similarly to their natural counterparts both in terms of intelligibility and suitability. © 2013 Elsevier Ltd. All rights reserved.
Pages: 648-664
Number of pages: 17