Intelligibility Enhancement of Casual Speech for Reverberant Environments inspired by Clear Speech Properties

被引:0
|
作者
Koutsogiannaki, Maria [1 ]
Petkov, Petko N. [2 ]
Stylianou, Yannis [1 ,2 ]
机构
[1] Univ Crete, CSD, Multimedia Informat Lab, Iraklion, Greece
[2] Toshiba Res Europe Ltd, Cambridge Res Lab, Kawasaki, Kanagawa, Japan
关键词
Clear Speech; Casual Speech; Intelligibility; Reverberation; Spectral Transformations; Time Modifications; Pause insertion; HARD-OF-HEARING; CONVERSATIONAL SPEECH; VOWEL INTELLIGIBILITY; SPEAKING-RATE; PERCEPTION; TALKER; NOISE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Clear speech has been shown to have an intelligibility advantage over casual speech in noisy and reverberant environments. This work validates spectral and time domain modifications to increase the intelligibility of casual speech in reverberant environments by compensating particular differences between the two speaking styles. To compensate spectral differences, a frequency-domain filtering approach is applied to casual speech. In time domain, two techniques for time-scaling casual speech are explored: (1) uniform time-scaling and (2) pause insertion and phoneme elongation based on loudness and modulation criteria. The effect of the proposed modifications is evaluated through subjective listening tests in two reverberant conditions with reverberation time 0.8s and 2s. The combination of spectral transformation and uniform time-scaling is shown to be the most successful in increasing the intelligibility of casual speech. The evaluation results support the conclusion that modifications inspired by clear speech can be beneficial for the intelligibility enhancement of speech in reverberant environments.
引用
收藏
页码:65 / 69
页数:5
相关论文
共 50 条
  • [41] Padding zero into steady-state portions of speech as a preprocess for improving intelligibility in reverberant environments
    Arai, Takayuki
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2005, 26 (05) : 459 - 461
  • [42] The Extended Speech Transmission Index: Predicting speech intelligibility in fluctuating noise and reverberant rooms
    van Schoonhoven, Jelmer
    Rhebergen, Koenraad S.
    Dreschler, Wouter A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (03): : 1178 - 1194
  • [43] A Speech Preprocessing Method Based on Perceptually Optimized Envelope Processing to Increase Intelligibility in Reverberant Environments
    Fallah, Ali
    van de Par, Steven
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [45] Near end listening enhancement: Speech intelligibility improvement in noisy environments
    Sauert, Bastian
    Vary, Peter
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 493 - 496
  • [46] Time and Frequency Dependent Amplification for Speech Intelligibility Enhancement in Noisy Environments
    Brouckxon, Henk
    Verhelst, Werner
    De Schuymer, Bart
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 557 - +
  • [47] Approaching speech intelligibility enhancement with inspiration from Lombard and Clear speaking styles
    Godoy, Elizabeth
    Koutsogiannaki, Maria
    Stylianou, Yannis
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (02): : 629 - 647
  • [48] Speech intelligibility in quiet and noise environments with the speech Enhancer™ amplification and natural speech
    Weiss, L
    JOURNAL OF MEDICAL SPEECH-LANGUAGE PATHOLOGY, 2002, 10 (04) : 327 - 331
  • [49] Separation of Multiple Speech Sources in Reverberant Environments Based on Sparse Component Enhancement
    Li, Lu
    Jia, Maoshen
    Liu, Jinxiang
    Pai, Tun-Wen
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (10) : 6001 - 6028
  • [50] A STUDY ON JOINT BEAMFORMING AND SPECTRAL ENHANCEMENT FOR ROBUST SPEECH RECOGNITION IN REVERBERANT ENVIRONMENTS
    Xiong, Feifei
    Meyer, Bernd T.
    Goetze, Stefan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5043 - 5047