Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression

被引:0
|
作者
Zorila, Tudor-Catalin
Kandia, Varvara
Stylianou, Yannis
机构
关键词
speech-in-noise enhancement; speech intelligibility; spectral shaping; dynamic range compression; CLEAR SPEECH; LISTENERS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we suggest a non-parametric way to improve the intelligibility of speech in noise. The signal is enhanced before presented in a noisy environment, under the constraint of equal global signal power before and after modifications. Two systems are combined in a cascade form to enhance the quality of the signal first in frequency (spectral shaping) and then in time (dynamic range compression). Experiments with speech shaped (SSN) and competing speaker (CS) types of noise at various low SNR values, show that the suggested approach outperforms state-of-the art methods in terms of the Speech Intelligibility Index (SII). In terms of SNR gain there is an improvement of 7 dB (SSN) and 8 dB (CS) over these methods. A formal listening test confirm the efficiency of the suggested system in enhancing speech intelligibility in noise.
引用
收藏
页码:634 / 637
页数:4
相关论文
共 50 条
  • [1] SPEECH-IN-NOISE INTELLIGIBILITY IMPROVEMENT BASED ON POWER RECOVERY AND DYNAMIC RANGE COMPRESSION
    Zorila, Tudor-Catalin
    Kandia, Varvara
    Stylianou, Yannis
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2075 - 2079
  • [2] Speech-in-noise enhancement using amplification and dynamic range compression controlled by the speech intelligibility index
    Schepker, Henning
    Rennies, Jan
    Doclo, Simon
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 138 (05): : 2692 - 2706
  • [3] Characterizing Speech Intelligibility in Noise After Wide Dynamic Range Compression
    Rhebergen, Koenraad S.
    Maalderink, Thijs H.
    Dreschler, Wouter A.
    EAR AND HEARING, 2017, 38 (02): : 194 - 204
  • [4] End-to-End Neural Based Modification of Noisy Speech for Speech-in-Noise Intelligibility Improvement
    Shifas, Muhammed P., V
    Zorila, Catalin
    Stylianou, Yannis
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 162 - 173
  • [5] Increasing Speech Intelligibility via Spectral Shaping with Frequency Warping and Dynamic Range Compression plus Transient Enhancement
    Godoy, Elizabeth
    Stylianou, Yannis
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3539 - 3543
  • [6] Auditory efferents involved in speech-in-noise intelligibility
    Giraud, AL
    Garnier, S
    Micheyl, C
    Lina, G
    Chays, A
    CheryCroze, S
    NEUROREPORT, 1997, 8 (07) : 1779 - 1783
  • [7] Glimpse-based estimation of speech intelligibility from speech-in-noise using artificial neural networks
    Tang, Yan
    COMPUTER SPEECH AND LANGUAGE, 2021, 69
  • [8] Linking dynamic-range compression across the ears can improve speech intelligibility in spatially separated noise
    Wiggins, Ian M.
    Seeber, Bernhard U.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (02): : 1004 - 1016
  • [9] Linking dynamic-range compression across the ears can improve speech intelligibility in spatially separated noise
    Wiggins, I.M. (Ian.Wiggins@nottingham.ac.uk), 1600, Acoustical Society of America (133):
  • [10] SII-based Speech Preprocessing for Intelligibility Improvement in Noise
    Taal, Cees H.
    Jensen, Jesper
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3549 - 3553