Increasing Speech Intelligibility via Spectral Shaping with Frequency Warping and Dynamic Range Compression plus Transient Enhancement

被引:0
|
作者
Godoy, Elizabeth [1 ]
Stylianou, Yannis [1 ]
机构
[1] Fdn Res & Technol Hellas, Inst Comp Sci, Iraklion, Greece
关键词
speech intelligibility; spectral shaping; frequency warping; dynamic range compression; HARD-OF-HEARING; CLEAR; PERCEPTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to make speech (natural or synthetic) more intelligible for listeners in real-world noisy environments, various modifications have been proposed that exploit spectral and temporal signal features. Previously, an evaluation campaign involving several approaches illustrated that a Spectral Shaping (SS) and Dynamic Range Compression (DRC) method proved highly successful at increasing speech intelligibility. For the public follow-up campaign (i.e., the Hurricane Challenge), this work introduces additional modifications into SSDRC in an attempt to further enhance intelligibility. First aiming to slow down the articulation rate, the speech is uniformly time stretched to effectively increase signal redundancy. Second, a frequency warping mechanism to expand vowel space is incorporated into the SS. Third, scaling to enhance the transient regions of speech is applied in the time-domain along with DRC. Objective and extensive subjective (i.e., the Hurricane Challenge) evaluations show that the new approach successfully achieves intelligibility gains over natural speech for all of the noise conditions evaluated, though compared to SSDRC, there is less advantage observed at higher SNR.
引用
收藏
页码:3539 / 3543
页数:5
相关论文
共 33 条
  • [11] Multichannel dynamic-range compression using digital frequency warping
    Kates, James M.
    Arehart, Kathryn Hoberg
    Eurasip Journal on Applied Signal Processing, 2005, 2005 (18): : 3003 - 3014
  • [12] THE EFFECTS OF SYLLABIC COMPRESSION AND FREQUENCY SHAPING ON SPEECH-INTELLIGIBILITY IN HEARING-IMPAIRED PEOPLE
    VERSCHUURE, H
    PRINSEN, TT
    DRESCHLER, WA
    EAR AND HEARING, 1994, 15 (01): : 13 - 21
  • [13] SPEECH-IN-NOISE INTELLIGIBILITY IMPROVEMENT BASED ON POWER RECOVERY AND DYNAMIC RANGE COMPRESSION
    Zorila, Tudor-Catalin
    Kandia, Varvara
    Stylianou, Yannis
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2075 - 2079
  • [14] Assessing the Intelligibility Impact of Vowel Space Expansion via Clear Speech-Inspired Frequency Warping
    Godoy, E.
    Koutsogiannaki, M.
    Stylianou, Y.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1168 - 1172
  • [15] Improving speech intelligibility in noise by SII-dependent preprocessing using frequency-dependent amplification and dynamic range compression
    Schepker, Henning
    Rennies, Jan
    Doclo, Simon
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3544 - 3548
  • [16] Automatic speech pronunciation correction with dynamic frequency warping-based spectral conversion
    Hojo, Nobukatsu
    Kameoka, Hirokazu
    Tanaka, Kou
    Kaneko, Takuhiro
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2310 - 2314
  • [17] The effect of hearing aid dynamic range compression on speech intelligibility in a realistic virtual sound environment
    Mansour, Naim
    Marschall, Marton
    Westermann, Adam
    May, Tobias
    Dau, Torsten
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (01): : 232 - 241
  • [18] Intelligibility and Clarity of Reverberant Speech: Effects of Wide Dynamic Range Compression Release Time and Working Memory
    Reinhart, Paul N.
    Souza, Pamela E.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2016, 59 (06): : 1543 - 1554
  • [19] Side effects of fast-acting dynamic range compression that affect intelligibility in a competing speech task
    Stone, MA
    Moore, BCJ
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (04): : 2311 - 2323
  • [20] Side effects of fast-acting dynamic range compression that affect intelligibility in a competing speech task
    Stone, M.A. (mas19@cam.ac.uk), 1600, Acoustical Society of America (116):