A model of speech recognition for hearing-impaired listeners based on deep learning

被引:8
|
作者
Rossbach, Jana [1 ]
Kollmeier, Birger [2 ]
Meyer, Bernd T. [1 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Commun Acoust & Cluster Excellence Hearing4all, D-26111 Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Med Phys & Cluster Excellence Hearing4all, D-26111 Oldenburg, Germany
来源
关键词
INTELLIGIBILITY INDEX; RECEPTION THRESHOLD; FLUCTUATING NOISE; PREDICTION; ENVELOPE; PERCEPTION; MODULATION; ALGORITHM; MASKING;
D O I
10.1121/10.0009411
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic speech recognition (ASR) has made major progress based on deep machine learning, which motivated the use of deep neural networks (DNNs) as perception models and specifically to predict human speech recognition (HSR). This study investigates if a modeling approach based on a DNN that serves as phoneme classifier [Spille, Ewert, Kollmeier, and Meyer (2018). Comput. Speech Lang. 48, 51-66] can predict HSR for subjects with different degrees of hearing loss when listening to speech embedded in different complex noises. The eight noise signals range from simple stationary noise to a single competing talker and are added to matrix sentences, which are presented to 20 hearing-impaired (HI) listeners (categorized into three groups with different types of age-related hearing loss) to measure their speech recognition threshold (SRT), i.e., the signal-to-noise ratio with 50% word recognition rate. These are compared to responses obtained from the ASR-based model using degraded feature representations that take into account the individual hearing loss of the participants captured by a pure-tone audiogram. Additionally, SRTs obtained from eight normal-hearing (NH) listeners are analyzed. For NH subjects and three groups of HI listeners, the average SRT prediction error is below 2 dB, which is lower than the errors of the baseline models. (C) 2022 Authos(s).
引用
收藏
页码:1417 / 1427
页数:11
相关论文
共 50 条
  • [31] AUDITORY FILTER CHARACTERISTICS AND CONSONANT RECOGNITION FOR HEARING-IMPAIRED LISTENERS
    DUBNO, JR
    DIRKS, DD
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 85 (04): : 1666 - 1675
  • [32] EVALUATING A SPEECH-RECEPTION THRESHOLD-MODEL FOR HEARING-IMPAIRED LISTENERS - COMMENTS
    PLOMP, R
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (01): : 586 - 587
  • [33] Predicting speech intelligibility in hearing-impaired listeners using a physiologically inspired auditory model
    Zaar, Johannes
    Carney, Laurel H.
    HEARING RESEARCH, 2022, 426
  • [34] EVALUATING A SPEECH-RECEPTION THRESHOLD-MODEL FOR HEARING-IMPAIRED LISTENERS - REPLY
    HUMES, LE
    LEE, LW
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (01): : 588 - 589
  • [35] DETECTION AND RECOGNITION OF STOP CONSONANTS BY NORMAL-HEARING AND HEARING-IMPAIRED LISTENERS
    TURNER, CW
    FABRY, DA
    BARRETT, S
    HORWITZ, AR
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1992, 35 (04): : 942 - 949
  • [36] EEG-based auditory attention decoding with audiovisual speech for hearing-impaired listeners
    Wang, Bo
    Xu, Xiran
    Niu, Yadong
    Wu, Chao
    Wu, Xihong
    Chen, Jing
    CEREBRAL CORTEX, 2023, 33 (22) : 10972 - 10983
  • [37] LOUDNESS DISCOMFORT LEVELS OF HEARING-IMPAIRED LISTENERS USING SPEECH MATERIAL
    EDGERTON, BJ
    BEATTIE, RC
    WIDES, JW
    EAR AND HEARING, 1980, 1 (04): : 206 - 210
  • [38] BINAURAL SPEECH-DISCRIMINATION UNDER NOISE IN HEARING-IMPAIRED LISTENERS
    KUMAR, KV
    RAO, AB
    AVIATION SPACE AND ENVIRONMENTAL MEDICINE, 1988, 59 (10): : 932 - 936
  • [39] SPECTRAL ENHANCEMENT TO IMPROVE THE INTELLIGIBILITY OF SPEECH IN NOISE FOR HEARING-IMPAIRED LISTENERS
    SIMPSON, AM
    MOORE, BCJ
    GLASBERG, BR
    ACTA OTO-LARYNGOLOGICA, 1990, : 101 - 107
  • [40] Level variations in speech: Effect on masking release in hearing-impaired listeners
    Reed, Charlotte M.
    Desloge, Joseph G.
    Braida, Louis D.
    Perez, Zachary D.
    Leger, Agnes C.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (01): : 102 - 113