A model of speech recognition for hearing-impaired listeners based on deep learning

Cited by: 8

Authors:

Rossbach, Jana [1]

Kollmeier, Birger [2]

Meyer, Bernd T. [1]

Affiliations:

[1] Carl von Ossietzky Univ Oldenburg, Commun Acoust & Cluster Excellence Hearing4all, D-26111 Oldenburg, Germany

[2] Carl von Ossietzky Univ Oldenburg, Med Phys & Cluster Excellence Hearing4all, D-26111 Oldenburg, Germany

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2022 / Vol. 151 / Issue 03

Keywords:

INTELLIGIBILITY INDEX; RECEPTION THRESHOLD; FLUCTUATING NOISE; PREDICTION; ENVELOPE; PERCEPTION; MODULATION; ALGORITHM; MASKING;

DOI:

10.1121/10.0009411

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Automatic speech recognition (ASR) has made major progress based on deep machine learning, which has motivated the use of deep neural networks (DNNs) as perception models, specifically to predict human speech recognition (HSR). This study investigates whether a modeling approach based on a DNN that serves as a phoneme classifier [Spille, Ewert, Kollmeier, and Meyer (2018). Comput. Speech Lang. 48, 51-66] can predict HSR for subjects with different degrees of hearing loss when listening to speech embedded in different complex noises. The eight noise signals range from simple stationary noise to a single competing talker and are added to matrix sentences, which are presented to 20 hearing-impaired (HI) listeners (categorized into three groups with different types of age-related hearing loss) to measure their speech recognition threshold (SRT), i.e., the signal-to-noise ratio at which 50% of the words are recognized. These SRTs are compared to responses obtained from the ASR-based model, which uses degraded feature representations that take into account the individual hearing loss of the participants as captured by a pure-tone audiogram. Additionally, SRTs obtained from eight normal-hearing (NH) listeners are analyzed. For NH subjects and the three groups of HI listeners, the average SRT prediction error is below 2 dB, which is lower than the errors of the baseline models. © 2022 Author(s).
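
The abstract defines the SRT as the signal-to-noise ratio at which 50% of the words are recognized. As a minimal illustration of that definition only, the sketch below reads an SRT off a set of measured (SNR, word recognition rate) points by linear interpolation. The function name `estimate_srt`, the example data, and the choice of linear interpolation are assumptions made here for illustration; the record does not state how the study's listeners or model were scored.

```python
from typing import Sequence


def estimate_srt(
    snrs_db: Sequence[float],
    recognition_rates: Sequence[float],
    target: float = 0.5,
) -> float:
    """Return the SNR (dB) at which the recognition rate crosses `target`.

    Hypothetical helper: linearly interpolates between the two neighbouring
    measurement points that bracket the target rate.
    """
    if len(snrs_db) != len(recognition_rates):
        raise ValueError("snrs_db and recognition_rates must have equal length")

    # Sort the points by SNR so neighbouring pairs form interpolation segments.
    points = sorted(zip(snrs_db, recognition_rates))

    for (snr_lo, rate_lo), (snr_hi, rate_hi) in zip(points, points[1:]):
        # Use the first segment on which the rate rises through the target.
        if rate_lo <= target <= rate_hi and rate_hi != rate_lo:
            fraction = (target - rate_lo) / (rate_hi - rate_lo)
            return snr_lo + fraction * (snr_hi - snr_lo)

    raise ValueError("target rate is not bracketed by the measurements")


# Made-up example data (not from the study): word recognition rate per SNR.
example_snrs = [-12.0, -8.0, -4.0, 0.0]
example_rates = [0.10, 0.30, 0.70, 0.95]
print(f"Estimated SRT: {estimate_srt(example_snrs, example_rates):.1f} dB SNR")
```

With these made-up points the 50% crossing lies midway between -8 dB and -4 dB. A lower (more negative) SRT means a listener tolerates more noise at the same recognition rate.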


Pages: 1417-1427

Page count: 11

50 items in total

[21] BINAURAL SPEECH-INTELLIGIBILITY IN NOISE FOR HEARING-IMPAIRED LISTENERS
BRONKHORST, AW
PLOMP, R
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 86 (04): 1374 - 1383
[22] Auditory rehabilitation effects on speech lateralization in hearing-impaired listeners
Philibert, B
Collet, L
Vesson, JF
Veuillet, E
ACTA OTO-LARYNGOLOGICA, 2003, 123 (02) : 172 - 175
[23] Effects of reverberation on speech intelligibility in noise for hearing-impaired listeners
Cueille, Raphael
Lavandier, Mathieu
Grimault, Nicolas
ROYAL SOCIETY OPEN SCIENCE, 2022, 9 (08):
[24] SYLLABIC COMPRESSION AND SPEECH-INTELLIGIBILITY IN HEARING-IMPAIRED LISTENERS
VERSCHUURE, J
DRESCHLER, WA
DEHAAN, EH
VANCAPPELLEN, M
HAMMERSCHLAG, R
MARE, MJ
MAAS, AJJ
HIJMANS, AC
SCANDINAVIAN AUDIOLOGY, 1993, 22 : 92 - 100
[25] USE OF THE CONNECTED SPEECH TEST (CST) WITH HEARING-IMPAIRED LISTENERS
COX, RM
ALEXANDER, GC
GILMORE, C
PUSAKULICH, KM
EAR AND HEARING, 1988, 9 (04): 198 - 207
[26] SPEECH FOUNDATION MODELS ON INTELLIGIBILITY PREDICTION FOR HEARING-IMPAIRED LISTENERS
Cuervo, Santiago
Marxer, Ricard
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1421 - 1425
[27] EVALUATION OF A SPEECH ENHANCEMENT STRATEGY WITH NORMAL-HEARING AND HEARING-IMPAIRED LISTENERS
JAMIESON, DG
BRENNAN, RL
CORNELISSE, LE
EAR AND HEARING, 1995, 16 (03): 274 - 286
[28] Spectral integration of speech bands in normal-hearing and hearing-impaired listeners
Hall, Joseph W., III
Buss, Emily
Grose, John H.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 124 (02): 1105 - 1115
[29] Speech-cue transmission by an algorithm to increase consonant recognition in noise for hearing-impaired listeners
Healy, Eric W.
Yoho, Sarah E.
Wang, Yuxuan
Apoux, Frederic
Wang, DeLiang
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 136 (06): 3325 - 3336
[30] The role of working memory in speech recognition by hearing-impaired older listeners: does the task matter?
Strori, Dorina
Souza, Pamela E.
INTERNATIONAL JOURNAL OF AUDIOLOGY, 2023, 62 (11) : 1067 - 1075
