Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users

被引:87
|
作者
Goehring, Tobias [1 ]
Bolner, Federico [2 ,3 ]
Monaghan, Jessica J. M. [1 ]
van Dijk, Bas [3 ]
Zarowski, Andrzej [4 ]
Bleeck, Stefan [1 ]
机构
[1] Univ Southampton, ISVR, Univ Rd, Southampton SO17 1BJ, Hants, England
[2] Katholieke Univ Leuven, ExpORL, O&N II Herestr 49, B-3000 Leuven, Belgium
[3] Cochlear Technol Ctr, Schalienhoevedreef 20 I, B-2800 Mechelen, Belgium
[4] Sint Augustinus Hosp, European Inst ORL HNS, Oosterveldlaan 24, B-2610 Antwerp, Belgium
基金
英国工程与自然科学研究理事会;
关键词
Cochlear implants; Noise reduction; Speech enhancement; Machine learning; Neural networks; HEARING-IMPAIRED LISTENERS; TIME-FREQUENCY MASKING; BACKGROUND-NOISE; AMPLITUDE-MODULATION; GAIN-FUNCTION; RECOGNITION; ALGORITHM; SUPPRESSION; REDUCTION; NUMBERS;
D O I
10.1016/j.heares.2016.11.012
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speech understanding in noisy environments is still one of the major challenges for cochlear implant (CI) users in everyday life. We evaluated a speech enhancement algorithm based on neural networks (NNSE) for improving speech intelligibility in noise for CI users. The algorithm decomposes the noisy speech signal into time-frequency units, extracts a set of auditory-inspired features and feeds them to the neural network to produce an estimation of which frequency channels contain more perceptually important information (higher signal-to-noise ratio, SNR). This estimate is used to attenuate noise dominated and retain speech-dominated CI channels for electrical stimulation, as in traditional n of-m CI coding strategies. The proposed algorithm was evaluated by measuring the speech-in-noise performance of 14 CI users using three types of background noise. Two NNSE algorithms were compared: a speaker-dependent algorithm, that was trained on the target speaker used for testing, and a speaker-independent algorithm, that was trained on different speakers. Significant improvements in the intelligibility of speech in stationary and fluctuating noises were found relative to the unprocessed condition for the speaker-dependent algorithm in all noise types and for the speaker independent algorithm in 2 out of 3 noise types. The NNSE algorithms used noise-specific neural networks that generalized to novel segments of the same noise type and worked over a range of SNRs. The proposed algorithm has the potential to improve the intelligibility of speech in noise for CI users while meeting the requirements of low computational complexity and processing delay for application in CI devices. (C) 2016 The Authors. Published by Elsevier B.V.
引用
收藏
页码:183 / 194
页数:12
相关论文
共 50 条
  • [1] Speech onset enhancement improves intelligibility in adverse listening conditions for cochlear implant users
    Koning, Raphael
    Wouters, Jan
    [J]. HEARING RESEARCH, 2016, 342 : 13 - 22
  • [2] Artificial speech bandwidth extension improves telephone speech intelligibility and quality in cochlear implant users
    Nogueira, W.
    Abel, J.
    Fingscheidt, T.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (03): : 1640 - 1649
  • [3] Spectral contrast enhancement improves speech intelligibility in noise for cochlear implants
    Nogueira, Waldo
    Rode, Thilo
    Buechner, Andreas
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (02): : 728 - 739
  • [4] Speech Intelligibility Enhancement in Strong Mechanical Noise Based on Neural Networks
    Cheng, Feng
    Wang, Xiaochen
    Gang, Li
    Tu, Weiping
    Wang, Jinshan
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 702 - 712
  • [5] Comparing the Speech Perception of Cochlear Implant Users with Three Different Finnish Speech Intelligibility Tests in Noise
    Willberg, Tytti
    Sivonen, Ville
    Linder, Pia
    Dietz, Aarno
    [J]. JOURNAL OF CLINICAL MEDICINE, 2021, 10 (16)
  • [6] Environment-specific noise suppression for improved speech intelligibility by cochlear implant users
    Hu, Yi
    Loizou, Philipos C.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (06): : 3689 - 3695
  • [7] Effect of Spectral Contrast Enhancement on Speech-on-Speech Intelligibility and Voice Cue Sensitivity in Cochlear Implant Users
    El Boghdady, Nawal
    Langner, Florian
    Gaudrain, Etienne
    Baskent, Deniz
    Nogueira, Waldo
    [J]. EAR AND HEARING, 2021, 42 (02): : 271 - 289
  • [8] Spatial Hearing and Speech Intelligibility in Bilateral Cochlear Implant Users
    Litovsky, Ruth Y.
    Parkinson, Aaron
    Arcaroli, Jennifer
    [J]. EAR AND HEARING, 2009, 30 (04): : 419 - 431
  • [9] SPEECH ENHANCEMENT BASED ON NEURAL NETWORKS APPLIED TO COCHLEAR IMPLANT CODING STRATEGIES
    Bolner, Federico
    Goehring, Tobias
    Monaghan, Jessica
    van Dijk, Bas
    Wouters, Jan
    Bleeck, Stefan
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6520 - 6524
  • [10] Head orientation benefit to speech intelligibility in noise for cochlear implant users and in realistic listening conditions
    Grange, Jacques A.
    Culling, John F.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (06): : 4061 - 4072