Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users

被引:87
|
作者
Goehring, Tobias [1 ]
Bolner, Federico [2 ,3 ]
Monaghan, Jessica J. M. [1 ]
van Dijk, Bas [3 ]
Zarowski, Andrzej [4 ]
Bleeck, Stefan [1 ]
机构
[1] Univ Southampton, ISVR, Univ Rd, Southampton SO17 1BJ, Hants, England
[2] Katholieke Univ Leuven, ExpORL, O&N II Herestr 49, B-3000 Leuven, Belgium
[3] Cochlear Technol Ctr, Schalienhoevedreef 20 I, B-2800 Mechelen, Belgium
[4] Sint Augustinus Hosp, European Inst ORL HNS, Oosterveldlaan 24, B-2610 Antwerp, Belgium
基金
英国工程与自然科学研究理事会;
关键词
Cochlear implants; Noise reduction; Speech enhancement; Machine learning; Neural networks; HEARING-IMPAIRED LISTENERS; TIME-FREQUENCY MASKING; BACKGROUND-NOISE; AMPLITUDE-MODULATION; GAIN-FUNCTION; RECOGNITION; ALGORITHM; SUPPRESSION; REDUCTION; NUMBERS;
D O I
10.1016/j.heares.2016.11.012
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speech understanding in noisy environments is still one of the major challenges for cochlear implant (CI) users in everyday life. We evaluated a speech enhancement algorithm based on neural networks (NNSE) for improving speech intelligibility in noise for CI users. The algorithm decomposes the noisy speech signal into time-frequency units, extracts a set of auditory-inspired features and feeds them to the neural network to produce an estimation of which frequency channels contain more perceptually important information (higher signal-to-noise ratio, SNR). This estimate is used to attenuate noise dominated and retain speech-dominated CI channels for electrical stimulation, as in traditional n of-m CI coding strategies. The proposed algorithm was evaluated by measuring the speech-in-noise performance of 14 CI users using three types of background noise. Two NNSE algorithms were compared: a speaker-dependent algorithm, that was trained on the target speaker used for testing, and a speaker-independent algorithm, that was trained on different speakers. Significant improvements in the intelligibility of speech in stationary and fluctuating noises were found relative to the unprocessed condition for the speaker-dependent algorithm in all noise types and for the speaker independent algorithm in 2 out of 3 noise types. The NNSE algorithms used noise-specific neural networks that generalized to novel segments of the same noise type and worked over a range of SNRs. The proposed algorithm has the potential to improve the intelligibility of speech in noise for CI users while meeting the requirements of low computational complexity and processing delay for application in CI devices. (C) 2016 The Authors. Published by Elsevier B.V.
引用
收藏
页码:183 / 194
页数:12
相关论文
共 50 条
  • [21] Effects of a transient noise reduction algorithm on speech intelligibility in noise, noise tolerance and perceived annoyance in cochlear implant users
    Dingemanse, J. Gertjan
    Vroegop, Jantien L.
    Goedegebure, Andre
    [J]. INTERNATIONAL JOURNAL OF AUDIOLOGY, 2018, 57 (05) : 360 - 369
  • [22] Neural tracking of the speech envelope in cochlear implant users
    Somers, Ben
    Verschueren, Eline
    Francart, Tom
    [J]. JOURNAL OF NEURAL ENGINEERING, 2019, 16 (01)
  • [23] Electro-haptic enhancement of speech-in-noise performance in cochlear implant users
    Fletcher, Mark D.
    Hadeedi, Amatullah
    Goehring, Tobias
    Mills, Sean R.
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [24] Electro-haptic enhancement of speech-in-noise performance in cochlear implant users
    Mark D. Fletcher
    Amatullah Hadeedi
    Tobias Goehring
    Sean R. Mills
    [J]. Scientific Reports, 9
  • [25] The combined effects of reverberation and noise on speech intelligibility by cochlear implant listeners
    Hazrati, Oldooz
    Loizou, Philipos C.
    [J]. INTERNATIONAL JOURNAL OF AUDIOLOGY, 2012, 51 (06) : 437 - 443
  • [26] Speech Intelligibility for Cochlear Implant Users with the MMSE Noise-Reduction Time-Frequency Mask
    Mourao, Gustavo Leao
    Costa, Marcio Holsbach
    Paul, Stephan
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 60
  • [27] Digisonic® cochlear implant signal processing for speech intelligibility improvement in noise
    Wable, J
    Gallego, S
    Chouard, CH
    Meyer, B
    [J]. COCHLEAR IMPLANTS - AN UPDATE, 2002, : 159 - 164
  • [28] PREDICTING THE EFFECT OF AGC ON SPEECH INTELLIGIBILITY OF COCHLEAR IMPLANT RECIPIENTS IN NOISE
    Khing, Phyu P.
    Ambikairajah, Eliathamby
    Swanson, Brett A.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8061 - 8065
  • [29] Speech Intelligibility in Noise With a Pinna Effect Imitating Cochlear Implant Processor
    Wimmer, Wilhelm
    Weder, Stefan
    Caversaccio, Marco
    Kompis, Martin
    [J]. OTOLOGY & NEUROTOLOGY, 2016, 37 (01) : 19 - 23
  • [30] A Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients in the Presence of Competing Speech Noise
    Wang, Syu-Siang
    Tsao, Yu
    Wang, Hsiao-Lan Sharon
    Lai, Ying-Hui
    Li, Lieber Po-Hung
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 808 - 812