ElectrodeNet: A Deep-Learning-Based Sound Coding Strategy for Cochlear Implants

Cited by: 3
Authors
Huang, Enoch Hsin-Ho [1, 2]
Chao, Rong [2, 3]
Tsao, Yu [2, 4]
Wu, Chao-Min [1]
Affiliations
[1] Natl Cent Univ, Dept Elect Engn, Taoyuan 320317, Taiwan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115201, Taiwan
[3] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 701401, Taiwan
[4] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan 320314, Taiwan
Keywords
Channel selection (CS); cochlear implant (CI); deep learning; sound coding strategy; vocoder simulation; hearing health care; speech intelligibility; neural networks; perception; recognition; music; noise; combination; prediction; improve
DOI
10.1109/TCDS.2023.3275587
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
ElectrodeNet, a deep-learning-based sound coding strategy for the cochlear implant (CI), is proposed to emulate the advanced combination encoder (ACE) strategy by replacing the conventional envelope detection with various artificial neural networks. The extended ElectrodeNet-CS strategy further incorporates channel selection (CS). Network models based on a deep neural network (DNN), a convolutional neural network (CNN), and long short-term memory (LSTM) were trained using the fast Fourier transform (FFT) bins and channel envelopes obtained by processing clean speech with the ACE strategy. Objective speech understanding was estimated for ElectrodeNet in CI simulations using short-time objective intelligibility (STOI) and the normalized covariance metric (NCM). Sentence recognition tests for vocoded Mandarin speech were conducted with normal-hearing listeners. The DNN-, CNN-, and LSTM-based ElectrodeNets exhibited strong correlations with ACE in objective and subjective scores, as measured by mean squared error (MSE), linear correlation coefficient (LCC), and Spearman's rank correlation coefficient (SRCC). The ElectrodeNet-CS strategy was capable of producing N-of-M compatible electrode patterns using a modified DNN that embeds maxima selection, and it achieved similar or even slightly higher average STOI and sentence recognition scores compared to ACE. The methods and findings demonstrate the feasibility and potential of using deep learning in CI coding strategies.
Pages: 346-357
Page count: 12
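
Illustration
The abstract mentions N-of-M compatible electrode patterns obtained through maxima selection. The following is a minimal sketch of conventional per-frame N-of-M maxima selection, not the paper's modified DNN. The function name `n_of_m_select`, the array layout (frames by channels), and the toy values of M = 6 and N = 3 are assumptions made here for illustration only.

```python
import numpy as np


def n_of_m_select(envelopes, n):
    """Keep the n largest channel envelopes in each frame and zero the rest.

    envelopes: array of shape (frames, M) with non-negative channel envelopes
               (hypothetical layout, chosen for this sketch).
    n:         number of maxima to keep per frame, 1 <= n <= M.
    """
    envelopes = np.asarray(envelopes, dtype=float)
    if envelopes.ndim != 2:
        raise ValueError("envelopes must have shape (frames, M)")
    m = envelopes.shape[1]
    if not 1 <= n <= m:
        raise ValueError("n must satisfy 1 <= n <= M")

    # After partitioning around position m - n, the last n positions of each
    # row hold the indices of the n largest values (in arbitrary order).
    top = np.argpartition(envelopes, m - n, axis=1)[:, m - n:]

    # Build a boolean mask that is True only at the selected channels.
    mask = np.zeros(envelopes.shape, dtype=bool)
    np.put_along_axis(mask, top, True, axis=1)

    # Unselected channels are set to zero, i.e. they are not stimulated.
    return np.where(mask, envelopes, 0.0)


# Toy input: 2 frames, M = 6 channels, N = 3 maxima per frame.
env = np.array([
    [0.1, 0.9, 0.3, 0.7, 0.2, 0.5],
    [0.6, 0.0, 0.4, 0.8, 0.1, 0.3],
])
out = n_of_m_select(env, 3)
```

In this toy input every frame has distinct values, so exactly three channels per frame stay non-zero. With tied values, which of the tied channels is kept depends on the partition order, so a real strategy would need an explicit tie-breaking rule.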