A Real-Time Convolutional Neural Network Based Speech Enhancement for Hearing Impaired Listeners Using Smartphone

被引:0
|
作者
Bhat, Gautam S. [1 ]
Shankar, Nikhil [1 ]
Reddy, Chandan K. A. [2 ]
Panahi, Issa M. S. [1 ]
机构
[1] Univ Texas Dallas, Dept Elect & Comp Engn, Richardson, TX 75080 USA
[2] Microsoft Corp, IC3 AI, Redmond, WA 98052 USA
来源
IEEE ACCESS | 2019年 / 7卷
基金
美国国家卫生研究院;
关键词
Convolutional neural network (CNN); speech enhancement (SE); hearing aid (HA); smartphone; real-time implementation; log power spectra (LPS); NOISE; SUPPRESSION; ALGORITHM;
D O I
10.1109/ACCESS.2019.2922370
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a Speech Enhancement (SE) technique based on multi-objective learning convolutional neural network to improve the overall quality of speech perceived by Hearing Aid (HA) users. The proposed method is implemented on a smartphone as an application that performs real-time SE. This arrangement works as an assistive tool to HA. A multi-objective learning architecture including primary and secondary features uses a mapping-based convolutional neural network (CNN) model to remove noise from a noisy speech spectrum. The algorithm is computationally fast and has a low processing delay which enables it to operate seamlessly on a smartphone. The steps and the detailed analysis of real-time implementation are discussed. The proposed method is compared with existing conventional and neural network-based SE techniques through speech quality and intelligibility metrics in various noisy speech conditions. The key contribution of this paper includes the realization of CNN SE model on a smartphone processor that works seamlessly with HA. The experimental results demonstrate significant improvements over the state-of-the-art techniques and reflect the usability of the developed SE application in noisy environments.
引用
收藏
页码:78421 / 78433
页数:13
相关论文
共 50 条
  • [1] Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network
    Girirajan, S.
    Pandian, A.
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 35 (02): : 1987 - 2001
  • [2] A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement
    Tan, Ke
    Wang, DeLiang
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3229 - 3233
  • [3] Real-Time Convolutional Neural Network-Based Speech Source Localization on Smartphone
    Kucuk, Abdullah
    Ganguly, Anshuman
    Hao, Yiya
    Panahi, Issa M. S.
    [J]. IEEE ACCESS, 2019, 7 : 169969 - 169978
  • [4] RETRACTED ARTICLE: Deep convolutional neural network-based speech enhancement to improve speech intelligibility and quality for hearing-impaired listeners
    P. F. Khaleelur Rahiman
    V. S. Jayanthi
    A. N. Jayanthi
    [J]. Medical & Biological Engineering & Computing, 2019, 57 : 757 - 757
  • [5] RETRACTION: Deep convolutional neural network-based speech enhancement to improve speech intelligibility and quality for hearing-impaired listeners (Retraction of 2018)
    Rahiman, P. F. Khaleelur
    Jayanthi, V. S.
    Jayanthi, A. N.
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2019, 57 (03) : 757 - 757
  • [6] Smartphone-based Real-time Speech Enhancement for Improving Hearing Aids Speech Perception
    Rao, Yu
    Hao, Yiya
    Panahi, Issa M. S.
    Kehtarnavaz, Nasser
    [J]. 2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 5885 - 5888
  • [7] TCNN: TEMPORAL CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SPEECH ENHANCEMENT IN THE TIME DOMAIN
    Pandey, Ashutosh
    Wang, DeLiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6875 - 6879
  • [8] Convolutional Neural Network Based Real Time Arabic Speech Recognition to Arabic Braille for Hearing and Visually Impaired
    Bhatia, Surbhi
    Devi, Ajantha
    Alsuwailem, Razan Ibrahim
    Mashat, Arwa
    [J]. FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [9] Enhancement of spectral contrast in speech for hearing impaired listeners
    Munoz, CMA
    Nelson, PB
    Rutledge, JC
    [J]. PROCEEDINGS OF THE IEEE-EURASIP WORKSHOP ON NONLINEAR SIGNAL AND IMAGE PROCESSING (NSIP'99), 1999, : 457 - 460
  • [10] Convolutional quasi-recurrent network for real-time speech enhancement
    Shi, Yunlong
    Yuan, Wenhao
    Hu, Shaodong
    Lou, Yingxi
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (03): : 183 - 190