An optimized convolutional neural network for speech enhancement

被引:0
|
作者
Karthik A. [1 ,2 ]
Mazher Iqbal J.L. [1 ]
机构
[1] Department of ECE, Veltech Rangarajan Dr Sagunthala R&D Institute of Science and Technology, Chennai
[2] Department of ECE, Institute of Aeronautical Engineering, Hyderabad
关键词
Character error rate; Convolutional neural network; Minimization; Optimization; Recognition; Speech enhancement;
D O I
10.1007/s10772-023-10073-6
中图分类号
学科分类号
摘要
Speech enhancement is an important property in today’s world because most applications use voice recognition as an important feature for performing operations in it. Perfect recognition of commands is achieved only by recognizing the voice correctly. Hence, the speech signal must be enhanced and free from background noise for the recognition process. In the existing approach, a recurrent convolutional encoder/decoder is used for denoising the speech signal. It utilized the signal-to-noise ratio property for enhancing the speech signal. It removes the noise signal effectively by having a low character error rate. But it does not describe the range of SNR of the noise added to the signal. Hence, in this, optimized deep learning is proposed to enhance the speech signal. AI function deep learning mimics the human brain's ability to analyze data and create patterns for use in making decisions. An optimized convolutional neural network was proposed for enhancing the speech for a different type of signal-to-noise ratio value of noises. Here, the particle swarm optimization process performs tuning the hyper-parameters of the convolutional neural network. The tuning of value is to minimize the character error rate of the signal. The proposed method is realized using MATLAB R2020b software and evaluation takes place by calculating the character error rate, PESQ, and STOI of the signal. Then, the comparison of the proposed and existing method takes place using evaluation metrics with − 5 dB, 0 dB, + 5 dB and + 10 dB. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
引用
收藏
页码:1117 / 1129
页数:12
相关论文
共 50 条
  • [1] A Fully Convolutional Neural Network for Speech Enhancement
    Park, Se Rim
    Lee, Jin Won
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1993 - 1997
  • [2] Speech Enhancement based on Deep Convolutional Neural Network
    Nuthakki, Ramesh
    Masanta, Payel
    Yukta, T. N.
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 770 - 775
  • [3] Speech Enhancement using Fully Convolutional UNET and Gated Convolutional Neural Network
    Baloch, Danish
    Abdullah, Sidrah
    Qaiser, Asma
    Ahmed, Saad
    Nasim, Faiza
    Kanwal, Mehreen
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 831 - 836
  • [4] Dilated convolutional recurrent neural network for monaural speech enhancement
    Pirhosseinloo, Shadi
    Brumberg, Jonathan S.
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 158 - 162
  • [5] Convolutional Deep Neural Network and Full Connectivity for Speech Enhancement
    Alameri, Ban M.
    Kadhim, Inas Jawad
    Hadi, Suha Qasim
    Hassoon, Ali F.
    Abd, Mustafa M.
    Premaratne, Prashan
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (04) : 140 - 154
  • [6] Speech Enhancement using Convolutional Neural Network with Skip Connections
    Shi, Yupeng
    Rong, Weicong
    Zheng, Nengheng
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 6 - 10
  • [7] Regression-Based Speech Enhancement by Convolutional Neural Network
    Erseven, Mustafa
    Bolat, Bulent
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [8] Single channel speech enhancement using convolutional neural network
    Kounovsky, Tomas
    Malek, Jiri
    2017 IEEE INTERNATIONAL WORKSHOP OF ELECTRONICS, CONTROL, MEASUREMENT, SIGNALS AND THEIR APPLICATION TO MECHATRONICS (ECMSM), 2017,
  • [9] Speech enhancement method based on convolutional gated recurrent neural network
    Yuan W.
    Lou Y.
    Xia B.
    Sun W.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2019, 47 (04): : 13 - 18
  • [10] SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement
    Fu, Szu-Wei
    Tsao, Yu
    Lu, Xugang
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3768 - 3772