An optimized convolutional neural network for speech enhancement

Cited by: 0

Authors:

Karthik A. [1,2]

Mazher Iqbal J.L. [1]

Affiliations:

[1] Department of ECE, Veltech Rangarajan Dr Sagunthala R&D Institute of Science and Technology, Chennai

[2] Department of ECE, Institute of Aeronautical Engineering, Hyderabad

Source:

International Journal of Speech Technology | 2023 / Vol. 26 / Issue 04

Keywords:

Character error rate; Convolutional neural network; Minimization; Optimization; Recognition; Speech enhancement

DOI:

10.1007/s10772-023-10073-6

Abstract:

Speech enhancement matters because many applications rely on voice recognition to carry out operations, and commands are recognized reliably only when the voice itself is recognized correctly. The speech signal must therefore be enhanced and freed from background noise before recognition. An existing approach denoises the speech signal with a recurrent convolutional encoder/decoder that exploits the signal-to-noise ratio (SNR) of the signal. It removes noise effectively and achieves a low character error rate, but it does not specify the SNR range of the noise added to the signal. This work therefore proposes optimized deep learning for speech enhancement. Deep learning is an AI technique that mimics the human brain's ability to analyze data and form patterns for decision making. An optimized convolutional neural network is proposed to enhance speech corrupted by noise at different SNR values. Particle swarm optimization tunes the hyper-parameters of the convolutional neural network so as to minimize the character error rate. The proposed method is implemented in MATLAB R2020b and evaluated using the character error rate, PESQ, and STOI. The proposed and existing methods are then compared on these metrics at SNRs of −5 dB, 0 dB, +5 dB, and +10 dB. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
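
The evaluation and tuning steps named in the abstract can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the authors' MATLAB implementation. The functions `mix_at_snr`, `character_error_rate`, and `pso_minimize` are illustrative names. `surrogate_cer` is a hypothetical stand-in for training the CNN and measuring its character error rate, and the two hyper-parameters (log learning rate, filter count), their bounds, and the PSO coefficients are assumptions, since the abstract does not list them.

```python
import math
import random


def mix_at_snr(clean, noise, snr_db):
    """Scale `noise` so that clean + noise has the requested SNR in dB."""
    p_clean = sum(x * x for x in clean) / len(clean)
    p_noise = sum(x * x for x in noise) / len(noise)
    # SNR_dB = 10*log10(p_clean / (gain^2 * p_noise)), solved for gain
    gain = math.sqrt(p_clean / (p_noise * 10 ** (snr_db / 10)))
    return [c + gain * n for c, n in zip(clean, noise)]


def character_error_rate(reference, hypothesis):
    """Levenshtein edit distance divided by the reference length."""
    prev = list(range(len(hypothesis) + 1))
    for i, r in enumerate(reference, start=1):
        curr = [i]
        for j, h in enumerate(hypothesis, start=1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (r != h)))  # substitution
        prev = curr
    return prev[-1] / len(reference)


def pso_minimize(objective, bounds, n_particles=20, n_iters=60,
                 inertia=0.7, c_personal=1.5, c_global=1.5, seed=0):
    """Basic particle swarm optimization over box-bounded parameters."""
    rng = random.Random(seed)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds]
           for _ in range(n_particles)]
    vel = [[0.0] * len(bounds) for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]
    best_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=best_val.__getitem__)
    g_pos, g_val = best_pos[g][:], best_val[g]
    for _ in range(n_iters):
        for i in range(n_particles):
            for d, (lo, hi) in enumerate(bounds):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (inertia * vel[i][d]
                             + c_personal * r1 * (best_pos[i][d] - pos[i][d])
                             + c_global * r2 * (g_pos[d] - pos[i][d]))
                # Move the particle and clamp it to the search box
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < best_val[i]:
                best_pos[i], best_val[i] = pos[i][:], val
                if val < g_val:
                    g_pos, g_val = pos[i][:], val
    return g_pos, g_val


def surrogate_cer(params):
    """Hypothetical smooth stand-in for 'train the CNN, return its CER'."""
    log_lr, n_filters = params
    return 0.05 + 0.01 * (log_lr + 3.0) ** 2 + 1e-5 * (n_filters - 64.0) ** 2


if __name__ == "__main__":
    # Assumed search box: log10 learning rate and number of filters
    best, cer = pso_minimize(surrogate_cer, [(-5.0, -1.0), (8.0, 128.0)])
    print("best hyper-parameters:", best, "surrogate CER:", cer)
```

In a real pipeline the objective would train and evaluate the network for every particle at each noise level, which is far more expensive than the surrogate used here. PESQ and STOI are omitted because they need dedicated implementations.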

Pages: 1117-1129

Page count: 12

50 items in total

[1] A Fully Convolutional Neural Network for Speech Enhancement
Park, Se Rim
Lee, Jin Won
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017: 1993-1997
[2] Speech Enhancement based on Deep Convolutional Neural Network
Nuthakki, Ramesh
Masanta, Payel
Yukta, T. N.
PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021: 770-775
[3] Speech Enhancement using Fully Convolutional UNET and Gated Convolutional Neural Network
Baloch, Danish
Abdullah, Sidrah
Qaiser, Asma
Ahmed, Saad
Nasim, Faiza
Kanwal, Mehreen
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14(11): 831-836
[4] Dilated convolutional recurrent neural network for monaural speech enhancement
Pirhosseinloo, Shadi
Brumberg, Jonathan S.
CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019: 158-162
[5] Convolutional Deep Neural Network and Full Connectivity for Speech Enhancement
Alameri, Ban M.
Kadhim, Inas Jawad
Hadi, Suha Qasim
Hassoon, Ali F.
Abd, Mustafa M.
Premaratne, Prashan
INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19(04): 140-154
[6] Speech Enhancement using Convolutional Neural Network with Skip Connections
Shi, Yupeng
Rong, Weicong
Zheng, Nengheng
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018: 6-10
[7] Regression-Based Speech Enhancement by Convolutional Neural Network
Erseven, Mustafa
Bolat, Bulent
2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018
[8] Single channel speech enhancement using convolutional neural network
Kounovsky, Tomas
Malek, Jiri
2017 IEEE INTERNATIONAL WORKSHOP OF ELECTRONICS, CONTROL, MEASUREMENT, SIGNALS AND THEIR APPLICATION TO MECHATRONICS (ECMSM), 2017
[9] Speech enhancement method based on convolutional gated recurrent neural network
Yuan W.
Lou Y.
Xia B.
Sun W.
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2019, 47(04): 13-18
[10] SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement
Fu, Szu-Wei
Tsao, Yu
Lu, Xugang
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016: 3768-3772
