Speech enhancement with noise estimation and filtration using deep learning models

被引:0
|
作者
Kantamaneni, Sravanthi [1 ]
Charles, A. [1 ]
Babu, T. Ranga [2 ]
机构
[1] Annamalai Univ, ECE, Chidambaram, Tamil Nadu, India
[2] RVR&JC Coll Engn, ECE, Chowdavaram, Andhra Pradesh, India
关键词
Speech enhancement; Perceptual quality; Speech signal; RESNET-50; Denoising; Deep transfer learning model;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speech enhancement helps in eliminating the environmental noises from the communica-tion signals. The main intention of the augmentation system is to develop the perceptual quality of communication or speech. For this purpose, various filtering schemes, spectral restoration models and speech models were implemented. In order to improve the odds of reducing noise and restoring the original signal, artificial intelligence (AI) and machine learning algorithms (MLA) were included into every sector. Deep transfer learning was used in this work to remove noise from the data and restore the original signals. This proposed approach includes a filtration scheme instead of using a convolution layer in the RESNET-50 architecture. The filters tested for speech enhanced deep learning models are modified Kalman filter and enhanced wiener filter. The performance metrics were calculated be-tween various algorithms and proposed models to identify which approaches to follow the better way result obtained. The performance metrics compared PESA, LSD and segSNR for different low signal to noise ratio conditions. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:14 / 28
页数:15
相关论文
共 50 条
  • [1] Speech enhancement with noise estimation and filtration using deep learning models
    Kantamaneni, Sravanthi
    Charles, A.
    Babu, T. Ranga
    THEORETICAL COMPUTER SCIENCE, 2023, 941 : 14 - 28
  • [2] iDeepMMSE: An Improved Deep Learning Approach to MMSE Speech and Noise Power Spectrum Estimation for Speech Enhancement
    Kim, Minseung
    Song, Hyungchan
    Cheong, Sein
    Shin, Jong Won
    INTERSPEECH 2022, 2022, : 181 - 185
  • [3] Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement
    Sun, Meng
    Zhang, Xiongwei
    Van hamme, Hugo
    Zheng, Thomas Fang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (01) : 93 - 104
  • [4] Explaining deep learning models for speech enhancement
    Sivasankaran, Sunit
    Vincent, Emmanuel
    Fohr, Dominique
    INTERSPEECH 2021, 2021, : 696 - 700
  • [5] Voice disorder classification using speech enhancement and deep learning models
    Chaiani, Mounira
    Selouani, Sid Ahmed
    Boudraa, Malika
    Yakoub, Mohammed Sidi
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2022, 42 (02) : 463 - 480
  • [6] Deep Learning-based Speech Presence Probability Estimation for Noise PSD Estimation in Single-channel Speech Enhancement
    Yang, Haemin
    Choe, Soyeon
    Kim, Keulbit
    Kang, Hong-Goo
    2018 INTERNATIONAL CONFERENCE ON SIGNALS AND SYSTEMS (ICSIGSYS), 2018, : 267 - 270
  • [7] Speech enhancement using Maximum A-Posteriori and Gaussian Mixture Models for speech and noise Periodogram estimation
    Chehrehsa, Sarang
    Moir, Tom James
    COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 58 - 71
  • [8] Noise profiling for speech enhancement employing machine learning models
    Kakol, Krzysztof
    Korvel, Grazina
    Kostek, Bozena
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 152 (06): : 3595 - 3605
  • [9] Noise profiling for speech enhancement employing machine learning models
    Kąkol, Krzysztof
    Korvel, Gra ina
    Kostek, Bo ena
    Journal of the Acoustical Society of America, 2022, 152 (06): : 3595 - 3605
  • [10] Non-intrusive speech intelligibility estimation using deep learning with speech enhancement and convolutional layers
    Nakazawa, Kazushi
    Kondo, Kazuhiro
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1047 - 1052