Speech enhancement with noise estimation and filtration using deep learning models

被引:0
|
作者
Kantamaneni, Sravanthi [1 ]
Charles, A. [1 ]
Babu, T. Ranga [2 ]
机构
[1] Annamalai Univ, ECE, Chidambaram, Tamil Nadu, India
[2] RVR&JC Coll Engn, ECE, Chowdavaram, Andhra Pradesh, India
关键词
Speech enhancement; Perceptual quality; Speech signal; RESNET-50; Denoising; Deep transfer learning model;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speech enhancement helps in eliminating the environmental noises from the communica-tion signals. The main intention of the augmentation system is to develop the perceptual quality of communication or speech. For this purpose, various filtering schemes, spectral restoration models and speech models were implemented. In order to improve the odds of reducing noise and restoring the original signal, artificial intelligence (AI) and machine learning algorithms (MLA) were included into every sector. Deep transfer learning was used in this work to remove noise from the data and restore the original signals. This proposed approach includes a filtration scheme instead of using a convolution layer in the RESNET-50 architecture. The filters tested for speech enhanced deep learning models are modified Kalman filter and enhanced wiener filter. The performance metrics were calculated be-tween various algorithms and proposed models to identify which approaches to follow the better way result obtained. The performance metrics compared PESA, LSD and segSNR for different low signal to noise ratio conditions. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:14 / 28
页数:15
相关论文
共 50 条
  • [21] Subband noise estimation for speech enhancement using a perceptual Wiener filter
    Lin, L
    Holmes, WH
    Ambikairajah, E
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 80 - 83
  • [22] Noise estimation based on entropy without using VAD for speech enhancement
    Ravi Teja, B.
    Bhavani, S.
    International Journal of Signal Processing, Image Processing and Pattern Recognition, 2014, 7 (02) : 355 - 364
  • [23] Deep Inference for Covariance Estimation: Learning Gaussian Noise Models for State Estimation
    Liu, Katherine
    Ok, Kyel
    Vega-Brown, William
    Roy, Nicholas
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 1436 - 1443
  • [24] A deep representation learning speech enhancement method using β-VAE
    Xiang, Yang
    Hojvang, Jesper Lisby
    Rasmussen, Morten Hojfeldt
    Christensen, Mads Graesboll
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 359 - 363
  • [25] A CROSS-TASK TRANSFER LEARNING APPROACH TO ADAPTING DEEP SPEECH ENHANCEMENT MODELS TO UNSEEN BACKGROUND NOISE USING PAIRED SENONE CLASSIFIERS
    Wang, Sicheng
    Li, Wei
    Siniscalchi, Sabato Marco
    Lee, Chin-Hui
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6219 - 6223
  • [26] An Ensemble Method for Multiple Speech Enhancement Using Deep Learning
    Fujita, Masahiko
    Itoyama, Katsutoshi
    Nishida, Kenji
    Nakadai, Kazuhiro
    2023 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION, SII, 2023,
  • [27] NOISE ESTIMATION WITH LOW COMPLEXITY FOR SPEECH ENHANCEMENT
    Yong, Pei Chee
    Nordholm, Sven
    Dam, Hai Huyen
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 109 - 112
  • [28] Adaptive noise estimation algorithm for speech enhancement
    Lin, L
    Holmes, WL
    Ambikairajah, E
    ELECTRONICS LETTERS, 2003, 39 (09) : 754 - 755
  • [29] Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition
    Tu, Yan-Hui
    Du, Jun
    Lee, Chin-Hui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2080 - 2091
  • [30] Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
    Korvel, Grazina
    Kakol, Krzysztof
    Kurasova, Olga
    Kostek, Bozena
    IEEE ACCESS, 2020, 8 (155156-155170) : 155156 - 155170