Speech enhancement with noise estimation and filtration using deep learning models

被引:0
|
作者
Kantamaneni, Sravanthi [1 ]
Charles, A. [1 ]
Babu, T. Ranga [2 ]
机构
[1] Annamalai Univ, ECE, Chidambaram, Tamil Nadu, India
[2] RVR&JC Coll Engn, ECE, Chowdavaram, Andhra Pradesh, India
关键词
Speech enhancement; Perceptual quality; Speech signal; RESNET-50; Denoising; Deep transfer learning model;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speech enhancement helps in eliminating the environmental noises from the communica-tion signals. The main intention of the augmentation system is to develop the perceptual quality of communication or speech. For this purpose, various filtering schemes, spectral restoration models and speech models were implemented. In order to improve the odds of reducing noise and restoring the original signal, artificial intelligence (AI) and machine learning algorithms (MLA) were included into every sector. Deep transfer learning was used in this work to remove noise from the data and restore the original signals. This proposed approach includes a filtration scheme instead of using a convolution layer in the RESNET-50 architecture. The filters tested for speech enhanced deep learning models are modified Kalman filter and enhanced wiener filter. The performance metrics were calculated be-tween various algorithms and proposed models to identify which approaches to follow the better way result obtained. The performance metrics compared PESA, LSD and segSNR for different low signal to noise ratio conditions. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:14 / 28
页数:15
相关论文
共 50 条
  • [31] Noise Estimation and Suppression Using Nonlinear Function with A Priori Speech Absence Probability in Speech Enhancement
    Lee, Soojeong
    Lee, Gangseong
    JOURNAL OF SENSORS, 2016, 2016
  • [32] Hardware Efficient Speech Enhancement With Noise Aware Multi-Target Deep Learning
    Abdullah, Salinna
    Zamani, Majid
    Demosthenous, Andreas
    IEEE OPEN JOURNAL OF CIRCUITS AND SYSTEMS, 2024, 5 : 141 - 152
  • [33] Learning Noise Adapters for Incremental Speech Enhancement
    Yang, Ziye
    Song, Xiang
    Chen, Jie
    Richard, Cedric
    Cohen, Israel
    IEEE Signal Processing Letters, 2024, 31 : 2915 - 2919
  • [34] Improving Speech Enhancement in Unseen Noise Using Deep Convolutional Neural Network
    Yuan W.-H.
    Sun W.-Z.
    Xia B.
    Ou S.-F.
    Zidonghua Xuebao/Acta Automatica Sinica, 2018, 44 (04): : 751 - 759
  • [35] Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks
    Kumar, Anurag
    Florencio, Dinei
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3738 - 3742
  • [36] Online noise estimation using stochastic-gain HMM for speech enhancement
    Zhao, David Y.
    Kleijn, W. Bastiaan
    Ypma, Alexander
    de Vries, Bert
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (04): : 835 - 846
  • [37] Variance Normalized Perceptual Subspace Speech Enhancement With Noise Estimation Using SPP
    Surendran, Sudeep
    Kumar, T. Kishore
    2016 INTERNATIONAL CONFERENCE ON NEXT GENERATION INTELLIGENT SYSTEMS (ICNGIS), 2016, : 364 - 369
  • [38] Speech Enhancement Based on Adaptive Noise Power Estimation Using Spectral Difference
    Choi, Jae-Hun
    Chang, Joon-Hyuk
    Kim, Dong Kook
    Kim, Suhyun
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2011, E94A (10) : 2031 - 2034
  • [39] Speech Enhancement Using Successive State Estimation under Industrial Noise Environment
    Wu, Qinghe
    Wu, Haifeng
    Zeng, Yu
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND SYSTEMS (ICACS 2018), 2018, : 214 - 219
  • [40] ADAPTIVE NOISE POWER ESTIMATION USING SPECTRAL DIFFERENCE FOR ROBUST SPEECH ENHANCEMENT
    Choi, Jae-Hun
    Kim, Sang-Kyun
    Chang, Joon-Hyuk
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4649 - 4652