Long-Term SNR Estimation Using Noise Residuals and a Two-Stage Deep-Learning Framework

被引:8
|
作者
Dong, Xuan [1 ]
Williamson, Donald S. [1 ]
机构
[1] Indiana Univ, Bloomington, IN 47408 USA
关键词
Signal-to-noise ratio estimation; Speech separation; Deep neural networks; SPEECH ENHANCEMENT;
D O I
10.1007/978-3-319-93764-9_33
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Knowing the signal-to-noise ratio of a noisy speech signal is important since it can help improve speech applications. This paper presents a two-stage approach for estimating the long-term signal-to-noise ratio (SNR) of speech signals that are corrupted by background noise. The first stage produces noise residuals from a speech separation module. The second stage then uses the residuals and a deep neural network (DNN) to predict long-term SNR. Traditional SNR estimation approaches use signal processing, unsupervised learning, or computational auditory scene analysis (CASA) techniques. We propose a deep-learning based approach, since DNNs have outperformed other techniques in several speech processing tasks. We evaluate our approach across a variety of noise types and input SNR levels, using the TIMIT speech corpus and NOISEX-92 noise database. The results show that our approach generalizes well in unseen noisy environments, and it outperforms several existing methods.
引用
收藏
页码:351 / 360
页数:10
相关论文
共 50 条
  • [31] A two-stage network framework for topology optimization incorporating deep learning and physical information
    Wang, Dalei
    Ning, Yun
    Xiang, Cheng
    Chen, Airong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [32] A two-stage deep learning framework for counterfeit luxury handbag detection in logo images
    Jianbiao Peng
    Beiji Zou
    Chengzhang Zhu
    Signal, Image and Video Processing, 2023, 17 : 1439 - 1448
  • [33] Two-stage broad learning inversion framework for shear-wave velocity estimation
    Yang, Xiao-Hui
    Han, Peng
    Yang, Zhentao
    Chen, Xiaofei
    GEOPHYSICS, 2023, 88 (01) : WA219 - WA237
  • [34] Two-stage framework for optic disc localization and glaucoma classification in retinal fundus images using deep learning
    Bajwa, Muhammad Naseer
    Malik, Muhammad Imran
    Siddiqui, Shoaib Ahmed
    Dengel, Andreas
    Shafait, Faisal
    Neumeier, Wolfgang
    Ahmed, Sheraz
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (1)
  • [35] Two-stage framework for optic disc localization and glaucoma classification in retinal fundus images using deep learning
    Muhammad Naseer Bajwa
    Muhammad Imran Malik
    Shoaib Ahmed Siddiqui
    Andreas Dengel
    Faisal Shafait
    Wolfgang Neumeier
    Sheraz Ahmed
    BMC Medical Informatics and Decision Making, 19
  • [36] A Hybrid Deep Learning Framework for Long-Term Traffic Flow Prediction
    Li, Yiqun
    Chai, Songjian
    Ma, Zhengwei
    Wang, Guibin
    IEEE ACCESS, 2021, 9 : 11264 - 11271
  • [37] Long-term behaviour of a two-stage CW system regarding nitrogen removal
    Langergraber, Guenter
    Pressl, Alexander
    Leroch, Klaus
    Rohrhofer, Roland
    Haberl, Raimund
    WATER SCIENCE AND TECHNOLOGY, 2011, 64 (05) : 1137 - 1141
  • [38] Two-Stage Autotransplantation of the Human Submandibular Gland: First Long-Term Results
    Burghartz, Marc
    Ginzkey, Christian
    Hackenberg, Stephan
    Hagen, Rudolf
    LARYNGOSCOPE, 2016, 126 (07): : 1551 - 1555
  • [39] The Two-Stage Solution: Toward a Long-Term Israeli-Palestinian Truce
    Thrall, Nathan
    MEDITERRANEAN POLITICS, 2016, 21 (03) : 432 - 436
  • [40] Two-Stage Reconstruction in Bony Finger Joint Defects - Long-Term Results
    Moeller, Richard-Tobias
    Mentzel, Martin
    Vergote, Daniel
    Bauknecht, Simon
    HANDCHIRURGIE MIKROCHIRURGIE PLASTISCHE CHIRURGIE, 2024, 56 (03) : 227 - 234