Long-Term SNR Estimation Using Noise Residuals and a Two-Stage Deep-Learning Framework

被引:8
|
作者
Dong, Xuan [1 ]
Williamson, Donald S. [1 ]
机构
[1] Indiana Univ, Bloomington, IN 47408 USA
关键词
Signal-to-noise ratio estimation; Speech separation; Deep neural networks; SPEECH ENHANCEMENT;
D O I
10.1007/978-3-319-93764-9_33
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Knowing the signal-to-noise ratio of a noisy speech signal is important since it can help improve speech applications. This paper presents a two-stage approach for estimating the long-term signal-to-noise ratio (SNR) of speech signals that are corrupted by background noise. The first stage produces noise residuals from a speech separation module. The second stage then uses the residuals and a deep neural network (DNN) to predict long-term SNR. Traditional SNR estimation approaches use signal processing, unsupervised learning, or computational auditory scene analysis (CASA) techniques. We propose a deep-learning based approach, since DNNs have outperformed other techniques in several speech processing tasks. We evaluate our approach across a variety of noise types and input SNR levels, using the TIMIT speech corpus and NOISEX-92 noise database. The results show that our approach generalizes well in unseen noisy environments, and it outperforms several existing methods.
引用
收藏
页码:351 / 360
页数:10
相关论文
共 50 条
  • [21] A Two-Stage Deep-Learning Model for Link Prediction Based on Network Structure and Node Attributes
    Liu, Peng
    Gui, Liang
    Wang, Huirong
    Riaz, Muhammad
    SUSTAINABILITY, 2022, 14 (23)
  • [22] A Novel Two-stage Separable Deep Learning Framework for Practical Blind Watermarking
    Liu, Yang
    Guo, Mengxi
    Zhang, Jian
    Zhu, Yuesheng
    Xie, Xiaodong
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1509 - 1517
  • [23] Two-stage deep learning framework for occlusal crown depth image generation
    Roh, Junghyun
    Kim, Junhwi
    Lee, Jimin
    Computers in Biology and Medicine, 2024, 183
  • [24] Optimal Two-Stage Phase II Designs with Long-Term Endpoints
    Huang, Bo
    Talukder, Enayet
    Thomas, Neal
    STATISTICS IN BIOPHARMACEUTICAL RESEARCH, 2010, 2 (01): : 51 - 61
  • [25] Machine Learning-Based Two-Stage Data Selection Scheme for Long-Term Influenza Forecasting
    Moon, Jaeuk
    Jung, Seungwon
    Park, Sungwoo
    Hwang, Eenjun
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (03): : 2945 - 2959
  • [26] TWO-STAGE NOISE AWARE TRAINING USING ASYMMETRIC DEEP DENOISING AUTOENCODER
    Lee, Kang Hyun
    Kang, Shin Jae
    Kang, Woo Hyun
    Kim, Nam Soo
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5765 - 5769
  • [27] Long-term and short-term customer reaction: a two-stage queueing approach
    van Ackere, Ann
    Haxholdt, Christian
    Larsen, Erik R.
    SYSTEM DYNAMICS REVIEW, 2006, 22 (04) : 349 - 369
  • [28] Applying a novel two-stage deep-learning model to improve accuracy in detecting retinal fundus images
    Maryada, Sai Kiran Reddy
    Booker, William Lee
    Danala, Gopichandh
    Ha, Catherine An
    Mudduluru, Sanjana
    Hougen, Dean F.
    Zheng, Bin
    MEDICAL IMAGING 2022: COMPUTER-AIDED DIAGNOSIS, 2022, 12033
  • [29] A two-stage deep learning framework for counterfeit luxury handbag detection in logo images
    Peng, Jianbiao
    Zou, Beiji
    Zhu, Chengzhang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1439 - 1448
  • [30] TWO-STAGE SPATIAL TEMPORAL DEEP LEARNING FRAMEWORK FOR FUNCTIONAL BRAIN NETWORK MODELING
    Zhao, Yu
    Dai, Haixing
    Zhang, Wei
    Ge, Fangfei
    Liu, Tianming
    2019 IEEE 16TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2019), 2019, : 1576 - 1580