VARIATIONAL AUTOENCODER FOR SPEECH ENHANCEMENT WITH A NOISE-AWARE ENCODER

Cited by: 21
Authors
Fang, Huajian [1, 2]
Carbajal, Guillaume [1]
Wermter, Stefan [2]
Gerkmann, Timo [1]
Affiliations
[1] Univ Hamburg, Signal Proc SP, Hamburg, Germany
[2] Univ Hamburg, Knowledge Technol WTM, Hamburg, Germany
Keywords
speech enhancement; generative model; variational autoencoder; semi-supervised learning
DOI
10.1109/ICASSP39728.2021.9414060
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Recently, a generative variational autoencoder (VAE) has been proposed for speech enhancement to model speech statistics. However, this approach only uses clean speech in the training phase, making the estimation particularly sensitive to noise presence, especially in low signal-to-noise ratios (SNRs). To increase the robustness of the VAE, we propose to include noise information in the training phase by using a noise-aware encoder trained on noisy-clean speech pairs. We evaluate our approach on real recordings of different noisy environments and acoustic conditions using two different noise datasets. We show that our proposed noise-aware VAE outperforms the standard VAE in terms of overall distortion without increasing the number of model parameters. At the same time, we demonstrate that our model is capable of generalizing to unseen noise conditions better than a supervised feedforward deep neural network (DNN). Furthermore, we demonstrate the robustness of the model performance to a reduction of the noisy-clean speech training data size.
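The abstract does not say which objective is used to train the noise-aware encoder on noisy-clean pairs. One plausible reading, given here purely as an assumption, is that the encoder fed with noisy speech is pushed to produce the same latent distribution that a clean-speech encoder produces for the paired clean signal, for example by minimizing a KL divergence between the two diagonal Gaussian posteriors. The sketch below illustrates only that matching loss. The names `gaussian_kl` and `toy_encoder`, the linear encoders, and the random spectrogram-like data are all hypothetical stand-ins, not the authors' architecture or data.

```python
import numpy as np

def gaussian_kl(mu_q, logvar_q, mu_p, logvar_p):
    """Closed-form KL(q || p) for diagonal Gaussians, summed over the last axis."""
    var_q, var_p = np.exp(logvar_q), np.exp(logvar_p)
    kl = 0.5 * (logvar_p - logvar_q + (var_q + (mu_q - mu_p) ** 2) / var_p - 1.0)
    return kl.sum(axis=-1)

def toy_encoder(x, w_mu, w_logvar):
    """Hypothetical encoder: one linear map each for the latent mean and log-variance."""
    return x @ w_mu, x @ w_logvar

rng = np.random.default_rng(0)
n_frames, n_freq, n_latent = 8, 16, 4

# Assumed toy data: nonnegative "power spectra" for clean speech, plus additive noise power.
clean = rng.random((n_frames, n_freq))
noisy = clean + 0.5 * rng.random((n_frames, n_freq))

# Assumption: the clean-speech encoder is already trained and frozen;
# only the noise-aware encoder weights would be updated.
w_clean = [0.1 * rng.standard_normal((n_freq, n_latent)) for _ in range(2)]
w_noisy = [0.1 * rng.standard_normal((n_freq, n_latent)) for _ in range(2)]

mu_c, logvar_c = toy_encoder(clean, *w_clean)   # target posterior from clean speech
mu_n, logvar_n = toy_encoder(noisy, *w_noisy)   # posterior from paired noisy speech

# Matching loss: average per-frame KL between noisy-input and clean-input posteriors.
loss = gaussian_kl(mu_n, logvar_n, mu_c, logvar_c).mean()
print(f"KL matching loss: {loss:.4f}")
```

In a real system the loss would be minimized with respect to the noise-aware encoder's parameters using an automatic-differentiation framework. Here it is only evaluated once to show the quantity involved.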
Pages: 676-680
Number of pages: 5