VARIATIONAL AUTOENCODER FOR SPEECH ENHANCEMENT WITH A NOISE-AWARE ENCODER

Cited by: 21
Authors
Fang, Huajian [1, 2]
Carbajal, Guillaume [1 ]
Wermter, Stefan [2 ]
Gerkmann, Timo [1 ]
Affiliations
[1] Univ Hamburg, Signal Proc SP, Hamburg, Germany
[2] Univ Hamburg, Knowledge Technol WTM, Hamburg, Germany
Keywords
speech enhancement; generative model; variational autoencoder; semi-supervised learning
DOI
10.1109/ICASSP39728.2021.9414060
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Recently, a generative variational autoencoder (VAE) has been proposed for speech enhancement to model speech statistics. However, this approach only uses clean speech in the training phase, making the estimation particularly sensitive to the presence of noise, especially at low signal-to-noise ratios (SNRs). To increase the robustness of the VAE, we propose to include noise information in the training phase by using a noise-aware encoder trained on noisy-clean speech pairs. We evaluate our approach on real recordings of different noisy environments and acoustic conditions using two different noise datasets. We show that our proposed noise-aware VAE outperforms the standard VAE in terms of overall distortion without increasing the number of model parameters. At the same time, we demonstrate that our model is capable of generalizing to unseen noise conditions better than a supervised feedforward deep neural network (DNN). Furthermore, we demonstrate that the model's performance is robust to a reduction in the amount of noisy-clean speech training data.
Pages: 676 - 680
Number of pages: 5
Related papers
50 items in total
  • [41] A TSV Noise-Aware 3-D Placer
    Lee, Yu-Min
    Chen, Chun
    Song, JiaXing
    Pan, Kuan-Te
    [J]. 2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015, : 1653 - 1658
  • [42] UNSUPERVISED NOISE-AWARE ADAPTIVE FEEDBACK CANCELLATION FOR HEARING AID DEVICES UNDER NOISY SPEECH FRAMEWORK
    Mishra, Parth
    Ganguly, Anshuman
    Kucuk, Abdullah
    Panahi, Issa M. S.
    [J]. 2017 IEEE SIGNAL PROCESSING IN MEDICINE AND BIOLOGY SYMPOSIUM (SPMB), 2017,
  • [43] Speaker-aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement
    Chuang, Fu-Kai
    Wang, Syu-Siang
    Hung, Jeih-weih
    Tsao, Yu
    Fang, Shih-Hau
    [J]. INTERSPEECH 2019, 2019, : 3173 - 3177
  • [44] QuantumNAT: Quantum Noise-Aware Training with Noise Injection, Quantization and Normalization
    Wang, Hanrui
    Gu, Jiaqi
    Ding, Yongshan
    Li, Zirui
    Chong, Frederic T.
    Pan, David Z.
    Han, Song
    [J]. PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 1 - 6
  • [45] A Noise-Aware Multiple Imputation Algorithm for Missing Data
    Li, Fangfang
    Sun, Hui
    Gu, Yu
    Yu, Ge
    [J]. MATHEMATICS, 2023, 11 (01)
  • [46] Cleaning training-datasets with noise-aware algorithms
    Escalante, H. Jair
    [J]. SEVENTH MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE, PROCEEDINGS, 2006, : 151 - 158
  • [47] Noise-Aware Quantum Circuit Simulation With Decision Diagrams
    Grurl, Thomas
    Fuss, Juergen
    Wille, Robert
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (03) : 860 - 873
  • [48] Noise-Aware and Lightweight LSTM for Keyword Spotting Applications
    Wang, Yingfeng
    Chong, Yi Sheng
    Goh, Wang Ling
    Anh Tuan Do
    [J]. 2022 19TH INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2022, : 135 - 136
  • [49] Noise is the fatal poison: A Noise-aware Network for noisy dataset classification
    Yu, Xiaotian
    Zhang, Shengxuming
    Jia, Lingxiang
    Wang, Yuexuan
    Song, Mingli
    Feng, Zunlei
    [J]. NEUROCOMPUTING, 2024, 563
  • [50] Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement
    Sun, Meng
    Zhang, Xiongwei
    Van hamme, Hugo
    Zheng, Thomas Fang
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (01) : 93 - 104