Speech Enhancement Using Dynamical Variational AutoEncoder

被引:0
|
作者
Do, Hao D. [1 ]
机构
[1] FPT Univ, Ho Chi Minh City, Vietnam
关键词
speech enhancement; dynamical variational autoEncoder; generative model;
D O I
10.1007/978-981-99-5837-5_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research focuses on dealing with speech enhancement via a generative model. Many other solutions, which are trained with some fixed kinds of interference or noises, need help when extracting speech from the mixture with a strange noise. We use a class of generative models called Dynamical Variational AutoEncoder (DVAE), which combines generative and temporal models to analyze the speech signal. This class of models makes attention to speech signal behavior, then extracts and enhances the speech. Moreover, we design a new architecture in the DVAE class named Bi-RVAE, which is more straightforward than the other models but gains good results. Experimental results show that DVAE class, including our proposed design, achieves a high-quality recovered speech. This class could enhance the speech signal before passing it into the central processing models.
引用
收藏
页码:247 / 258
页数:12
相关论文
共 50 条
  • [31] Speaker normalization using Joint Variational Autoencoder
    Kumar, Shashi
    Rath, Shakti P.
    Pandey, Abhishek
    INTERSPEECH 2021, 2021, : 1289 - 1293
  • [32] Botnet Detection Using Recurrent Variational Autoencoder
    Kim, Jeeyung
    Sim, Alex
    Kim, Jinoh
    Wu, Kesheng
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [33] Crash data augmentation using variational autoencoder
    Islam, Zubayer
    Abdel-Aty, Mohamed
    Cai, Qing
    Yuan, Jinghui
    ACCIDENT ANALYSIS AND PREVENTION, 2021, 151
  • [34] A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders
    Pariente, Manuel
    Deleforge, Antoine
    Vincent, Emmanuel
    INTERSPEECH 2019, 2019, : 3158 - 3162
  • [35] MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder
    Li, You-Jin
    Wang, Syu-Siang
    Tsao, Yu
    Su, Borching
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1245 - 1250
  • [36] Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement
    Zezario, Ryandhimas E.
    Huang, Jen-Wei
    Lu, Xugang
    Tsao, Yu
    Hwang, Hsin-Te
    Wang, Hsin-Min
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 373 - 377
  • [37] Multi-channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder
    Tawara, Naohiro
    Kobayashi, Tetsunori
    Ogawa, Tetsuji
    INTERSPEECH 2019, 2019, : 86 - 90
  • [38] Insider Threat Detection using Deep Autoencoder and Variational Autoencoder Neural Networks
    Pantelidis, Efthimios
    Bendiab, Gueltoum
    Shiaeles, Stavros
    Kolokotronis, Nicholas
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE (IEEE CSR), 2021, : 129 - 134
  • [39] RETRACTED: Modeling neural dynamics during speech production using a state space variational autoencoder (Retracted Article)
    Sun, Pengfei
    Moses, David A.
    Chang, Edward F.
    2019 9TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING (NER), 2019, : 428 - 432
  • [40] Multichannel Variational Autoencoder-Based Speech Separation in Designated Speaker Order
    Liao, Lele
    Cheng, Guoliang
    Ruan, Haoxin
    Chen, Kai
    Lu, Jing
    SYMMETRY-BASEL, 2022, 14 (12):