ARTIFICIAL BANDWIDTH EXTENSION USING CONDITIONAL VARIATIONAL AUTO-ENCODERS AND ADVERSARIAL LEARNING

被引:0
|
作者
Bachhav, Pramod [1 ]
Todisco, Massimiliano [1 ]
Evans, Nicholas [1 ]
机构
[1] EURECOM, Sophia Antipolis, France
关键词
variational auto-encoder; generative adversarial network; latent variable; artificial bandwidth extension; speech quality; NETWORK;
D O I
10.1109/icassp40776.2020.9053737
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Artificial bandwidth extension (ABE) algorithms have been developed to estimate missing highband frequency components (4-8kHz) to improve quality of narrowband (0-4kHz) telephone calls. Most ABE solutions employ deep neural networks (DNNs) due to their well-known ability to model highly complex, non-linear relationship between narrowband and highband features. Generative models such as conditional variational auto-encoders (CVAEs) are capable of modelling complex data distributions via latent representation learning. This paper reports their application to ABE. CVAEs, form of directed, graphical models, are exploited to model the probability distribution of highband features conditioned on narrowband features. While CVAEs are trained with the standard mean square criterion (MSE), their combination with adversarial learning give further improvements. When compared to results obtained with the baseline approach, the wideband PESQ is improved significantly by 0.21 points. The performance is also compared on an automatic speech recognition (ASR) task on the TIMIT dataset where word error rate (WER) is decreased by an absolute value of 0.3%.
引用
收藏
页码:6924 / 6928
页数:5
相关论文
共 50 条
  • [21] InvMap and Witness Simplicial Variational Auto-Encoders
    Medbouhi, Aniss Aiman
    Polianskii, Vladislav
    Varava, Anastasia
    Kragic, Danica
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (01): : 199 - 236
  • [22] MaskAAE: Latent space optimization for Adversarial Auto-Encoders
    Mondal, Arnab Kumar
    Chowdhury, Sankalan Pal
    Jayendran, Aravind
    Singla, Parag
    Asnani, Himanshu
    Prathosh, A. P.
    [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 689 - 698
  • [23] Adversarial Auto-encoders for Speech Based Emotion Recognition
    Sahu, Saurabh
    Gupta, Rahul
    Sivaraman, Ganesh
    AbdAlmageed, Wael
    Espy-Wilson, Carol
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1243 - 1247
  • [24] Interpretable ECG Beat Embedding using Disentangled Variational Auto-Encoders
    Van Steenkiste, Tom
    Deschrijver, Dirk
    Dhaene, Tom
    [J]. 2019 IEEE 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2019, : 373 - 378
  • [25] Triggering dark showers with conditional dual auto-encoders
    Anzalone, Luca
    Chhibra, Simranjit Singh
    Maier, Benedikt
    Chernyavskaya, Nadezda
    Pierini, Maurizio
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (03):
  • [26] Isometric Quotient Variational Auto-Encoders for Structure-Preserving Representation Learning
    Huh, In
    Jeong, Changwook
    Choe, Jae Myung
    Kim, Young-Gu
    Kim, Dae Sin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [27] Automatic selection of latent variables in variational auto-encoders
    Jouffroy, Emma
    Giremus, Audrey
    Berthoumieu, Yannick
    Bach, Olivier
    Hugget, Alain
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1407 - 1411
  • [28] FMCW Radar Sensing for Indoor Drones Using Variational Auto-Encoders
    Safa, Ali
    Verbelen, Tim
    Catal, Ozan
    Van de Maele, Toon
    Hartmann, Matthias
    Dhoedt, Bart
    Bourdoux, Andre
    [J]. 2023 IEEE RADAR CONFERENCE, RADARCONF23, 2023,
  • [29] Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders
    Duan, Yu
    Xu, Canwen
    Pei, Jiaxin
    Han, Jialong
    Li, Chenliang
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 253 - 262
  • [30] Discriminative regularization of the latent manifold of variational auto-encoders
    Kossyk, Ingo
    Marton, Zoltan-Csaba
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 61 : 121 - 129