ADAPTIVE WAVENET VOCODER FOR RESIDUAL COMPENSATION IN GAN-BASED VOICE CONVERSION

被引:0
|
作者
Sisman, Berrak [1 ,2 ,3 ]
Zhang, Mingyang [1 ]
Sakti, Sakriani [2 ,3 ]
Li, Haizhou [1 ]
Nakamura, Satoshi [2 ,3 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Nara Inst Sci & Technol, Nara, Japan
[3] RIKEN, Ctr Adv Intelligence Project AIP, Tokyo, Japan
关键词
voice conversion; generative adversarial networks; adaptive Wavenet; residual compensation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose to use generative adversarial networks (GAN) together with a WaveNet vocoder to address the over-smoothing problem arising from the deep learning approaches to voice conversion, and to improve the vocoding quality over the traditional vocoders. As GAN aims to minimize the divergence between the natural and converted speech parameters, it effectively alleviates the over-smoothing problem in the converted speech. On the other hand, WaveNet vocoder allows us to leverage from the human speech of a large speaker population, thus improving the naturalness of the synthetic voice. Furthermore, for the first time, we study how to use WaveNet vocoder for residual compensation to improve the voice conversion performance. The experiments show that the proposed voice conversion framework consistently outperforms the baselines.
引用
收藏
页码:282 / 289
页数:8
相关论文
共 50 条
  • [41] A Novel Dead-time Adaptive Control Method for GaN-based Motor Drive
    Qin, Haihong
    Wang, Wenlu
    Xie, Sixuan
    Peng, Jiangjin
    Chen, Wenming
    [J]. Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2023, 43 (11): : 4422 - 4433
  • [42] Color Conversion of GaN-Based Micro Light-Emitting Diodes Using Quantum Dots
    Lee, Ching-Ting
    Cheng, Chao-Jung
    Lee, Hsin-Ying
    Chu, Ying-Chien
    Fang, Yen-Hsiang
    Chao, Chia-Hsin
    Wu, Ming-Hsien
    [J]. IEEE PHOTONICS TECHNOLOGY LETTERS, 2015, 27 (21) : 2296 - 2299
  • [43] Influence of residual carbon impurities in i-GaN layer on the performance of GaN-based p-i-n photodetectors
    Li, Xiaojing
    Zhao, Degang
    Jiang, Desheng
    Chen, Ping
    Zhu, Jianjun
    Liu, Zongshun
    Le, Lingcong
    Yang, Jing
    He, Xiaoguang
    Zhang, Liqun
    Zhang, Shuming
    Liu, Jianping
    Yang, Hui
    [J]. JOURNAL OF VACUUM SCIENCE & TECHNOLOGY B, 2016, 34 (01):
  • [44] Improved conversion efficiency of GaN-based solar cells with Mn-doped absorption layer
    Sheu, Jinn-Kong
    Huang, Feng-Wen
    Lee, Chia-Hui
    Lee, Ming-Lun
    Yeh, Yu-Hsiang
    Chen, Po-Cheng
    Lai, Wei-Chih
    [J]. APPLIED PHYSICS LETTERS, 2013, 103 (06)
  • [45] An Integrated Driver With Adaptive Dead-Time Control for GaN-Based Synchronous Buck Converter
    Chen, Ching-Jan
    Chiu, Ping-Kun
    Chen, Yen-Ming
    Wang, Pin-Ying
    Chang, Yu-Cheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (02) : 539 - 543
  • [46] Theoretical and Numerical Design of a Wireless Power Transmission Link With GaN-Based Transmitter and Adaptive Receiver
    Florian, Corrado
    Mastri, Franco
    Paganelli, Rudi Paolo
    Masotti, Diego
    Costanzo, Alessandra
    [J]. IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2014, 62 (04) : 931 - 946
  • [47] GAN-based statistical modeling with adaptive schemes for surface defect inspection of IC metal packages
    Wu, Zhenshuang
    Cai, Nian
    Chen, Kaiqiong
    Xia, Hao
    Zhou, Shuai
    Wang, Han
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (04) : 1811 - 1824
  • [48] A GaN-Based Gate Driver with Adaptive Charge Sharing Bootstrap Technique to Improve the Conduction Loss
    Sun, Tsung-Wen
    Hsu, Yung -Tang
    Tsai, Tsung-Heng
    Chang, Chia-Chan
    [J]. 2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [49] VOICE CONVERSION THROUGH RESIDUAL WARPING IN A SPARSE, ANCHOR-BASED REPRESENTATION OF SPEECH
    Liberatore, Christopher
    Zhao, Guanlong
    Gutierrez-Osuna, Ricardo
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5284 - 5288
  • [50] Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
    Kain, A
    Macon, MW
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 813 - 816