HIGH-QUALITY NONPARALLEL VOICE CONVERSION BASED ON CYCLE-CONSISTENT ADVERSARIAL NETWORK

Cited by: 0
Authors
Fang, Fuming [1]
Yamagishi, Junichi [1, 2]
Echizen, Isao [1]
Lorenzo-Trueba, Jaime [1]
Affiliations
[1] Natl Inst Informat, Tokyo, Japan
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
Keywords
Voice conversion; deep learning; cycle-consistent adversarial network; generative adversarial network
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Although voice conversion (VC) algorithms have achieved remarkable success along with the development of machine learning, superior performance is still difficult to achieve when using nonparallel data. In this paper, we propose using a cycle-consistent adversarial network (CycleGAN) for nonparallel data-based VC training. A CycleGAN is a generative adversarial network (GAN) originally developed for unpaired image-to-image translation. A subjective evaluation of inter-gender conversion demonstrated that the proposed method significantly outperformed a method based on the Merlin open-source neural network speech synthesis system (a parallel VC system adapted for our setup) and a GAN-based parallel VC system. This is the first research to show that the performance of a nonparallel VC method can exceed that of state-of-the-art parallel VC methods.
Pages: 5279-5283
Page count: 5