Cross-Lingual Voice Conversion using a Cyclic Variational Auto-encoder and a WaveNet Vocoder

被引:0
|
作者
Nakatani, Hikaru [1 ]
Tobing, Patrick Lumban [1 ]
Takeda, Kazuya [1 ]
Toda, Tomoki [1 ]
机构
[1] Nagoya Univ, Nagoya, Aichi, Japan
关键词
NEURAL-NETWORKS; SPEECH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a novel, cross-lingual voice conversion (VC) method using a cyclic variational auto-encoder (CycleVAE). Voice conversion is the transformation of the voice of one speaker into the voice of another speaker, while cross-lingual VC performs voice conversion between speakers who speak different languages. When using VC methods based on parallel learning, it is necessary to prepare accented speech uttered by the source or target speaker, using the pronunciation system of the speaker's mother tongue. On the other hand, VC methods which use a non-parallel learning approach can utilize the natural speech data of both the source and target speakers, produced in their own native languages. It then becomes necessary, however, to deal with the issues of time-alignment and language mismatches. To address these issues, we apply CycleVAE to cross-lingual VC as a sophisticated, non-parallel method of VC. We also apply the WaveNet vocoder in the waveform generation process of CycleVAE-VC to improve overall conversion quality. Our objective and subjective experimental results when performing cross-lingual VC from a native English speaker to a native Japanese speaker confirm that the proposed method achieves a higher level of naturalness and speaker similarity than a conventional RNN-based parallel VC method using accented speech.
引用
收藏
页码:520 / 526
页数:7
相关论文
共 50 条
  • [21] Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation
    Zhou, Yi
    Tian, Xiaohai
    Wu, Zhizheng
    Li, Haizhou
    [J]. INTERSPEECH 2021, 2021, : 1374 - 1378
  • [22] Path Tracking Control Using Imitation Learning with Variational Auto-Encoder
    Lee, Su-Jin
    Chun, Tae Yoon
    Lim, Hyoung Woo
    Lee, Sang-Ho
    [J]. 2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 501 - 505
  • [23] Investigation of using disentangled and interpretable representations for one-shot cross-lingual voice conversion
    Mohammadi, Seyed Hamidreza
    Kim, Taehwan
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2833 - 2837
  • [24] Multi-Task WaveRNN With an Integrated Architecture for Cross-Lingual Voice Conversion
    Zhou, Yi
    Tian, Xiaohai
    Li, Haizhou
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1310 - 1314
  • [25] Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents
    Zhou, Yi
    Wu, Zhizheng
    Tian, Xiaohai
    Li, Haizhou
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1916 - 1926
  • [26] Data Augmentation for Electricity Theft Detection Using Conditional Variational Auto-Encoder
    Gong, Xuejiao
    Tang, Bo
    Zhu, Ruijin
    Liao, Wenlong
    Song, Like
    [J]. ENERGIES, 2020, 13 (17)
  • [27] Data imputation in IoT using Spatio-Temporal Variational Auto-Encoder
    Zhang, Shuo
    Chen, Jinyi
    Chen, Jiayuan
    Chen, Xiaofei
    Huang, Hejiao
    [J]. NEUROCOMPUTING, 2023, 529 : 23 - 32
  • [28] Learning from Demonstration using a Curvature Regularized Variational Auto-Encoder (CurvVAE)
    Rhodes, Travers
    Bhattacharjee, Tapomayukh
    Lee, Daniel D.
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10795 - 10800
  • [29] Unsupervised Anomaly Detection Using Variational Auto-Encoder based Feature Extraction
    Yao, Rong
    Liu, Chongdang
    Zhang, Linxuan
    Peng, Peng
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2019,
  • [30] Unsupervised Anomaly Detection in Flight Data Using Convolutional Variational Auto-Encoder
    Memarzadeh, Milad
    Matthews, Bryan
    Avrekh, Ilya
    [J]. AEROSPACE, 2020, 7 (08)