Cross-Lingual Voice Conversion using a Cyclic Variational Auto-encoder and a WaveNet Vocoder

被引：0

作者：

Nakatani, Hikaru ^{[1
]}

Tobing, Patrick Lumban ^{[1
]}

Takeda, Kazuya ^{[1
]}

Toda, Tomoki ^{[1
]}

机构：

[1] Nagoya Univ, Nagoya, Aichi, Japan

来源：

2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2020年

关键词：

NEURAL-NETWORKS; SPEECH;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a novel, cross-lingual voice conversion (VC) method using a cyclic variational auto-encoder (CycleVAE). Voice conversion is the transformation of the voice of one speaker into the voice of another speaker, while cross-lingual VC performs voice conversion between speakers who speak different languages. When using VC methods based on parallel learning, it is necessary to prepare accented speech uttered by the source or target speaker, using the pronunciation system of the speaker's mother tongue. On the other hand, VC methods which use a non-parallel learning approach can utilize the natural speech data of both the source and target speakers, produced in their own native languages. It then becomes necessary, however, to deal with the issues of time-alignment and language mismatches. To address these issues, we apply CycleVAE to cross-lingual VC as a sophisticated, non-parallel method of VC. We also apply the WaveNet vocoder in the waveform generation process of CycleVAE-VC to improve overall conversion quality. Our objective and subjective experimental results when performing cross-lingual VC from a native English speaker to a native Japanese speaker confirm that the proposed method achieves a higher level of naturalness and speaker similarity than a conventional RNN-based parallel VC method using accented speech.

引用

页码：520 / 526

页数：7

共 50 条

[1] Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion
Huang, Wen-Chin
Wu, Yi-Chiao
Hwang, Hsin-Te
Tobing, Patrick Lumban
Hayashi, Tomoki
Kobayashi, Kazuhiro
Toda, Tomoki
Tsao, Yu
Wang, Hsin-Min
[J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[2] Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder
Hsu, Chin-Cheng
Hwang, Hsin-Te
Wu, Yi-Chiao
Tsao, Yu
Wang, Hsin-Min
[J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[3] Enhanced Variational Auto-encoder for Voice Conversion Using Non-parallel Corpora
Huang Guojie
Jin Hui
Yu Yibiao
[J]. PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 46 - 49
[4] An Approach to Cross-Lingual Voice Conversion
Rallabandi, Sai Sirisha
Gangashetty, Suryakanth V.
[J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[5] Any-to-any voice conversion using representation separation auto-encoder
Jian, Zhihua
Zhang, Zixu
[J]. Tongxin Xuebao/Journal on Communications, 2024, 45 (02): : 162 - 172
[6] Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN
Du, Zongyang
Zhou, Kun
Sisman, Barrak
Li, Haizhou
[J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 507 - 513
[7] VOICE CONVERSION WITH CYCLIC RECURRENT NEURAL NETWORK AND FINE-TUNED WAVENET VOCODER
Tobing, Patrick Lumban
Wu, Yi-Chiao
Hayashi, Tomoki
Kobayashi, Kazuhiro
Toda, Tomoki
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6815 - 6819
[8] Frame Alignment Method for Cross-lingual Voice Conversion
Erro, Daniel
Moreno, Asuncion
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1533 - 1536
[9] Cross-Lingual Voice Conversion With Controllable Speaker Individuality Using Variational Autoencoder and Star Generative Adversarial Network
Ho, Tuan Vu
Akagi, Masato
[J]. IEEE ACCESS, 2021, 9 : 47503 - 47515
[10] DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features
M. Kiran Reddy
K. Sreenivasa Rao
[J]. Neural Processing Letters, 2020, 51 : 2029 - 2042

← 1 2 3 4 5 →