Application of Voice Conversion for Cross-Language Rap Singing Transformation

被引：9

作者：

Tuerk, Oytun ^{[1
]}

Bueyuek, Osman ^{[2
,3
]}

Haznedaroglu, Ali ^{[2
,3
]}

Arslan, Levent M. ^{[2
,3
]}

机构：

[1] DFKI GmbH Language Technol Lab, Speech Grp, Berlin, Germany

[2] ITU Ayazaga Kampusu, Sestek Inc, Istanbul, Turkey

[3] Bogazici Univ, Dept Elect Engn & Elect, TR-80815 Bebek, Turkey

来源：

2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年

关键词：

voice conversion; singing voice transformation; weighted codebook mapping;

D O I：

10.1109/ICASSP.2009.4960404

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Voice conversion enables generation of a desired speaker's voice from audio recordings of another speaker. In this paper, we focus on a music application and describe the first steps towards generating voices of music celebrities using conventional voice conversion techniques. Specifically, rap singing transformations from English to Spanish are performed using parallel training material in English. Weighted codebook mapping based voice conversion with two different alignment methods and temporal smoothing of the transformation filter are employed. The first aligner uses a HMM trained for each source recording to force-align the corresponding target recording. The second aligner employs speaker-independent HMMs trained from a large number of speakers. Additionally, a smoothing step is devised to reduce discontinuities and to improve performance. The results of subjective evaluations indicate that both aligners perform equivalenty well. The proposed smoothing technique improves both similarity to target singer and quality significantly regardless of the alignment method.

引用

页码：3597 / +

页数：2

共 50 条

[1] Cross-Language Voice Conversion Based on Eigenvoices
Charlier, Malorie
Ohtani, Yamato
Toda, Tomoki
Moinet, Alexis
Dutoit, Thierry
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1603 - +
[2] A Phonetic Assessment of Cross-Language Voice Conversion
Yanagisawa, Kayoko
Huckvale, Mark
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 593 - 596
[3] VTLN-based cross-language voice conversion
Sündermann, D
Ney, H
Höge, H
[J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 676 - 681
[4] Text-Independent Cross-Language Voice Conversion
Suendermann, David
Hoege, Harald
Bonafonte, Antonio
Ney, Hermann
Hirschberg, Julia
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2262 - +
[5] STATISTICAL-ANALYSIS OF BILINGUAL SPEAKERS SPEECH FOR CROSS-LANGUAGE VOICE CONVERSION
ABE, M
SHIKANO, K
KUWABARA, H
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 90 (01): : 76 - 82
[6] Cross-Language Evaluation of Voice-To-Phoneme Conversions for Voice-Tag Application in Embedded Platforms
Cheng, Yan Ming
Ma, Changxue
Melnar, Lynette
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 121 - 124
[7] Unsupervised Cross-Domain Singing Voice Conversion
Polyak, Adam
Wolf, Lior
Adi, Yossi
Taigman, Yaniv
[J]. INTERSPEECH 2020, 2020, : 801 - 805
[8] Cross-language analysis of stutterers voice onset time
Rezaei-Aghbash, N
Whiteside, SP
Cudd, P
[J]. JOURNAL OF FLUENCY DISORDERS, 2000, 25 (03) : 258 - 258
[9] VOICE TIMING - CROSS-LANGUAGE EXPERIMENTS IN IDENTIFICATION AND DISCRIMINATION
ABRAMSON, AS
LISKER, L
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1968, 44 (01): : 377 - &
[10] Language translation and media transformation in cross-language image retrieval
Chen, Hsin-Hsi
Chang, Yih-Chen
[J]. DIGITAL LIBRARIES: ACHIEVEMENTS, CHALLENGES AND OPPORTUNITIES, PROCEEDINGS, 2006, 4312 : 350 - +

← 1 2 3 4 5 →