Application of Voice Conversion for Cross-Language Rap Singing Transformation

被引:9
|
作者
Tuerk, Oytun [1 ]
Bueyuek, Osman [2 ,3 ]
Haznedaroglu, Ali [2 ,3 ]
Arslan, Levent M. [2 ,3 ]
机构
[1] DFKI GmbH Language Technol Lab, Speech Grp, Berlin, Germany
[2] ITU Ayazaga Kampusu, Sestek Inc, Istanbul, Turkey
[3] Bogazici Univ, Dept Elect Engn & Elect, TR-80815 Bebek, Turkey
关键词
voice conversion; singing voice transformation; weighted codebook mapping;
D O I
10.1109/ICASSP.2009.4960404
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice conversion enables generation of a desired speaker's voice from audio recordings of another speaker. In this paper, we focus on a music application and describe the first steps towards generating voices of music celebrities using conventional voice conversion techniques. Specifically, rap singing transformations from English to Spanish are performed using parallel training material in English. Weighted codebook mapping based voice conversion with two different alignment methods and temporal smoothing of the transformation filter are employed. The first aligner uses a HMM trained for each source recording to force-align the corresponding target recording. The second aligner employs speaker-independent HMMs trained from a large number of speakers. Additionally, a smoothing step is devised to reduce discontinuities and to improve performance. The results of subjective evaluations indicate that both aligners perform equivalenty well. The proposed smoothing technique improves both similarity to target singer and quality significantly regardless of the alignment method.
引用
收藏
页码:3597 / +
页数:2
相关论文
共 50 条
  • [1] Cross-Language Voice Conversion Based on Eigenvoices
    Charlier, Malorie
    Ohtani, Yamato
    Toda, Tomoki
    Moinet, Alexis
    Dutoit, Thierry
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1603 - +
  • [2] A Phonetic Assessment of Cross-Language Voice Conversion
    Yanagisawa, Kayoko
    Huckvale, Mark
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 593 - 596
  • [3] VTLN-based cross-language voice conversion
    Sündermann, D
    Ney, H
    Höge, H
    [J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 676 - 681
  • [4] Text-Independent Cross-Language Voice Conversion
    Suendermann, David
    Hoege, Harald
    Bonafonte, Antonio
    Ney, Hermann
    Hirschberg, Julia
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2262 - +
  • [5] STATISTICAL-ANALYSIS OF BILINGUAL SPEAKERS SPEECH FOR CROSS-LANGUAGE VOICE CONVERSION
    ABE, M
    SHIKANO, K
    KUWABARA, H
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 90 (01): : 76 - 82
  • [6] Cross-Language Evaluation of Voice-To-Phoneme Conversions for Voice-Tag Application in Embedded Platforms
    Cheng, Yan Ming
    Ma, Changxue
    Melnar, Lynette
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 121 - 124
  • [7] Unsupervised Cross-Domain Singing Voice Conversion
    Polyak, Adam
    Wolf, Lior
    Adi, Yossi
    Taigman, Yaniv
    [J]. INTERSPEECH 2020, 2020, : 801 - 805
  • [8] Cross-language analysis of stutterers voice onset time
    Rezaei-Aghbash, N
    Whiteside, SP
    Cudd, P
    [J]. JOURNAL OF FLUENCY DISORDERS, 2000, 25 (03) : 258 - 258
  • [9] VOICE TIMING - CROSS-LANGUAGE EXPERIMENTS IN IDENTIFICATION AND DISCRIMINATION
    ABRAMSON, AS
    LISKER, L
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1968, 44 (01): : 377 - &
  • [10] Language translation and media transformation in cross-language image retrieval
    Chen, Hsin-Hsi
    Chang, Yih-Chen
    [J]. DIGITAL LIBRARIES: ACHIEVEMENTS, CHALLENGES AND OPPORTUNITIES, PROCEEDINGS, 2006, 4312 : 350 - +