High-Individuality Voice Conversion Based on Concatenative Speech Synthesis

被引:0
|
作者
Fujii, Kei [1 ]
Okawa, Jun [1 ]
Suigetsu, Kaori [1 ]
机构
[1] Kumamoto Natl Coll Technol, Dept Informat & Comp Sci, Kohshi City, Kumamoto 8611102, Japan
关键词
concatenative speech synthesis; join cost; speaker individuality; unit selection; voice conversion;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Concatenative speech synthesis is a method that can make speech sound which has naturalness and high-individuality of a speaker by introducing a large speech corpus. Based on this method, in this paper, we propose a voice conversion method whose conversion speech has high-individuality and naturalness. The authors also have two subjective evaluation experiments for evaluating individuality and sound quality of conversion speech. From the results, following three facts have be confirmed: (a) the proposal method can convert the individuality of speakers well, (b) employing the framework of unit selection (especially join cost) of concatenative speech synthesis into conventional voice conversion improves the sound quality of conversion speech, and (c) the proposal method is robust against the difference of genders between a source speaker and a target speaker.
引用
收藏
页码:483 / 488
页数:6
相关论文
共 50 条
  • [1] Syllable Based Concatenative Synthesis for Text to Speech Conversion
    Ananthi, S.
    Dhanalakshmi, P.
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 3, 2015, 33
  • [2] Applying Voice Conversion To Concatenative Singing-Voice Synthesis
    Villavicencio, Fernando
    Bonada, Jordi
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2162 - +
  • [3] Assessment and correction of voice quality variabilities in large speech databases for concatenative speech synthesis
    Stylianou, Yannis
    [J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 377 - 380
  • [4] Archisegment-based letter-to-phone conversion for concatenative speech synthesis in Portuguese
    Albano, EC
    Moreira, AA
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1708 - 1711
  • [5] Assessment and correction of voice quality variabilities in large speech databases for concatenative speech synthesis
    Stylianou, Y
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 377 - 380
  • [6] A Concatenative Synthesis Based Speech Synthesiser for Hindi
    Gupta, Kshitij
    [J]. ADVANCES IN COMPUTER AND INFORMATIOM SCIENCES AND ENGINEERING, 2008, : 261 - 264
  • [7] Dimensional Affective Speech Synthesis Based on Voice Conversion
    Zhang, Xin
    Wan, Yaobin
    Wang, Wei
    [J]. Intelligent Computing, 2024, 3
  • [8] MULTI VOICE TEXT TO SPEECH SYNTHESIS BASED ON THE INSTANTANEOUS PARAMETRIC VOICE CONVERSION
    Azarov, Elias
    Petrovsky, Alexander
    Zubrycki, Piotr
    [J]. SPA 2010: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2010, : 78 - 82
  • [9] Voice Conversion for Whispered Speech Synthesis
    Cotescu, Marius
    Drugman, Thomas
    Huybrechts, Goeric
    Lorenzo-Trueba, Jaime
    Moinet, Alexis
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 186 - 190
  • [10] Introduction to Multilingual Corpus-Based Concatenative Speech Synthesis
    Deprez, Filip
    Odijk, Jan
    De Moortel, Jan
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 357 - 360