A Close Look into the Probabilistic Concatenation Model for Corpus-based Speech Synthesis

Authors
Sakai, Shinsuke [1]
Maia, Ranniery [1]
Kawai, Hisashi [1]
Nakamura, Satoshi [1]
Affiliations
[1] Natl Inst Informat & Commun Technol, Tokyo, Japan
Keywords
speech synthesis; unit selection; join costs
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We have proposed a novel probabilistic approach to concatenation modeling for corpus-based speech synthesis, in which the goodness of concatenation for a unit is modeled by a conditional Gaussian probability density whose mean is a linear transform of the feature vector from the previous unit. The approach has shown its effectiveness in a subjective listening test. In this paper, we further investigate the characteristics of the proposed method through an objective evaluation and by observing the sequence of concatenation scores across an utterance. We also present the mathematical relationships between the proposed method and other approaches, and show that it has flexible modeling power, with other concatenation scoring methods arising as special cases.
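The scoring idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `concat_log_score`, the parameter names `A`, `b`, and `var`, the two-dimensional toy features, and the diagonal covariance are all assumptions made for illustration. The sketch evaluates the log of a Gaussian density whose mean is a linear transform of the previous unit's feature vector.

```python
import math

def concat_log_score(prev_feat, cur_feat, A, b, var):
    """Log of a Gaussian density N(cur_feat; A * prev_feat + b, diag(var)).

    All names are hypothetical. A diagonal covariance is assumed here
    purely to keep the example free of matrix inversion.
    """
    # Mean of the conditional Gaussian: a linear transform of the
    # feature vector taken from the previous unit.
    mean = [sum(a_ij * p for a_ij, p in zip(row, prev_feat)) + b_i
            for row, b_i in zip(A, b)]
    log_p = 0.0
    for x, m, v in zip(cur_feat, mean, var):
        # Per-dimension Gaussian log density.
        log_p += -0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
    return log_p

# Toy setting (assumed): identity transform, zero offset, unit variances.
A = [[1.0, 0.0], [0.0, 1.0]]
b = [0.0, 0.0]
var = [1.0, 1.0]

# Features that match at the join score higher than mismatched ones.
smooth = concat_log_score([1.0, 2.0], [1.0, 2.0], A, b, var)
rough = concat_log_score([1.0, 2.0], [3.0, 0.0], A, b, var)
```

In this toy setting the gap between the two scores is half the squared Euclidean distance between the feature vectors, so with these particular parameter choices the score behaves like a conventional distance-based join cost. Which special cases the paper itself derives is not stated in the abstract.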
Pages: 744-747
Page count: 4
Related papers
50 in total
  • [1] Probabilistic Concatenation Modeling for Corpus-Based Speech Synthesis
    Sakai, Shinsuke
    Kawahara, Tatsuya
    Kawai, Hisashi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10): 2006-2014
  • [2] Decision Tree-based Training of Probabilistic Concatenation Models for Corpus-based Speech Synthesis
    Sakai, Shinsuke
    Kawahara, Tatsuya
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006: 1746-1749
  • [3] A corpus-based speech synthesis system with emotion
    Iida, A
    Campbell, N
    Higuchi, F
    Yasumura, M
    [J]. SPEECH COMMUNICATION, 2003, 40 (1-2): 161-187
  • [4] A corpus-based speech synthesis system for Uyghur
    Silamu, Wushour
    Tursun, Nasirjan
    Tursun, Mamateli
    [J]. RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007: 373-376
  • [5] Introduction to Multilingual Corpus-Based Concatenative Speech Synthesis
    Deprez, Filip
    Odijk, Jan
    De Moortel, Jan
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007: 357-360
  • [6] Segment Connection Networks for Corpus-Based Speech Synthesis
    Coorman, Geert
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006: 2074-2077
  • [7] Developments in corpus-based speech synthesis: Approaching natural conversational speech
    Campbell, N
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): 376-383
  • [8] SPEECH, SPEECH - A CLOSE LOOK AT SPEECH SYNTHESIS
    MCCOMB, G
    [J]. CREATIVE COMPUTING, 1982, 8 (12): : 120 - &
  • [9] Corpus-based Malay Text-to-Speech Synthesis System
    Swee, Tan Tian
    Salleh, Sheikh Hussain Shaikh
    [J]. 2008 14TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC), VOLS 1 AND 2, 2008: 52-56
  • [10] Maximum Likelihood Unit Selection for Corpus-based Speech Synthesis
    Gamboa Rosales, Abubeker
    Rosales, Hamurabi Gamboa
    Hoffmann, Ruediger
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 748 - +