CLUSTERGEN: A Statistical Parametric Synthesizer using Trajectory Modeling

被引:0
|
作者
Black, Alan W. [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
speech synthesis; statistical parametric synthesis; trajectory HMMs;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unit selection synthesis has shown itself to be capable of producing high quality natural sounding synthetic speech when constructed from large databases of well-recorded, well-labeled speech. However, the cost in time and expertise of building such voices is still too expensive and specialized to be able to build individual voices for everyone. The quality in unit selection synthesis is directly related to the quality and size of the database used. As we require our speech synthesizers to have more variation, style and emotion, for unit selection synthesis, much larger databases will be required. As an alternative, more recently we have started looking for parametric models for speech synthesis, that are still trained from databases of natural speech but are more robust to errors and allow for better modeling of variation. This paper presents the CLUSTERGEN synthesizer which is implemented within the Festival/FestVox voice building environment. As well as the basic technique, three methods of modeling dynamics in the signal are presented and compared: a simple point model, a basic trajectory model and a trajectory model with overlap and add.
引用
收藏
页码:1762 / 1765
页数:4
相关论文
共 50 条
  • [1] Statistical parametric speech synthesis using a hidden trajectory model
    Cai, Ming-Qi
    Ling, Zhen-Hua
    Dai, Li-Rong
    [J]. SPEECH COMMUNICATION, 2015, 72 : 149 - 159
  • [2] IMPROVED TIME-FREQUENCY TRAJECTORY EXCITATION MODELING FOR A STATISTICAL PARAMETRIC SPEECH SYNTHESIS SYSTEM
    Song, Eunwoo
    Joo, Young-Sun
    Kang, Hong-Goo
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4949 - 4953
  • [3] Parametric Shape Modeling of Femurs Using Statistical Shape Analysis
    Choi, Myung Hwan
    Koo, Bon Yeol
    Chae, Je Wook
    Kim, Jay Jung
    [J]. TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS A, 2014, 38 (10) : 1139 - 1145
  • [4] A Parametric Piano Synthesizer
    Rauhala, Jukka
    Laurson, Mikael
    Valimaki, Vesa
    Lehtonen, Heidi-Maria
    Norilo, Vesa
    [J]. COMPUTER MUSIC JOURNAL, 2008, 32 (04) : 17 - 30
  • [5] FLEXIBLE TRAJECTORY MODELING USING A MIXTURE OF PARAMETRIC MOTION FIELDS FOR VIDEO SURVEILLANCE
    Nascimento, Jacinto C.
    Marques, Jorge S.
    Lemos, Joao M.
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1413 - 1416
  • [6] Reliability of Dynamic Causal Modeling using the Statistical Parametric Mapping Toolbox
    Hosseini, Pegah T.
    Wang, Shouyan
    Brinton, Julie
    Bell, Steven
    Simpson, David M.
    [J]. INTERNATIONAL JOURNAL OF SYSTEM DYNAMICS APPLICATIONS, 2014, 3 (02) : 1 - 16
  • [7] Non-parametric Statistical Density Function Synthesizer and Monte Carlo Sampler in CMOS
    Shylendra, Ahish
    Alizad, Sina Haji
    Shukla, Priyesh
    Trivedi, Amit Ranjan
    [J]. 2020 33RD INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2020 19TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID), 2020, : 19 - 24
  • [8] Pitch and tone's modeling in parametric trajectory model
    Zhang, YY
    Liu, WJ
    Xu, B
    Zhang, HY
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4160 - 4160
  • [9] A Neural Parametric Singing Synthesizer
    Blaauw, Merlijn
    Bonada, Jordi
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 4001 - 4005
  • [10] A Neural Parametric Singing Synthesizer Modeling Timbre and Expression from Natural Songs
    Blaauw, Merlijn
    Bonada, Jordi
    [J]. APPLIED SCIENCES-BASEL, 2017, 7 (12):