Speech Synthesis Based on Gaussian Conditional Random Fields

被引:2
|
作者
Khorram, Soheil [1 ]
Bahmaninezhad, Fahimeh [1 ]
Sameti, Hossein [1 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
关键词
Gaussian conditional random field; Statistical parametric speech synthesis; HSMM extension; ALGORITHMS; HMM;
D O I
10.1007/978-3-319-10849-0_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hidden Markov Model (HMM)-based synthesis (HTS) has recently been confirmed to be the most effective method in generating natural speech. However, it lacks adequate context generalization when the training data is limited. As a solution, current study provides a new context-dependent speech modeling framework based on the Gaussian Conditional Random Field (GCRF) theory. By applying this model, an innovative speech synthesis system has been developed which can be viewed as an extension of Context-Dependent Hidden Semi Markov Model (CD-HSMM). A novel Viterbi decoder along with a stochastic gradient ascent algorithm was applied to train model parameters. Also, a fast and efficient parameter generation algorithm was derived for the synthesis part. Experimental results using objective and subjective criteria have shown that the proposed system outperforms HSMM substantially in limited speech databases. Moreover, Mel-cepstral distance of the spectral parameters has been reduced considerably for any size of training database.
引用
收藏
页码:183 / 193
页数:11
相关论文
共 50 条
  • [31] Image Classification Based on Conditional Random Fields
    Yang, Yan
    Wen-bo, Huang
    Yun-ji, Wang
    [J]. MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 4901 - +
  • [32] Active Learning for Speech Emotion Recognition Using Conditional Random Fields
    Zhao, Ziping
    Ma, Xirong
    [J]. 2013 14TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2013), 2013, : 127 - 131
  • [33] DISCRIMINATIVE DURATION MODELING FOR SPEECH RECOGNITION WITH SEGMENTAL CONDITIONAL RANDOM FIELDS
    Kao, Justine T.
    Zweig, Geoffrey
    Nguyen, Patrick
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4476 - 4479
  • [34] AUTOMATIC MINUTE GENERATION FOR PARLIAMENTARY SPEECH USING CONDITIONAL RANDOM FIELDS
    Zhang, Justin Jian
    Fung, Pascale
    Chan, Ricky Ho Yin
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5536 - 5539
  • [35] An improved gaussian mixture hidden conditional random fields model for audio-based emotions classification
    Siddiqi, Muhammad Hameed
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2021, 22 (01) : 45 - 51
  • [36] Human Activity Recognition Using Gaussian Mixture Hidden Conditional Random Fields
    Siddiqi, Muhammad Hameed
    Alruwaili, Madallah
    Ali, Amjad
    Alanazi, Saad
    Zeshan, Furkh
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [37] Wireless Channel Quality Prediction using Sparse Gaussian Conditional Random Fields
    Raj, Raushan
    Kulkarni, Adita
    Seetharam, Anand
    Ramesh, Arti
    [J]. 2021 IEEE 18TH ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2021,
  • [38] EFFICIENT CONDITIONAL MONTE CARLO SIMULATIONS FOR THE EXPONENTIAL INTEGRALS OF GAUSSIAN RANDOM FIELDS
    Quang Huy Nguyen
    Robert, Christian Y.
    [J]. JOURNAL OF APPLIED PROBABILITY, 2022, 59 (02) : 366 - 383
  • [39] Graphical Model Selection for Gaussian Conditional Random Fields in the Presence of Latent Variables
    Frot, Benjamin
    Jostins, Luke
    McVean, Gilean
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) : 723 - 734
  • [40] Some links between conditional and coregionalized multivariate Gaussian Markov random fields
    Martinez-Beneito, Miguel A.
    [J]. SPATIAL STATISTICS, 2020, 40