Speech Synthesis Based on Gaussian Conditional Random Fields

Cited by: 2
Authors
Khorram, Soheil [1]
Bahmaninezhad, Fahimeh [1]
Sameti, Hossein [1]
Affiliations
[1] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
Keywords
Gaussian conditional random field; Statistical parametric speech synthesis; HSMM extension; ALGORITHMS; HMM;
DOI
10.1007/978-3-319-10849-0_19
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hidden Markov Model (HMM)-based synthesis (HTS) has recently been confirmed to be the most effective method for generating natural speech. However, it lacks adequate context generalization when the training data is limited. As a solution, the current study provides a new context-dependent speech modeling framework based on Gaussian Conditional Random Field (GCRF) theory. By applying this model, an innovative speech synthesis system has been developed which can be viewed as an extension of the Context-Dependent Hidden Semi-Markov Model (CD-HSMM). A novel Viterbi decoder along with a stochastic gradient ascent algorithm was applied to train the model parameters. Also, a fast and efficient parameter generation algorithm was derived for the synthesis part. Experimental results using objective and subjective criteria have shown that the proposed system outperforms HSMM substantially on limited speech databases. Moreover, the Mel-cepstral distance of the spectral parameters has been reduced considerably for every training database size.
Pages: 183-193
Number of pages: 11
Related papers
50 in total
  • [1] Gaussian conditional random fields for classification
    Petrovic, Andrija
    Nikolic, Mladen
    Jovanovic, Milos
    Delibasic, Boris
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [2] Background Extraction Based on Joint Gaussian Conditional Random Fields
    Wang, Hong-Cyuan
    Lai, Yu-Chi
    Cheng, Wen-Huang
    Cheng, Chin-Yun
    Hua, Kai-Lung
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (11) : 3127 - 3140
  • [3] Gaussian Conditional Random Fields for Face Recognition
    Smereka, Jonathon M.
    Kumar, B. V. K. Vijaya
    Rodriguez, Andres
    [J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2016), 2016 : 155 - 162
  • [4] BAYESIAN ESTIMATION OF GAUSSIAN CONDITIONAL RANDOM FIELDS
    Gan, Lingrui
    Narisetty, Naveen
    Liang, Feng
    [J]. STATISTICA SINICA, 2022, 32 (01) : 131 - 152
  • [5] Software package for regression algorithms based on Gaussian Conditional Random Fields
    Markovic, Tijana
    Devedzic, Vladan
    Zhou, Fang
    Obradovic, Zoran
    [J]. 2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022 : 1121 - 1128
  • [6] Object Segmentation Based on Gaussian Mixture Model and Conditional Random Fields
    Qi, Yali
    Zhang, Guoshan
    Qi, Yali
    Li, Yeli
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2016 : 900 - 904
  • [7] Conditional Random Fields for Hierarchical Segment Selection in Text-to-Speech Synthesis
    Weiss, Christian
    Hess, Wolfgang
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006 : 2026 - 2029
  • [8] Mixed Membership Sparse Gaussian Conditional Random Fields
    Yang, Jie
    Leung, Henry C. M.
    Yiu, S. M.
    Chin, Francis Y. L.
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2017, 2017, 10604 : 287 - 302
  • [9] Gaussian conditional random fields extended for directed graphs
    Vujicic, Tijana
    Glass, Jesse
    Zhou, Fang
    Obradovic, Zoran
    [J]. MACHINE LEARNING, 2017, 106 (9-10) : 1271 - 1288