Combining Extreme Learning Machine and Decision Tree for Duration Prediction in HMM based Speech Synthesis

被引:0
|
作者
Wang, Yang [1 ]
Yang, Minghao [1 ]
Wen, Zhengqi [1 ]
Tao, Jianhua [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Chinese Acad Sci, Beijing Key Lab Mobile Comp & Pervas Device, Inst Comp Technol, Beijing, Peoples R China
关键词
speech synthesis; duration prediction; extreme learning machine;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Hidden Markov Model (HMM) based speech synthesis using Decision Tree (DT) for duration prediction is known to produce over-averaged rhythm. To alleviate this problem, this paper proposes a two level duration prediction method together with outlier removal. This method takes advantages of accurate regression capability by Extreme Learning Machine (ELM) for phone level duration prediction, and the capability of distributing state durations by DT for state level duration prediction. Experimental results showed that the method decreased RMSE of phone duration, increased the fluctuation of syllable duration, and achieved 63.75% in preference evaluation. Furthermore, this method does not incur laborious manual alignment on training corpus.
引用
收藏
页码:2197 / 2201
页数:5
相关论文
共 50 条
  • [1] Speech emotion recognition based on feature selection and extreme learning machine decision tree
    Liu, Zhen-Tao
    Wu, Min
    Cao, Wei-Hua
    Mao, Jun-Wei
    Xu, Jian-Ping
    Tan, Guan-Zheng
    [J]. NEUROCOMPUTING, 2018, 273 : 271 - 280
  • [2] Extended Decision Tree with OR Relationship for HMM-based Speech Synthesis
    Wang, Yang
    Tao, Jianhua
    Yang, Minghao
    Li, Ya
    [J]. 2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 225 - 229
  • [3] MOOC Dropout Prediction Using a Hybrid Algorithm Based on Decision Tree and Extreme Learning Machine
    Chen, Jing
    Feng, Jun
    Sun, Xia
    Wu, Nannan
    Yang, Zhengzheng
    Chen, Sushing
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [4] Decision Tree-based Clustering with Outlier Detection for HMM-based Speech Synthesis
    Oh, Kyung Hwan
    Sung, June Sig
    Hong, Doo Hwa
    Kim, Nam Soo
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 108 - +
  • [5] A hybrid disease prediction model based on decision tree and extreme learning machine for predicting dialysis of diabetic nephropathy
    Chen, I-Fei
    Lee, Tian-Shyug
    Jhou, Mao-Jhen
    Lu, Chi-Jie
    [J]. Journal of Quality, 2020, 27 (04): : 214 - 230
  • [6] Decision tree based duration prediction in Mandarin
    Guo, Q
    Katae, N
    [J]. Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 209 - 212
  • [7] State duration modeling for HMM-based speech synthesis
    Zen, Heiga
    Masuko, Takashi
    Tokuda, Keiichi
    Yoshimura, Takayoshi
    Kobayasih, Takao
    Kitamura, Tadashi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (03): : 692 - 693
  • [8] Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis
    Yamagishi, J
    Tachibana, M
    Masuko, T
    Kobayashi, T
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 5 - 8
  • [9] Prediction of Daily Smoking Behavior Based on Decision Tree Machine Learning Algorithm
    Zhang, Yupu
    Liu, Jinhai
    Zhang, Zhihang
    Huang, Junnan
    [J]. PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, : 330 - 333
  • [10] Topics in decision tree based speech synthesis
    Donovan, RE
    [J]. COMPUTER SPEECH AND LANGUAGE, 2003, 17 (01): : 43 - 67