Simultaneous Optimization of Multiple Tree-Based Factor Analyzed HMM for Speech Synthesis

被引:1
|
作者
Yoshimura, Takenori [1 ]
Hashimoto, Kei [1 ]
Oura, Keiichiro [1 ]
Nankaku, Yoshihiko [1 ]
Tokuda, Keiichi [1 ]
机构
[1] Nagoya Inst Technol, Nagoya, Aichi 4668555, Japan
基金
日本科学技术振兴机构;
关键词
Decision tree-based context clustering; eigenvoice; factor analysis; HMM-based speech synthesis; HIDDEN MARKOV-MODELS; SPEAKER ADAPTATION; MAXIMUM-LIKELIHOOD; PITCH;
D O I
10.1109/TASLP.2017.2721219
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a novel method to build multiple decision trees as a structure of factor analyzed hidden Markov model for speech synthesis. In the proposed method, the multiple decision trees grow simultaneously rather than sequentially to take into account the relationship between the trees. However, the simultaneous growing is computationally infeasible due to an exponential increase in the number of tree structures to be evaluated. To solve the problem, we further propose two computational complexity reduction algorithms that achieve a significant reduction in the computational time. Experimental results show that the proposed method outperforms the conventional one based on a single decision tree.
引用
收藏
页码:1532 / 1541
页数:10
相关论文
共 50 条
  • [21] HMM speech synthesis based on MDCT representation
    Biagetti G.
    Crippa P.
    Falaschetti L.
    Turchetti C.
    International Journal of Speech Technology, 2018, 21 (4) : 1045 - 1055
  • [22] A Multi Model HMM Based Speech Synthesis
    Chanjaradwichai, Supadaech
    Suchato, Atiwong
    Punyabukkana, Proadpran
    ENGINEERING JOURNAL-THAILAND, 2018, 22 (01): : 187 - 203
  • [23] HMM-Based Vietnamese Speech Synthesis
    Trinh Quoc Son
    2015 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2015, : 349 - 353
  • [24] Czech HMM-Based Speech Synthesis
    Hanzlicek, Zdenek
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 291 - 298
  • [25] Robustness of HMM-based Speech Synthesis
    Yamagishi, Junichi
    Ling, Zhenhua
    King, Simon
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 581 - 584
  • [26] A Solution on Tibetan Speech Synthesis Based on HMM
    Zhou, Yan
    Zhao, Dongcai
    Wang, Fuzhao
    PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 1776 - 1780
  • [27] Arabic Speech Synthesis System Based on HMM
    Amrouche, Aissa
    Abed, Ahcene
    Falek, Leila
    2019 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2019), 2019, : 73 - 78
  • [28] Arabic HMM-based Speech Synthesis
    Khalil, Krichi Mohamed
    Adnan, Cherif
    2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 450 - 454
  • [29] Combining Extreme Learning Machine and Decision Tree for Duration Prediction in HMM based Speech Synthesis
    Wang, Yang
    Yang, Minghao
    Wen, Zhengqi
    Tao, Jianhua
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2197 - 2201
  • [30] HMM-Based Vietnamese Speech Synthesis
    Trinh, Son
    Hoang, Kiem
    INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2015, 3 (04) : 33 - 47