Simultaneous Optimization of Multiple Tree-Based Factor Analyzed HMM for Speech Synthesis

被引：1

作者：

Yoshimura, Takenori ^{[1
]}

Hashimoto, Kei ^{[1
]}

Oura, Keiichiro ^{[1
]}

Nankaku, Yoshihiko ^{[1
]}

Tokuda, Keiichi ^{[1
]}

机构：

[1] Nagoya Inst Technol, Nagoya, Aichi 4668555, Japan

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2017年 / 25卷 / 09期

基金：

日本科学技术振兴机构;

关键词：

Decision tree-based context clustering; eigenvoice; factor analysis; HMM-based speech synthesis; HIDDEN MARKOV-MODELS; SPEAKER ADAPTATION; MAXIMUM-LIKELIHOOD; PITCH;

D O I：

10.1109/TASLP.2017.2721219

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper proposes a novel method to build multiple decision trees as a structure of factor analyzed hidden Markov model for speech synthesis. In the proposed method, the multiple decision trees grow simultaneously rather than sequentially to take into account the relationship between the trees. However, the simultaneous growing is computationally infeasible due to an exponential increase in the number of tree structures to be evaluated. To solve the problem, we further propose two computational complexity reduction algorithms that achieve a significant reduction in the computational time. Experimental results show that the proposed method outperforms the conventional one based on a single decision tree.

引用

页码：1532 / 1541

页数：10

共 50 条

[21] HMM speech synthesis based on MDCT representation
Biagetti G.
Crippa P.
Falaschetti L.
Turchetti C.
International Journal of Speech Technology, 2018, 21 (4) : 1045 - 1055
[22] A Multi Model HMM Based Speech Synthesis
Chanjaradwichai, Supadaech
Suchato, Atiwong
Punyabukkana, Proadpran
ENGINEERING JOURNAL-THAILAND, 2018, 22 (01): : 187 - 203
[23] HMM-Based Vietnamese Speech Synthesis
Trinh Quoc Son
2015 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2015, : 349 - 353
[24] Czech HMM-Based Speech Synthesis
Hanzlicek, Zdenek
TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 291 - 298
[25] Robustness of HMM-based Speech Synthesis
Yamagishi, Junichi
Ling, Zhenhua
King, Simon
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 581 - 584
[26] A Solution on Tibetan Speech Synthesis Based on HMM
Zhou, Yan
Zhao, Dongcai
Wang, Fuzhao
PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 1776 - 1780
[27] Arabic Speech Synthesis System Based on HMM
Amrouche, Aissa
Abed, Ahcene
Falek, Leila
2019 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2019), 2019, : 73 - 78
[28] Arabic HMM-based Speech Synthesis
Khalil, Krichi Mohamed
Adnan, Cherif
2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 450 - 454
[29] Combining Extreme Learning Machine and Decision Tree for Duration Prediction in HMM based Speech Synthesis
Wang, Yang
Yang, Minghao
Wen, Zhengqi
Tao, Jianhua
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2197 - 2201
[30] HMM-Based Vietnamese Speech Synthesis
Trinh, Son
Hoang, Kiem
INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2015, 3 (04) : 33 - 47

← 1 2 3 4 5 →