Extended Decision Tree with OR Relationship for HMM-based Speech Synthesis

Cited by: 1

Authors:

Wang, Yang [1]

Tao, Jianhua [1]

Yang, Minghao [1]

Li, Ya [1]

Affiliations:

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China

Source:

2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013) | 2013

Keywords:

HMM-based speech synthesis; decision tree; OR relationship

DOI:

10.1109/ACPR.2013.94

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a variant of the decision tree (DT) for HMM-based speech synthesis, called the Extended Decision Tree with OR Relationship (EDTOR). In a conventional DT, each leaf node is reached by exactly one path: a series of yes/no questions answered from the root node down to that leaf. The condition for deciding whether the acoustic parameters of a context label belong to a given leaf node is therefore a conjunction (AND) of question outcomes. However, some linguistic knowledge cannot be represented compactly and efficiently by AND expressions alone. We introduce an OR relationship at the leaf-node level to loosen this restriction on the DT. Preliminary experimental results show that EDTOR can either 1) greatly reduce the number of leaf nodes (i.e., the model size) without affecting speech synthesis performance, which is appealing for embedded applications, or 2) slightly improve performance relative to a DT with the same number of leaf nodes.
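The contrast between AND-only leaf conditions and OR at the leaf level can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the question names (`C-Vowel`, `L-Nasal`), the toy tree, and the hand-picked grouping of leaves A and D are all hypothetical, and the abstract does not say how EDTOR chooses which leaves to combine. The sketch only shows that each conventional leaf corresponds to one conjunction of answers, while an OR-combined group accepts a context if any of its member paths matches.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

Context = Dict[str, str]
Path = Tuple[Tuple[str, bool], ...]  # sequence of (question name, required answer)

# Hypothetical yes/no context questions (illustrative, not from the paper).
QUESTIONS: Dict[str, Callable[[Context], bool]] = {
    "C-Vowel": lambda ctx: ctx["phone"] in {"a", "e", "i", "o", "u"},
    "L-Nasal": lambda ctx: ctx["left"] in {"m", "n"},
}


@dataclass
class Node:
    question: Optional[str] = None  # None marks a leaf
    yes: Optional["Node"] = None
    no: Optional["Node"] = None
    leaf_id: Optional[str] = None


def leaf_paths(node: Node, path: Path = ()) -> Dict[str, Path]:
    """Map each leaf id to the single root-to-leaf path (an AND of answers)."""
    if node.question is None:
        return {node.leaf_id: path}
    paths = leaf_paths(node.yes, path + ((node.question, True),))
    paths.update(leaf_paths(node.no, path + ((node.question, False),)))
    return paths


def matches(path: Path, ctx: Context) -> bool:
    """AND logic: every question on the path must give the required answer."""
    return all(QUESTIONS[q](ctx) == answer for q, answer in path)


def find_group(groups: Dict[str, List[Path]], ctx: Context) -> str:
    """OR logic at leaf level: a group matches if any of its paths matches."""
    for name, paths in groups.items():
        if any(matches(p, ctx) for p in paths):
            return name
    raise ValueError("no group matches this context")


# Toy conventional tree with four leaves A, B, C, D.
TREE = Node(
    "C-Vowel",
    yes=Node("L-Nasal", yes=Node(leaf_id="A"), no=Node(leaf_id="B")),
    no=Node("L-Nasal", yes=Node(leaf_id="C"), no=Node(leaf_id="D")),
)
PATHS = leaf_paths(TREE)

# Hand-picked grouping (an assumption): leaves A and D share one parameter set.
GROUPS: Dict[str, List[Path]] = {
    "A|D": [PATHS["A"], PATHS["D"]],
    "B": [PATHS["B"]],
    "C": [PATHS["C"]],
}
```

In this toy setup the four AND-defined leaves collapse into three parameter sets. The combined condition for `A|D` (vowel with a nasal to the left, or non-vowel without one) cannot be expressed as a single leaf of any tree built from these two questions, which is the kind of case the abstract describes as poorly served by AND expressions alone.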


Pages: 225-229

Page count: 5
