A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis

被引：0

作者：

Maia, Ranniery

Toda, Tomoki

Tokuda, Keiichi

Sakai, Shinsuke

Nakamura, Satoshi

机构：

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

speech synthesis; HMM-based speech synthesis; decision tree-based clustering; residual modeling;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a decision tree-based algorithm to cluster residual segments assuming an excitation model based on state-dependent filtering of pulse train and white noise. The decision tree construction principle is the same as the one applied to speech recognition. Here parent nodes are split using the residual maximum likelihood criterion. Once these excitation decision trees are constructed for residual signals segmented by full context models, using questions related to the full context of the training sentences, they can be utilized for excitation modeling in speech synthesis based on hidden Markov models (HMM). Experimental results have shown that the algorithm in question is very effective in terms of clustering residual signals given segmentation. pitch marks and full context questions, resulting in filters with good residual modeling properties.

引用

页码：1743 / 1746

页数：4

共 50 条

[21] Soft context clustering for F0 modeling in HMM-based speech synthesis
Soheil Khorram
Hossein Sameti
Simon King
EURASIP Journal on Advances in Signal Processing, 2015
[22] Voiced/Unvoiced Decision Algorithm for HMM-based Speech Synthesis
Kang, Shiyin
Shuang, Zhiwei
Duan, Quansheng
Qin, Yong
Cai, Lianhong
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 420 - +
[23] Decision Tree Based Context Clustering with Cross Likelihood Ratio for HMM-based TTS
Jung, Chi-Sang
Kang, Hong-Goo
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2013, 32 (02): : 174 - 180
[24] Inverse filter based excitation model for HMM-based speech synthesis system
Reddy, Mittapalle Kiran
Rao, Krothapalli Sreenivasa
IET SIGNAL PROCESSING, 2018, 12 (04) : 544 - 548
[25] Croatian HMM-based speech synthesis
Department of Informatics, Faculty of Philosophy, University of Rijeka, Omladinska 14, Rijeka
51000, Croatia
J. Compt. Inf. Technol., 2006, 4 (307-313):
[26] HMM-Based Vietnamese Speech Synthesis
Trinh Quoc Son
2015 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2015, : 349 - 353
[27] Czech HMM-Based Speech Synthesis
Hanzlicek, Zdenek
TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 291 - 298
[28] Robustness of HMM-based Speech Synthesis
Yamagishi, Junichi
Ling, Zhenhua
King, Simon
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 581 - 584
[29] SPEECH-LAUGHS: AN HMM-BASED APPROACH FOR AMUSED SPEECH SYNTHESIS
El Haddad, Kevin
Dupont, Stephane
Urbain, Jerome
Dutoit, Thierry
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4939 - 4943
[30] Arabic HMM-based Speech Synthesis
Khalil, Krichi Mohamed
Adnan, Cherif
2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 450 - 454

← 1 2 3 4 5 →