Evaluation of Finnish Unit Selection and HMM-based Speech Synthesis

Cited by: 0

Authors:

Silen, Hanna [1]

Helander, Elina [1]

Nurminen, Jani [2]

Gabbouj, Moncef [1]

Affiliations:

[1] Tampere Univ Technol, Dept Signal Proc, Tampere, Finland

[2] Nokia Devices R&D, Tampere, Finland

Source:

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008

Funding:

Academy of Finland

Keywords:

speech synthesis; unit selection; hidden Markov models

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Unit selection and hidden Markov model (HMM) based synthesis have become the dominant techniques in text-to-speech (TTS) research. In this work, we combine HMM-based signal generation with a front end originally designed for unit selection based Finnish TTS, and we evaluate the prosody of the output generated by the two synthesis techniques using the same speech database. Furthermore, we study the effect of training set size on prosody and intelligibility in HMM-based synthesis. The results indicate that the HMM-based approach is capable of providing better prosody than unit selection even if the training set size is severely limited. The size of the training set, however, affects the prosodic quality and intelligibility of the HMM-based synthesizer.

Pages: 1853 / +

Page count: 2

