Evaluation of Finnish Unit Selection and HMM-based Speech Synthesis

Cited by: 0
Authors
Silen, Hanna [1]
Helander, Elina [1]
Nurminen, Jani [2]
Gabbouj, Moncef [1]
Affiliations
[1] Tampere Univ Technol, Dept Signal Proc, Tampere, Finland
[2] Nokia Devices R&D, Tampere, Finland
Funding
Academy of Finland;
Keywords
speech synthesis; unit selection; hidden Markov models;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Unit selection and hidden Markov model (HMM) based synthesis have become the dominant techniques in text-to-speech (TTS) research. In this work, we combine HMM-based signal generation with the front end originally designed for unit selection based Finnish TTS, and we evaluate the prosody of the output generated by the two synthesis techniques using the same speech database. Furthermore, we study the effect that the training set size has on prosody and intelligibility in HMM-based synthesis. The results indicate that the HMM-based approach is capable of providing better prosody than unit selection even if the training set size is severely limited. The size of the training set, however, affects the prosodic quality and intelligibility of the HMM-based synthesizer.
Pages: 1853 / +
Number of pages: 2