Unit-Selection Speech Synthesis Method Using Words as Search Units

Cited by: 0
Author
Segi, Hiroyuki [1]
Affiliation
[1] Seikei Univ, Dept Comp & Informat Sci, Tokyo, Japan
Keywords
Broadcast Program; Mean Opinion Score; Search Unit; Speech Database; Speech Synthesis; Unit Selection; Word Unit
DOI
10.4018/IJMDEM.2016040104
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Unit-selection speech-synthesis systems have been proposed. In most of these systems, the search units are rather short, such as syllables, phonemes, and diphones. However, when applied to large speech databases, shorter units produce more voice-waveform candidates, so a larger speech database cannot be used in practice without narrow pruning, and narrow pruning impairs the quality of the synthesized speech. Here, the author examined the possibility of using words as search units. Subjective evaluations indicated that 70% of the speech synthesized by the proposed method sounded more natural than that synthesized by a conventional method. The five-point mean opinion score of the synthesized speech was 3.5, and 21% was judged to sound as natural as human speech. These results demonstrate the effectiveness of unit-selection speech synthesis using words as search units.
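The abstract's point that narrow pruning can impair the selected result can be illustrated with a toy search. The following is a minimal sketch, not the author's implementation: the function `select_units`, the `beam_width` parameter, the unit names, and all cost values are hypothetical, chosen only to show how pruning hypotheses early can discard the path that would have been cheapest overall.

```python
from typing import Callable, List, Optional, Sequence, Tuple


def select_units(
    candidates: Sequence[Sequence[str]],
    target_cost: Callable[[int, str], float],
    join_cost: Callable[[str, str], float],
    beam_width: Optional[int] = None,
) -> Tuple[float, List[str]]:
    """Pick one candidate per slot, minimizing target plus join costs.

    beam_width=None keeps every hypothesis (exact search); a small value
    prunes narrowly before each extension step (hypothetical parameter).
    """
    # Each hypothesis is (accumulated cost, units chosen so far).
    beam = [(target_cost(0, u), [u]) for u in candidates[0]]
    for i in range(1, len(candidates)):
        if beam_width is not None:
            # Pruning: keep only the cheapest hypotheses so far.
            beam = sorted(beam, key=lambda h: h[0])[:beam_width]
        new_beam = []
        for u in candidates[i]:
            # Cheapest surviving predecessor for this candidate.
            cost, path = min(
                ((c + join_cost(p[-1], u), p) for c, p in beam),
                key=lambda h: h[0],
            )
            new_beam.append((cost + target_cost(i, u), path + [u]))
        beam = new_beam
    return min(beam, key=lambda h: h[0])


# Toy data (all values are made up for illustration).
SLOTS = [["a1", "a2"], ["b1", "b2"]]
TARGET = {"a1": 1.0, "a2": 2.0, "b1": 1.0, "b2": 1.0}
JOIN = {("a2", "b1"): 0.0}  # every other pair joins badly


def toy_target(i: int, unit: str) -> float:
    return TARGET[unit]


def toy_join(prev: str, unit: str) -> float:
    return JOIN.get((prev, unit), 5.0)


full = select_units(SLOTS, toy_target, toy_join)
pruned = select_units(SLOTS, toy_target, toy_join, beam_width=1)
print(full)    # exact search result
print(pruned)  # result after narrow pruning
```

In this toy setup the exact search finds the path through `a2`, which looks worse locally but joins smoothly, while the width-1 beam drops `a2` before its cheap join can be considered. With fewer, longer search units there are fewer candidates per slot, so less pruning is needed for the same budget; that general intuition is consistent with the abstract, but the sketch makes no claim about the paper's actual cost functions or pruning scheme.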
Pages: 53-67
Number of pages: 15