Unit-Selection Speech Synthesis Method Using Words as Search Units

Cited by: 0
Authors
Segi, Hiroyuki [1]
Affiliations
[1] Seikei Univ, Dept Comp & Informat Sci, Tokyo, Japan
Keywords
Broadcast Program; Mean Opinion Score; Search Unit; Speech Database; Speech Synthesis; Unit Selection; Word Unit
DOI
10.4018/IJMDEM.2016040104
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Most unit-selection speech-synthesis systems proposed to date use rather short search units, such as syllables, phonemes, and diphones. However, when applied to large speech databases, shorter units produce more voice-waveform candidates, so a larger speech database cannot be used in practice without narrow pruning, and narrow pruning impairs the quality of the synthesized speech. Here the author examined the possibility of using words as search units. Subjective evaluations indicated that 70% of the speech synthesized by the proposed method sounded more natural than that synthesized by a conventional method. The five-point mean opinion score of the synthesized speech was 3.5, and 21% was judged to sound as natural as human speech. These results demonstrate the effectiveness of unit-selection speech synthesis using words as search units.
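The abstract's point that narrow pruning can degrade the result may be easier to see with a toy example. The sketch below is not the author's method, which the abstract does not specify. It is a generic beam-pruned unit-selection search under assumed costs: the `Unit` record, the `target_cost` and `join_cost` functions, and all numeric values are hypothetical and chosen only to show the effect.

```python
from typing import Callable, List, NamedTuple, Sequence, Tuple


class Unit(NamedTuple):
    """Hypothetical waveform candidate: a label, a pitch, and a target cost."""
    name: str
    pitch: float
    cost: float


def select_units(
    candidates: Sequence[Sequence[Unit]],
    target_cost: Callable[[int, Unit], float],
    join_cost: Callable[[Unit, Unit], float],
    beam_width: int,
) -> Tuple[float, List[Unit]]:
    """Pick one candidate per position, keeping only `beam_width` paths per step."""
    if beam_width < 1:
        raise ValueError("beam_width must be at least 1")
    if not candidates or any(len(c) == 0 for c in candidates):
        raise ValueError("every position needs at least one candidate")

    # Each hypothesis is (accumulated cost, path of units chosen so far).
    beam = sorted(
        ((target_cost(0, u), [u]) for u in candidates[0]),
        key=lambda h: h[0],
    )[:beam_width]

    for pos in range(1, len(candidates)):
        extended = []
        for u in candidates[pos]:
            # Cheapest surviving predecessor path for this candidate.
            cost, path = min(
                ((c + join_cost(p[-1], u), p) for c, p in beam),
                key=lambda h: h[0],
            )
            extended.append((cost + target_cost(pos, u), path + [u]))
        # Pruning step: a narrow beam discards paths that may win later.
        beam = sorted(extended, key=lambda h: h[0])[:beam_width]

    return beam[0]


def target_cost(pos: int, u: Unit) -> float:
    # Assumed: the per-unit mismatch is stored directly on the unit.
    return u.cost


def join_cost(a: Unit, b: Unit) -> float:
    # Assumed: concatenation cost grows with the pitch jump at the boundary.
    return abs(a.pitch - b.pitch) / 10


CANDIDATES = [
    [Unit("A1", 100, 1.0), Unit("A2", 200, 0.0)],
    [Unit("B1", 100, 0.0), Unit("B2", 200, 5.0)],
]

if __name__ == "__main__":
    for width in (2, 1):
        cost, path = select_units(CANDIDATES, target_cost, join_cost, width)
        print(width, cost, [u.name for u in path])
```

With these made-up values, a beam of 2 finds the path A1, B1 at total cost 1.0, while a beam of 1 discards A1 after the first position and ends with A2, B2 at cost 5.0. Shorter units mean more positions and more candidates per position, so a fixed beam covers a smaller share of the search space.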
Pages: 53-67
Page count: 15
Related papers
50 in total
  • [1] Hybrid statistical/unit-selection Turkish speech synthesis using suffix units
    Cenk Demiroğlu
    Ekrem Güner
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2016
  • [2] Hybrid statistical/unit-selection Turkish speech synthesis using suffix units
    Demiroglu, Cenk
    Guner, Ekrem
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2016: 1-16
  • [3] Expressive Prosody for Unit-selection Speech Synthesis
    Strom, Volker
    Clark, Robert
    King, Simon
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006: 1296-1299
  • [4] An efficient unit-selection method for embedded concatenative speech synthesis
    Gros, Jerneja Zganec
    Zganec, Mario
    [J]. INFORMACIJE MIDEM-JOURNAL OF MICROELECTRONICS ELECTRONIC COMPONENTS AND MATERIALS, 2007, 37(03): 158-164
  • [5] An efficient unit-selection method for concatenative Text-to-speech synthesis systems
    Gros, Jerneja Zganec
    Zganec, Mario
    [J]. Journal of Computing and Information Technology, 2008, 16(01): 69-78
  • [6] Efficient Unit-Selection in Text-to-Speech Synthesis
    Mihelic, Ales
    Gros, Jerneja Zganec
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246: 411-418
  • [7] On the Impact of Annotation Errors on Unit-Selection Speech Synthesis
    Matousek, Jindrich
    Tihelka, Daniel
    Smidl, Lubos
    [J]. TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499: 456-463
  • [8] PROSODIC CONTROL OF UNIT-SELECTION SPEECH SYNTHESIS: A PROBABILISTIC APPROACH
    Veaux, Christophe
    Rodet, Xavier
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011: 5360-5363
  • [9] Automatic Duration Weighting in Thai Unit-selection Speech Synthesis
    Saychum, S.
    Rugchatjaroen, A.
    Thatphithakkul, N.
    Wutiwiwatchai, C.
    Thangthai, A.
    [J]. ECTI-CON 2008: PROCEEDINGS OF THE 2008 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008: 549-552
  • [10] Slovak speech database for experiments and application building in unit-selection speech synthesis
    Rusko, M
    Trnka, M
    Darzágín, S
    Cernak, M
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206: 457-464