Unit-Selection Speech Synthesis Method Using Words as Search Units

被引：0

作者：

Segi, Hiroyuki ^{[1
]}

机构：

[1] Seikei Univ, Dept Comp & Informat Sci, Tokyo, Japan

来源：

INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT | 2016年 / 7卷 / 02期

关键词：

Broadcast Program; Mean Opinion Score; Search Unit; Speech Database; Speech Synthesis; Unit Selection; Word Unit;

D O I：

10.4018/IJMDEM.2016040104

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Unit-selection speech-synthesis systems have been proposed. In most of the unit-selection speech-synthesis systems, search units are rather short such as syllables, phonemes and diphones. However, when applied to large speech databases, shorter units produce more voice-waveform candidates and a larger speech database cannot be used without narrow pruning for practical use. Narrow pruning impairs the quality of the synthesized speech. Here the author examined the possibility of using words as search units. Subjective evaluations indicated that 70% of the speech synthesized by the proposed method sounded more natural than that synthesized by a conventional method. The five-point mean opinion score of the synthesized speech was 3.5, and 21% was judged to sound as natural as human speech. These results demonstrate the effectiveness of unit-selection speech synthesis using words as search units.

引用

页码：53 / 67

页数：15

共 50 条

[1] Hybrid statistical/unit-selection Turkish speech synthesis using suffix units
Cenk Demiroğlu
Ekrem Güner
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2016
[2] Hybrid statistical/unit-selection Turkish speech synthesis using suffix units
Demiroglu, Cenk
Guner, Ekrem
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2016, : 1 - 16
[3] Expressive Prosody for Unit-selection Speech Synthesis
Strom, Volker
Clark, Robert
King, Simon
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1296 - 1299
[4] An efficient unit-selection method for embedded concatenative speech synthesis
Gros, Jerneja Zganec
Zganec, Mario
[J]. INFORMACIJE MIDEM-JOURNAL OF MICROELECTRONICS ELECTRONIC COMPONENTS AND MATERIALS, 2007, 37 (03): : 158 - 164
[5] An efficient unit-selection method for concatenative Text-to-speech synthesis systems
Gros, Jerneja Zganec
Zganec, Mario
[J]. Journal of Computing and Information Technology, 2008, 16 (01) : 69 - 78
[6] Efficient Unit-Selection in Text-to-Speech Synthesis
Mihelic, Ales
Gros, Jerneja Zganec
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 411 - 418
[7] On the Impact of Annotation Errors on Unit-Selection Speech Synthesis
Matousek, Jindrich
Tihelka, Daniel
Smidl, Lubos
[J]. TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 456 - 463
[8] PROSODIC CONTROL OF UNIT-SELECTION SPEECH SYNTHESIS: A PROBABILISTIC APPROACH
Veaux, Christophe
Rodet, Xavier
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5360 - 5363
[9] Automatic Duration Weighting in Thai Unit-selection Speech Synthesis
Saychum, S.
Rugchatjaroen, A.
Thatphithakkul, N.
Wutiwiwatchai, C.
Thangthai, A.
[J]. ECTI-CON 2008: PROCEEDINGS OF THE 2008 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 549 - 552
[10] Slovak speech database for experiments and application building in unit-selection speech synthesis
Rusko, M
Trnka, M
Darzágín, S
Cernak, M
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 457 - 464

← 1 2 3 4 5 →