Unit-Selection Speech Synthesis Method Using Words as Search Units

Cited by: 0

Authors:

Segi, Hiroyuki ^{[1]}

Affiliations:

[1] Seikei Univ, Dept Comp & Informat Sci, Tokyo, Japan

Source:

INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT | 2016 / Vol. 7 / No. 2

Keywords:

Broadcast Program; Mean Opinion Score; Search Unit; Speech Database; Speech Synthesis; Unit Selection; Word Unit;

DOI:

10.4018/IJMDEM.2016040104

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification codes:

081202 ; 0835 ;

Abstract:

Most unit-selection speech-synthesis systems use rather short search units, such as syllables, phonemes, and diphones. However, when applied to large speech databases, shorter units produce more voice-waveform candidates, so a larger speech database cannot be used in practice without narrow pruning, and narrow pruning impairs the quality of the synthesized speech. Here the author examined the possibility of using words as search units. Subjective evaluations indicated that 70% of the speech synthesized by the proposed method sounded more natural than that synthesized by a conventional method. The five-point mean opinion score of the synthesized speech was 3.5, and 21% was judged to sound as natural as human speech. These results demonstrate the effectiveness of unit-selection speech synthesis using words as search units.

Pages: 53-67

Page count: 15
