An efficient unit-selection method for concatenative Text-to-speech synthesis systems

Cited by: 5

Authors:

Gros, Jerneja Zganec [1]

Zganec, Mario [1]

Affiliations:

[1] Alpineon R and D, Language Technologies Group, Ulica Iga Grudna 15, Ljubljana, Slovenia

Source:

Journal of Computing and Information Technology | 2008 / Vol. 16 / No. 1

Keywords:

DOI:

10.2498/cit.1001049

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

This paper presents a method for selecting speech units for polyphone concatenative speech synthesis, in which simplified procedures for searching paths in a graph speed up unit selection with minimal effect on speech quality. The selected speech units are still optimal; only the costs of merging the units, on which the selection is based, are determined less accurately. Owing to its low processing-power and memory-footprint requirements, the method is suitable for use in embedded speech synthesizers.


Pages: 69-78

