Continuity Metric for Unit Selection based Text-to-Speech Synthesis

被引：0

作者：

Lakkavalli, Vikram Ramesh ^{[1
]}

Arulmozhi, P. ^{[1
]}

Ramakrishnan, A. G. ^{[1
]}

机构：

[1] Indian Inst Sci, Med Intelligence & Language Engn MILE Lab, Dept Elect Engn, Bangalore 560012, Karnataka, India

来源：

2010 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM) | 2010年

关键词：

unit selection; MFCC; unit continuity metric; pitch continuity metric; spectral continuity metric; MILE-TTS; part-of-speech; pause model; Tamil; Kannada;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

A new method based on unit continuity metric (UCM) is proposed for optimal unit selection in text-to-speech (TTS) synthesis. UCM employs two features, namely, pitch continuity metric and spectral continuity metric. The methods have been implemented and tested on our test bed called MILE-TTS and it is available as web demo. After verification by a self selection test, the algorithms are evaluated on 8 paragraphs each for Kannada and Tamil by native users of the languages. Mean-opinion-score (MOS) shows that naturalness and comprehension are better with UCM based algorithm than the non-UCM based ones. The naturalness of the TTS output is further enhanced by a new rule based algorithm for pause prediction for Tamil language. The pauses between the words are predicted based on parts-of-speech information obtained from the input text.

引用

页数：5

共 50 条

[1] Efficient Unit-Selection in Text-to-Speech Synthesis
Mihelic, Ales
Gros, Jerneja Zganec
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 411 - 418
[2] Diphone-based unit selection for Catalan text-to-speech synthesis
Guaus, R
Iriondo, I
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 277 - 282
[3] PERCEPTUAL CLUSTERING BASED UNIT SELECTION OPTIMIZATION FOR CONCATENATIVE TEXT-TO-SPEECH SYNTHESIS
Jiang, Tao
Wu, Zhiyong
Jia, Jia
Cai, Lianhong
[J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 64 - 68
[4] Embedded Unit Selection Text-to-Speech Synthesis for Mobile Devices
Karabetsos, Sotiris
Tsiakoulis, Pirros
Chalamandaris, Aimilios
Raptis, Spyros
[J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (02) : 613 - 621
[5] An Overview of the ILSP Unit Selection Text-to-Speech Synthesis System
Tsiakoulis, Pirros
Karabetsos, Sotiris
Chalamandaris, Aimilios
Raptis, Spyros
[J]. ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS, 2014, 8445 : 370 - 383
[6] Globally optimal training of unit boundaries in unit selection text-to-speech synthesis
Bellegarda, Jerome R.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 957 - 965
[7] A Dynamic Cost Weighting Framework for Unit Selection Text-to-Speech Synthesis
Bellegarda, Jerome R.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1455 - 1463
[8] Including Pitch Accent Optionality in Unit Selection Text-to-Speech Synthesis
Badino, Leonardo
Clark, Robert A. J.
Strom, Volker
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2118 - 2121
[9] High quality Arabic text-to-speech synthesis using unit selection
Abdelmalek, Raja
Mnasri, Zied
[J]. 2016 13TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2016, : 1 - 5
[10] Syllable specific unit selection cost functions for text-to-speech synthesis
Narendra, N.P.
Sreenivasa Rao, K.
[J]. ACM Transactions on Speech and Language Processing, 2012, 9 (03):

← 1 2 3 4 5 →