Evaluation of Prosody in Text-to-Speech Synthesis System of Bangla

被引：0

作者：

Basu, Tulika ^{[1
]}

Saha, Arup ^{[1
]}

机构：

[1] Ctr Dev Adv Comp, Adv Speech Proc Grp, Kolkata, India

来源：

2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE) | 2013年

关键词：

Prosody; intonation; evaluation; perception;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In speech synthesis the role of prosody is very crucial. To make the synthesized speech more natural and soothing to the human ears various prosody and intonation model together with emotional model have been experimented over last few decades. Apart from the segmental quality and voice characteristics, it depends mostly on the quality of the prosody model which is responsible for the naturalness of any TTS system. But as it is very hard to evaluate prosody model in an objective way, a perceptual comparison method is adopted in this work to evaluate prosody model.

引用

页数：6

共 50 条

[31] A statistical model with hierarchical structure for predicting prosody in a mandarin text-to-speech system
Yu, MS
Pan, NH
JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2005, 28 (03) : 385 - 399
[32] Speech Synthesis for Bangla Text to Speech Conversion
Arafat, Mohammad Yasir
Fahrin, Sanjana
Islam, Md. Jamirul
Siddiquee, Md. Ashraf
Khan, Afsana
Kotwal, Mohammed Rokibul Alam
Huda, Mohammad Nurul
8TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA 2014), 2014,
[33] DIPHONES EVALUATION FOR TEXT-TO-SPEECH SYNTHESIS OF ITALIAN.
Salza, P.L.
Sandri, S.
Foti, E.
CSELT Technical Reports, 1988, 16 (01): : 9 - 11
[34] Text and Speech Corpora for Text-To-Speech Synthesis of Tales
Doukhan, David
Rosset, Sophie
Rilliard, Albert
d'Alessandro, Christophe
Adda-Decker, Martine
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1003 - 1010
[35] Optimisation of artificial neural network topology applied in the prosody control in text-to-speech synthesis
Sebesta, V
Tucková, J
SOFSEM 2000: THEORY AND PRACTICE OF INFORMATICS, 2000, 1963 : 420 - 430
[36] Multilingual text-to-speech synthesis
Black, AW
Lenzo, KA
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 761 - 764
[37] Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks
Reddy, V. Ramu
Rao, K. Sreenivasa
NEUROCOMPUTING, 2016, 171 : 1323 - 1334
[38] An introduction to text-to-speech synthesis
Fitzpatrick, E
COMPUTATIONAL LINGUISTICS, 1998, 24 (02) : 322 - 323
[39] Improving text-to-speech synthesis
Tatham, M
Lewis, E
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1856 - 1859
[40] Issues in text-to-speech synthesis
Macchi, M
IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS - PROCEEDINGS, 1998, : 318 - 325

← 1 2 3 4 5 →