Recent Trends in Text to Speech Synthesis of Indian Languages

Cited by: 0

Authors:

Joshi, Sarang L. [1]

Bairagi, Vinayak K. [1]

Affiliation:

[1] AISSMS IOIT, Pune, Maharashtra, India

Source:

HELIX | 2019 / Vol. 9 / Issue 03

Keywords:

Concatenative; Prosody; Speech Synthesis; Syllable; TTS; Text to Speech

DOI:

10.29042/2019-4931-4936

Chinese Library Classification (CLC) number:

Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]

Discipline classification codes:

071005; 0836; 090102; 100705

Abstract:

A Text-to-Speech (TTS) synthesizer is a computer application capable of converting arbitrary input text into speech. This conversion broadly involves two steps: text processing and speech synthesis. Text processing converts the entered text into a sequence of synthesis units, while speech synthesis generates the acoustic waveform corresponding to each of these units. Naturalness and intelligibility are the most important qualities expected from a TTS system. In this paper we provide an overview of various techniques for text-to-speech synthesis, discuss their characteristics, and summarize and compare their advantages and drawbacks. We also list various text-to-speech synthesis frameworks developed and implemented at different Indian institutes.

Pages: 4931-4936

Page count: 6
