Recent Trends in Text to Speech Synthesis of Indian Languages

Authors
Joshi, Sarang L. [1]
Bairagi, Vinayak K. [1]
Affiliation
[1] AISSMS IOIT, Pune, Maharashtra, India
Source
HELIX | 2019 / Vol. 9 / Issue 03
Keywords
Concatenative; Prosody; Speech Synthesis; Syllable; TTS; Text to Speech
DOI
10.29042/2019-4931-4936
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
A Text To Speech (TTS) synthesizer is a computer application capable of converting arbitrary input text into speech. This conversion broadly involves two steps: text processing and speech synthesis. Text processing converts the entered text into a sequence of synthesis units, while speech synthesis generates the acoustic waveform corresponding to each of these units. Naturalness and intelligibility are the most important qualities expected from a TTS system. In this paper we aim to provide an overview of various techniques for text to speech synthesis, discuss their characteristics, and summarize and compare their advantages and drawbacks. We also list various Text-to-Speech synthesis frameworks developed and implemented at different Indian institutes.
Pages: 4931-4936
Number of pages: 6