Recent Trends in Text to Speech Synthesis of Indian Languages

被引:0
|
作者
Joshi, Sarang L. [1 ]
Bairagi, Vinayak K. [1 ]
机构
[1] AISSMS IOIT, Pune, Maharashtra, India
来源
HELIX | 2019年 / 9卷 / 03期
关键词
Concatenative; Prosody; Speech Synthesis; Syllable; TTS; Text to Speech;
D O I
10.29042/2019-4931-4936
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
A Text To Speech (TTS) synthesizer is a computer application capable of converting arbitrary input text into speech. This conversion broadly involves two steps, namely, text processing and speech synthesis. Text processing converts the entered text to a sequence of synthesis units, while speech synthesis is the generation of an acoustic wave form corresponding to each of these units. Naturalness and intelligibility are the most important qualities expected from a TTS system. In this paper we aim to provide an overview of various techniques for text to speech synthesis, discuss their characteristics, summarize and compares advantages and drawbacks. We have listed various Text-to-Speech synthesis frameworks developed and implemented at different Indian institutes.
引用
收藏
页码:4931 / 4936
页数:6
相关论文
共 50 条
  • [21] Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration
    Yeshpanov, Rustem
    Mussakhojayeva, Saida
    Khassanov, Yerbolat
    arXiv, 2023,
  • [22] Meta Learning Text-to-Speech Synthesis in over 7000 Languages
    Lux, Florian
    Meyer, Sarina
    Behringer, Lyonel
    Zalkow, Frank
    Do, Phat
    Coler, Matt
    Habets, Emanuel A. P.
    Ngoc Thang Vu
    INTERSPEECH 2024, 2024, : 4958 - 4962
  • [23] Survey of Issues with Text to Speech Synthesis of Multilingual Indian Texts
    Krishnamoorthy, Suban
    Suen, Ching Y.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (ICPRAI 2018), 2018, : 360 - 365
  • [24] Speech to Text Conversion for Multilingual Languages
    Ghadage, Yogita H.
    Shelke, Sushama D.
    2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 236 - 240
  • [25] RECENT BOOKS ON INDIAN LANGUAGES
    LOZANO, E
    LATIN AMERICAN INDIAN LITERATURES JOURNAL, 1988, 4 (01): : 85 - 96
  • [26] Text Normalisation in Text-to-Speech Synthesis for South African Languages: Native Number Expansion
    Schlunz, Georg I.
    Dlamini, Nkosikhona
    Tshoane, Alfred
    Ramunyisi, Stan
    2017 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS (PRASA-ROBMECH), 2017, : 230 - 235
  • [27] AUTOMATIC TEXT SUMMARIZATION FOR INDIAN LANGUAGES
    Kumar, Jeetendra
    Shekhar, Shashi
    Gupta, Rashmi
    EVERYMANS SCIENCE, 2022, 57 (01):
  • [28] Part-of-Speech Tagging and Chunking in Text-to-Speech Synthesis for South African Languages
    Schlunz, Georg I.
    Dlamini, Nkosikhona
    Kruger, Rynhardt P.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3554 - 3558
  • [29] Indian Languages Corpus for Speech Recognition
    Basu, Joyanta
    Khan, Soma
    Roy, Rajib
    Saxena, Babita
    Ganguly, Dipankar
    Arora, Sunita
    Arora, Karunesh Kumar
    Bansal, Shweta
    Agrawal, Shyam Sunder
    2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 13 - 18
  • [30] RECENT ADVANCES IN ROMANIAN LANGUAGE TEXT-TO-SPEECH SYNTHESIS
    Burileanu, Dragos
    Negrescu, Cristian
    Surmei, Mihai
    PROCEEDINGS OF THE ROMANIAN ACADEMY SERIES A-MATHEMATICS PHYSICS TECHNICAL SCIENCES INFORMATION SCIENCE, 2010, 11 (01): : 92 - 99