Development of syllable-based text to speech synthesis system in Bengali

Cited by: 47
Authors
Narendra, N. [1]
Rao, K. [1]
Ghosh, Krishnendu [1]
Vempada, Ramu [1]
Maity, Sudhamay [1]
Affiliations
[1] Indian Inst Technol Kharagpur, Sch Informat Technol, Kharagpur 721302, W Bengal, India
Keywords
Text to speech synthesis; Unrestricted TTS; Prototype TTS; Bengali TTS; Festival
DOI
10.1007/s10772-011-9094-4
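Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract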
This paper presents the design and development of an unrestricted text-to-speech (TTS) synthesis system for the Bengali language. An unrestricted TTS system is capable of synthesizing good-quality speech across different domains. In this work, syllables are used as the basic units for synthesis, and the Festival framework is used to build the TTS system. Speech collected from a female artist is used as the speech corpus. Initially, speech is collected from five speakers and a prototype TTS system is built for each of them. The best of the five speakers is selected through subjective and objective evaluation of natural and synthesized waveforms. The unrestricted TTS system is then developed by addressing the issues involved at each stage in order to produce a good-quality synthesizer. Evaluation is carried out in four stages by conducting objective and subjective listening tests on the synthesized speech. In the first stage, the TTS system is built with the basic Festival framework. In the following stages, additional features are incorporated into the system and the quality of synthesis is evaluated. The subjective and objective measures indicate that the proposed features and methods improve the quality of the synthesized speech from stage 2 to stage 4.
Pages: 167-181
Page count: 15