IndicSpeech: Text-to-Speech Corpus for Indian Languages

Authors
Srivastava, Nimisha [1]
Mukhopadhyay, Rudrabha [1]
Prajwal, K. R. [1]
Jawahar, C. V. [1]
Affiliation
[1] IIIT Hyderabad, Hyderabad, India
Keywords
Text-to-speech; Indian languages; TTS corpus
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Subject classification code
081203; 0835
Abstract
India is a country where several tens of languages are spoken by a population of over a billion. Text-to-speech (TTS) systems for these languages would therefore be extremely beneficial for widespread content creation and accessibility. Despite this, current TTS systems for even the most popular Indian languages fall short of contemporary state-of-the-art systems for English, Chinese, etc. We believe that one of the major reasons for this is the lack of large, publicly available text-to-speech corpora in these languages that are suitable for training neural text-to-speech systems. To mitigate this, we release a 24-hour text-to-speech corpus for three major Indian languages: Hindi, Malayalam, and Bengali. In this work, we also train a state-of-the-art TTS system for each of these languages and report their performance. The collected corpus, code, and trained models are made publicly available.
Pages: 6417-6422
Page count: 6