IndicSpeech: Text-to-Speech Corpus for Indian Languages

被引:0
|
作者
Srivastava, Nimisha [1 ]
Mukhopadhyay, Rudrabha [1 ]
Prajwal, K. R. [1 ]
Jawahar, C., V [1 ]
机构
[1] IIIT Hyderabad, Hyderabad, India
关键词
Text-to-speech; Indian languages; TTS corpus;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
India is a country where several tens of languages are spoken by over a billion strong population. Text-to-speech systems for such languages will thus be extremely beneficial for wide-spread content creation and accessibility. Despite this, the current TTS systems for even the most popular Indian languages fall short of the contemporary state-of-the-art systems for English, Chinese, etc. We believe that one of the major reasons for this is the lack of large, publicly available text-to-speech corpora in these languages that are suitable for training neural text-to-speech systems. To mitigate this, we release a 24 hour text-to-speech corpus for 3 major Indian languages namely Hindi, Malayalam and Bengali. In this work, we also train a state-of-the-art TTS system for each of these languages and report their performances. The collected corpus, code, and trained models are made publicly available.
引用
收藏
页码:6417 / 6422
页数:6
相关论文
共 50 条
  • [1] An efficient model for text-to-speech synthesis in Indian languages
    Panda, Soumya Priyadarsini
    Nayak, Ajit Kumar
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (03) : 305 - 315
  • [2] Shruti: an embedded text-to-speech system for Indian languages
    Mukhopadhyay, A.
    Chakraborty, S.
    Choudhury, M.
    Lahiri, A.
    Dey, S.
    Basu, A.
    [J]. IEE PROCEEDINGS-SOFTWARE, 2006, 153 (02): : 75 - 79
  • [3] Resyllabification in Indian Languages and its Implications in Text-to-speech Systems
    Mahesh, M.
    Prakash, Jeena J.
    Murthy, Hema A.
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 212 - 216
  • [4] SIGNIFICANCE OF KNOWLEDGE SOURCES OR A TEXT-TO-SPEECH SYSTEM FOR INDIAN LANGUAGES
    YEGNANARAYANA, B
    RAJENDRAN, S
    RAMACHANDRAN, VR
    MADHUKUMAR, AS
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1994, 19 : 147 - 169
  • [5] An Approach to Building Language-Independent Text-to-Speech Synthesis for Indian Languages
    Prakash, Anusha
    Reddy, M. Ramasubba
    Nagarajan, T.
    Murthy, Hema A.
    [J]. 2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,
  • [6] RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis
    Zandie, Rohola
    Mahoor, Mohammad H.
    Madsen, Julia
    Emamian, Eshrat S.
    [J]. INTERSPEECH 2021, 2021, : 2751 - 2755
  • [7] BOOTSTRAPPING TEXT-TO-SPEECH FOR SPEECH PROCESSING IN LANGUAGES WITHOUT AN ORTHOGRAPHY
    Sitaram, Sunayana
    Palkar, Sukhada
    Chen, Yun-Nung
    Parlikar, Alok
    Black, Alan W.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7992 - 7996
  • [8] Indian Languages Corpus for Speech Recognition
    Basu, Joyanta
    Khan, Soma
    Roy, Rajib
    Saxena, Babita
    Ganguly, Dipankar
    Arora, Sunita
    Arora, Karunesh Kumar
    Bansal, Shweta
    Agrawal, Shyam Sunder
    [J]. 2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 13 - 18
  • [9] Text-to-speech synthesis with an Indian language perspective
    Panda, Soumya Priyadarsini
    Nayak, Ajit Kumar
    Patnaik, Srikanta
    [J]. INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2015, 6 (3-4) : 170 - 178
  • [10] LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
    Zen, Heiga
    Dang, Viet
    Clark, Rob
    Zhang, Yu
    Weiss, Ron J.
    Jia, Ye
    Chen, Zhifeng
    Wu, Yonghui
    [J]. INTERSPEECH 2019, 2019, : 1526 - 1530