IndicSpeech: Text-to-Speech Corpus for Indian Languages

被引：0

作者：

Srivastava, Nimisha ^{[1
]}

Mukhopadhyay, Rudrabha ^{[1
]}

Prajwal, K. R. ^{[1
]}

Jawahar, C., V ^{[1
]}

机构：

[1] IIIT Hyderabad, Hyderabad, India

来源：

PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020年

关键词：

Text-to-speech; Indian languages; TTS corpus;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

India is a country where several tens of languages are spoken by over a billion strong population. Text-to-speech systems for such languages will thus be extremely beneficial for wide-spread content creation and accessibility. Despite this, the current TTS systems for even the most popular Indian languages fall short of the contemporary state-of-the-art systems for English, Chinese, etc. We believe that one of the major reasons for this is the lack of large, publicly available text-to-speech corpora in these languages that are suitable for training neural text-to-speech systems. To mitigate this, we release a 24 hour text-to-speech corpus for 3 major Indian languages namely Hindi, Malayalam and Bengali. In this work, we also train a state-of-the-art TTS system for each of these languages and report their performances. The collected corpus, code, and trained models are made publicly available.

引用

页码：6417 / 6422

页数：6

共 50 条

[31] TEXT-TO-SPEECH SYNTHESIS
SPROAT, RW
OLIVE, JP
[J]. AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 35 - 44
[32] The Art of Text-to-Speech
Lindquist, Benjamin
[J]. CRITICAL INQUIRY, 2024, 50 (02) : 225 - 251
[33] Software text-to-speech
Hallahan W.I.
[J]. International Journal of Speech Technology, 1997, 1 (2) : 121 - 134
[34] Text-to-speech for customers
不详
[J]. EXPERT SYSTEMS, 1998, 15 (01) : 66 - 66
[35] A Hybrid HMM-Waveglow based Text-to-speech Synthesizer using Histogram Equalization for Low resource Indian Languages
Kumar, Mano Ranjith M.
Srivastava, Sudhanshu
Prakash, Anusha
Murthy, Hema A.
[J]. INTERSPEECH 2020, 2020, : 2037 - 2041
[36] Recent Trends in Text to Speech Synthesis of Indian Languages
Joshi, Sarang L.
Bairagi, Vinayak K.
[J]. HELIX, 2019, 9 (03): : 4931 - 4936
[37] NORMALIZATION OF TEXT MESSAGES FOR TEXT-TO-SPEECH
Pennell, Deana L.
Liu, Yang
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4842 - 4845
[38] Text and Speech Corpora for Text-To-Speech Synthesis of Tales
Doukhan, David
Rosset, Sophie
Rilliard, Albert
d'Alessandro, Christophe
Adda-Decker, Martine
[J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1003 - 1010
[39] Hierarchical Transfer Learning for Text-to-Speech in Indonesian, Java']Javanese, and Sundanese Languages
Azizah, Kurniawati
Adriani, Mirna
[J]. ICACSIS 2020: 2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2020, : 421 - 428
[40] A set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese
Chou, FC
Tseng, CY
Lee, LS
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (07): : 481 - 494

← 1 2 3 4 5 →