IndicSpeech: Text-to-Speech Corpus for Indian Languages

被引：0

作者：

Srivastava, Nimisha ^{[1
]}

Mukhopadhyay, Rudrabha ^{[1
]}

Prajwal, K. R. ^{[1
]}

Jawahar, C., V ^{[1
]}

机构：

[1] IIIT Hyderabad, Hyderabad, India

来源：

PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020年

关键词：

Text-to-speech; Indian languages; TTS corpus;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

India is a country where several tens of languages are spoken by over a billion strong population. Text-to-speech systems for such languages will thus be extremely beneficial for wide-spread content creation and accessibility. Despite this, the current TTS systems for even the most popular Indian languages fall short of the contemporary state-of-the-art systems for English, Chinese, etc. We believe that one of the major reasons for this is the lack of large, publicly available text-to-speech corpora in these languages that are suitable for training neural text-to-speech systems. To mitigate this, we release a 24 hour text-to-speech corpus for 3 major Indian languages namely Hindi, Malayalam and Bengali. In this work, we also train a state-of-the-art TTS system for each of these languages and report their performances. The collected corpus, code, and trained models are made publicly available.

引用

页码：6417 / 6422

页数：6

共 50 条

[1] An efficient model for text-to-speech synthesis in Indian languages
Panda, Soumya Priyadarsini
Nayak, Ajit Kumar
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (03) : 305 - 315
[2] Shruti: an embedded text-to-speech system for Indian languages
Mukhopadhyay, A.
Chakraborty, S.
Choudhury, M.
Lahiri, A.
Dey, S.
Basu, A.
[J]. IEE PROCEEDINGS-SOFTWARE, 2006, 153 (02): : 75 - 79
[3] Resyllabification in Indian Languages and its Implications in Text-to-speech Systems
Mahesh, M.
Prakash, Jeena J.
Murthy, Hema A.
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 212 - 216
[4] Significance of knowledge sources for a text-to-speech system for Indian languages
Yegnanarayana, B.
Rajendran, S.
Ramachandran, V.R.
Madhukumar, A.S.
[J]. Sadhana - Academy Proceedings in Engineering Sciences, 1994, 19 (pt 1)
[5] SIGNIFICANCE OF KNOWLEDGE SOURCES OR A TEXT-TO-SPEECH SYSTEM FOR INDIAN LANGUAGES
YEGNANARAYANA, B
RAJENDRAN, S
RAMACHANDRAN, VR
MADHUKUMAR, AS
[J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1994, 19 : 147 - 169
[6] An Approach to Building Language-Independent Text-to-Speech Synthesis for Indian Languages
Prakash, Anusha
Reddy, M. Ramasubba
Nagarajan, T.
Murthy, Hema A.
[J]. 2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,
[7] RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis
Zandie, Rohola
Mahoor, Mohammad H.
Madsen, Julia
Emamian, Eshrat S.
[J]. INTERSPEECH 2021, 2021, : 2751 - 2755
[8] BOOTSTRAPPING TEXT-TO-SPEECH FOR SPEECH PROCESSING IN LANGUAGES WITHOUT AN ORTHOGRAPHY
Sitaram, Sunayana
Palkar, Sukhada
Chen, Yun-Nung
Parlikar, Alok
Black, Alan W.
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7992 - 7996
[9] Indian Languages Corpus for Speech Recognition
Basu, Joyanta
Khan, Soma
Roy, Rajib
Saxena, Babita
Ganguly, Dipankar
Arora, Sunita
Arora, Karunesh Kumar
Bansal, Shweta
Agrawal, Shyam Sunder
[J]. 2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 13 - 18
[10] Text-to-speech synthesis with an Indian language perspective
Panda, Soumya Priyadarsini
Nayak, Ajit Kumar
Patnaik, Srikanta
[J]. INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2015, 6 (3-4) : 170 - 178

← 1 2 3 4 5 →