A survey on speech synthesis techniques in Indian languages

被引：11

作者：

Panda, Soumya Priyadarsini ^{[1
]}

Nayak, Ajit Kumar ^{[2
]}

Rai, Satyananda Champati ^{[3
]}

机构：

[1] Silicon Inst Technol, Dept CSE, Bhubaneswar, Odisha, India

[2] Siksha O Anusandhan Univ, Dept CS & IT, Bhubaneswar, Odisha, India

[3] Silicon Inst Technol, Dept IT, Bhubaneswar, Odisha, India

来源：

MULTIMEDIA SYSTEMS | 2020年 / 26卷 / 04期

关键词：

Text to speech system; Speech synthesis; Indian languages; Concatenative synthesis; Formant synthesis; Articulatory synthesis; Syllable-based synthesis; HMM-based synthesis; Statistical parametric synthesis; Polyglot synthesis; Multilingual synthesis; Waveform concatenation; Deep learning; SYNTHESIS SYSTEM; ARTICULATORY SYNTHESIS; TEXT; SELECTION; INTELLIGIBILITY; FEATURES; GENERATION; QUALITY; IDENTIFICATION; ENHANCEMENT;

D O I：

10.1007/s00530-020-00659-4

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The text to speech technology has achieved significant progress during the past decade and is an active area of research and development in providing different human-computer interactive systems. Even though a number of speech synthesis models are available for different languages focusing on the domain requirements with many motive applications, a source of information on current trends in Indian language speech synthesis is unavailable till date making it difficult for the beginners to initiate research for the development of TTS systems for the low-resourced languages. This paper provides a review of the contributions made by different researchers in the field of Indian language speech synthesis along with a study on the Indian language characteristics and the associated challenges in designing TTS systems. A set of available applications and tools results out of different projects undertaken by different organizations along with a set of possible future developments are also discussed to provide a single reference to an important strand of research in speech synthesis which may benefit anyone interested to initiate research in this area.

引用

页码：453 / 478

页数：26

共 50 条

[41] PREFATORY NOTE + ANNUAL SURVEY OF INDIAN LANGUAGES AND LITERATURES
MALIK, K
[J]. INDIAN LITERATURE, 1980, 23 (06) : 5 - 5
[42] A SURVEY OF SPEECH BANDWIDTH COMPRESSION TECHNIQUES
CAMPANELLA, SJ
[J]. IRE TRANSACTIONS ON AUDIO, 1958, 6 (05): : 104 - 116
[43] A Survey: Speech Recognition Approaches and Techniques
Singh, Atma Prakash
Nath, Ravindra
Kumar, Santosh
[J]. 2018 5TH IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (UPCON), 2018, : 563 - 566
[44] SURVEY OF DIGITAL SPEECH PROCESSING TECHNIQUES
SCHAFER, RW
[J]. IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1972, AU20 (01): : 28 - +
[45] OBJECTIVES AND TECHNIQUES OF SPEECH SYNTHESIS
PETERSON, GE
SIVERTSEN, E
[J]. LANGUAGE AND SPEECH, 1960, 3 (02) : 84 - 95
[46] SEGMENTATION TECHNIQUES IN SPEECH SYNTHESIS
PETERSON, GE
WANG, WSY
SIVERTSEN, E
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1958, 30 (08): : 739 - 742
[47] Significance of knowledge sources for a text-to-speech system for Indian languages
Yegnanarayana, B.
Rajendran, S.
Ramachandran, V.R.
Madhukumar, A.S.
[J]. Sadhana - Academy Proceedings in Engineering Sciences, 1994, 19 (pt 1)
[48] Resyllabification in Indian Languages and its Implications in Text-to-speech Systems
Mahesh, M.
Prakash, Jeena J.
Murthy, Hema A.
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 212 - 216
[49] SIGNIFICANCE OF KNOWLEDGE SOURCES OR A TEXT-TO-SPEECH SYSTEM FOR INDIAN LANGUAGES
YEGNANARAYANA, B
RAJENDRAN, S
RAMACHANDRAN, VR
MADHUKUMAR, AS
[J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1994, 19 : 147 - 169
[50] Development of speech corpora for speaker recognition research and evaluation in Indian languages
Patil, Hemant
Basu, T.
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2008, 11 (01) : 17 - 32

← 1 2 3 4 5 →