ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages

被引：27

作者：

Singh, Amitoj ^{[1
]}

Kadyan, Virender ^{[2
]}

Kumar, Munish ^{[1
]}

Bassan, Nancy ^{[3
]}

机构：

[1] Maharaja Ranjit Singh Punjab Tech Univ, Dept Computat Sci, Bathinda, Punjab, India

[2] Chitkara Univ, Inst Engn & Technol, Dept Comp Sci & Engn, Rajpura, Punjab, India

[3] Baba Farid Coll Engn & Technol, Dept Mech Engn, Bathinda, Punjab, India

来源：

ARTIFICIAL INTELLIGENCE REVIEW | 2020年 / 53卷 / 05期

关键词：

Automatic speech recognition; Indian languages; Feature extraction techniques; Classification techniques; Speech corpus; EMOTION RECOGNITION; SPEAKER VERIFICATION; WORD RECOGNITION; SYSTEM; FEATURES; CLASSIFICATION; MODEL; PERFORMANCE; ACCURACY; DATABASE;

D O I：

10.1007/s10462-019-09775-8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

India is the land of language diversity with 22 major languages having more than 720 dialects, written in 13 different scripts. Out of 22, Hindi, Bengali, Punjabi is ranked 3rd, 7th and 10th most spoken languages around the globe. Expect Hindi, where one can find some significant research going on, other two major languages and other Indian languages have not fully developed Automatic Speech Recognition systems. The main aim of this paper is to provide a systematic survey of the existing literature related to automatic speech recognition (i.e. speech to text) for Indian languages. The survey analyses the possible opportunities, challenges, techniques, methods and to locate, appraise and synthesize the evidence from studies to provide empirical answers to the scientific questions. The survey was conducted based on the relevant research articles published from 2000 to 2018. The purpose of this systematic survey is to sum up the best available research on automatic speech recognition of Indian languages that is done by synthesizing the results of several studies.

引用

页码：3673 / 3704

页数：32

共 50 条

[21] Automatic Speech Recognition Advancements for Indigenous Languages of the Americas
Romero, Monica
Gomez-Canaval, Sandra
Torre, Ivan G.
APPLIED SCIENCES-BASEL, 2024, 14 (15):
[22] Hybrid deep learning based automatic speech recognition model for recognizing non-Indian languages
Gupta, Astha
Kumar, Rakesh
Kumar, Yogesh
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (10) : 30145 - 30166
[23] Hybrid deep learning based automatic speech recognition model for recognizing non-Indian languages
Astha Gupta
Rakesh Kumar
Yogesh Kumar
Multimedia Tools and Applications, 2024, 83 : 30145 - 30166
[24] A Comprehensive Examination of Phoneme Recognition in Automatic Speech Recognition Systems
Bhatt, Shobha
Bansal, Shweta
Kumar, Ankit
Pandey, Saroj Kumar
Ojha, Manoj Kumar
Singh, Kamred Udham
Chakraborty, Sanjay
Singh, Teekam
Swarup, Chetan
TRAITEMENT DU SIGNAL, 2023, 40 (05) : 1997 - 2008
[25] Integrated End-to-End Automatic Speech Recognition for Languages for Agglutinative Languages
Bekarystankyzy, Akbayan
Mamyrbayev, Orken
Anarbekova, Tolganay
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (06)
[26] A survey of technologies for automatic Dysarthric speech recognition
Qian, Zhaopeng
Xiao, Kejing
Yu, Chongchong
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
[27] Machine Learning in Automatic Speech Recognition: A Survey
Padmanabhan, Jayashree
Premkumar, Melvin Jose Johnson
IETE TECHNICAL REVIEW, 2015, 32 (04) : 240 - 251
[28] A Survey of Multilingual Models for Automatic Speech Recognition
Yadav, Hemant
Sitaram, Sunayana
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5071 - 5079
[29] A detailed survey of Turkish automatic speech recognition
Arslan, Recep Sinan
Barisci, Necaattin
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2020, 28 (06) : 3253 - 3269
[30] A survey of technologies for automatic Dysarthric speech recognition
Zhaopeng Qian
Kejing Xiao
Chongchong Yu
EURASIP Journal on Audio, Speech, and Music Processing, 2023

← 1 2 3 4 5 →