ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages

Cited by: 27
Authors
Singh, Amitoj [1]
Kadyan, Virender [2]
Kumar, Munish [1]
Bassan, Nancy [3]
Affiliations
[1] Maharaja Ranjit Singh Punjab Tech Univ, Dept Computat Sci, Bathinda, Punjab, India
[2] Chitkara Univ, Inst Engn & Technol, Dept Comp Sci & Engn, Rajpura, Punjab, India
[3] Baba Farid Coll Engn & Technol, Dept Mech Engn, Bathinda, Punjab, India
Keywords
Automatic speech recognition; Indian languages; Feature extraction techniques; Classification techniques; Speech corpus; EMOTION RECOGNITION; SPEAKER VERIFICATION; WORD RECOGNITION; SYSTEM; FEATURES; CLASSIFICATION; MODEL; PERFORMANCE; ACCURACY; DATABASE;
DOI
10.1007/s10462-019-09775-8
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
India is a land of linguistic diversity, with 22 major languages and more than 720 dialects, written in 13 different scripts. Of these 22, Hindi, Bengali, and Punjabi rank as the 3rd, 7th, and 10th most spoken languages in the world. Except for Hindi, where significant research is under way, the other two major languages and the remaining Indian languages do not yet have fully developed automatic speech recognition systems. The main aim of this paper is to provide a systematic survey of the existing literature on automatic speech recognition (i.e. speech to text) for Indian languages. The survey analyses the opportunities, challenges, techniques, and methods, and locates, appraises, and synthesizes the evidence from studies to provide empirical answers to the scientific questions. The survey is based on relevant research articles published from 2000 to 2018. Its purpose is to sum up the best available research on automatic speech recognition of Indian languages by synthesizing the results of several studies.
Pages: 3673 - 3704
Number of pages: 32
Related papers
50 in total
  • [41] Development of speech corpora for speaker recognition research and evaluation in Indian languages
    Patil, Hemant
    Basu, T.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2008, 11 (01) : 17 - 32
  • [42] Automatic speech recognition systems: A survey of discriminative techniques
    Kaur, Amrit Preet
    Singh, Amitoj
    Sachdeva, Rohit
    Kukreja, Vinay
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 13307 - 13339
  • [43] Automatic Speech Recognition Using Limited Vocabulary: A Survey
    Fendji, Jean Louis K. E.
    Tala, Diane C. M.
    Yenke, Blaise O.
    Atemkeng, Marcellin
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [44] Discriminative training of HMMs for automatic speech recognition: A survey
    Jiang, Hui
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04) : 589 - 608
  • [45] Adversarial Attacks on Automatic Speech Recognition (ASR): A Survey
    Bhanushali, Amisha Rajnikant
    Mun, Hyunjun
    Yun, Joobeom
    IEEE ACCESS, 2024, 12 : 88279 - 88302
  • [46] A Lightweight Downscaled Approach to Automatic Speech Recognition for Small Indigenous Languages
    Stan, George Vlad
    Baart, Andre
    Dittoh, Francis
    Akkermans, Hans
    Bon, Anna
    PROCEEDINGS OF THE 14TH ACM WEB SCIENCE CONFERENCE, WEBSCI 2022, 2022, : 451 - 458
  • [47] Digits to Words Converter for Slavic Languages in Systems of Automatic Speech Recognition
    Chaloupka, Josef
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 312 - 321
  • [48] Automatic transcription of continuous speech into syllable-like units for Indian languages
    Sarada, G. Lakshmi
    Lakshmi, A.
    Murthy, Hema A.
    Nagarajan, T.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2009, 34 (02) : 221 - 233
  • [49] ISI ASR System for the Low Resource Speech Recognition Challenge for Indian Languages
    Billa, Jayadev
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3207 - 3211
  • [50] IMPROVING THE PERFORMANCE OF TRANSFORMER BASED LOW RESOURCE SPEECH RECOGNITION FOR INDIAN LANGUAGES
    Shetty, Vishwas M.
    Mary, Metilda Sagaya N. J.
    Umesh, S.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8279 - 8283