ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages

Cited by: 27
Authors
Singh, Amitoj [1]
Kadyan, Virender [2]
Kumar, Munish [1]
Bassan, Nancy [3]
Affiliations
[1] Maharaja Ranjit Singh Punjab Tech Univ, Dept Computat Sci, Bathinda, Punjab, India
[2] Chitkara Univ, Inst Engn & Technol, Dept Comp Sci & Engn, Rajpura, Punjab, India
[3] Baba Farid Coll Engn & Technol, Dept Mech Engn, Bathinda, Punjab, India
Keywords
Automatic speech recognition; Indian languages; Feature extraction techniques; Classification techniques; Speech corpus; EMOTION RECOGNITION; SPEAKER VERIFICATION; WORD RECOGNITION; SYSTEM; FEATURES; CLASSIFICATION; MODEL; PERFORMANCE; ACCURACY; DATABASE;
DOI
10.1007/s10462-019-09775-8
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
India is a land of linguistic diversity, with 22 major languages and more than 720 dialects, written in 13 different scripts. Of these 22, Hindi, Bengali and Punjabi rank as the 3rd, 7th and 10th most spoken languages in the world. Except for Hindi, where significant research is under way, the other two major languages and the remaining Indian languages do not yet have fully developed automatic speech recognition systems. The main aim of this paper is to provide a systematic survey of the existing literature on automatic speech recognition (i.e. speech to text) for Indian languages. The survey analyses the possible opportunities, challenges, techniques and methods, and locates, appraises and synthesizes the evidence from studies to provide empirical answers to the scientific questions. It covers relevant research articles published from 2000 to 2018. Its purpose is to sum up the best available research on automatic speech recognition of Indian languages by synthesizing the results of several studies.
Pages: 3673-3704
Number of pages: 32