ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages

Cited by: 27
Authors
Singh, Amitoj [1 ]
Kadyan, Virender [2 ]
Kumar, Munish [1 ]
Bassan, Nancy [3 ]
Affiliations
[1] Maharaja Ranjit Singh Punjab Tech Univ, Dept Computat Sci, Bathinda, Punjab, India
[2] Chitkara Univ, Inst Engn & Technol, Dept Comp Sci & Engn, Rajpura, Punjab, India
[3] Baba Farid Coll Engn & Technol, Dept Mech Engn, Bathinda, Punjab, India
Keywords
Automatic speech recognition; Indian languages; Feature extraction techniques; Classification techniques; Speech corpus; EMOTION RECOGNITION; SPEAKER VERIFICATION; WORD RECOGNITION; SYSTEM; FEATURES; CLASSIFICATION; MODEL; PERFORMANCE; ACCURACY; DATABASE;
DOI
10.1007/s10462-019-09775-8
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
India is a land of linguistic diversity, with 22 major languages and more than 720 dialects, written in 13 different scripts. Of these 22, Hindi, Bengali and Punjabi rank as the 3rd, 7th and 10th most spoken languages in the world. Except for Hindi, for which significant research is under way, the other two major languages and the remaining Indian languages do not yet have fully developed Automatic Speech Recognition systems. The main aim of this paper is to provide a systematic survey of the existing literature on automatic speech recognition (i.e. speech to text) for Indian languages. The survey analyses the opportunities, challenges, techniques and methods, and locates, appraises and synthesizes the evidence from studies to provide empirical answers to the scientific questions. It covers relevant research articles published from 2000 to 2018. The purpose of this systematic survey is to sum up the best available research on automatic speech recognition of Indian languages by synthesizing the results of several studies.
Pages: 3673-3704
Page count: 32