A comprehensive survey on automatic speech recognition using neural networks

被引:20
|
作者
Dhanjal, Amandeep Singh [1 ]
Singh, Williamjeet [2 ]
机构
[1] Punjabi Univ, Dept Comp Sci, Rajpura Rd, Patiala 147001, Punjab, India
[2] Punjabi Univ, Dept Comp Sci & Engn, Rajpura Rd, Patiala 147001, Punjab, India
关键词
Speech recognition; Dataset; Tools; Neural network; Deep learning; ARABIC SPEECH; SYSTEM; NOISE; HMM; ARCHITECTURES; SEGMENTATION; PRIMER;
D O I
10.1007/s11042-023-16438-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The continuous development in Automatic Speech Recognition has grown and demonstrated its enormous potential in Human Interaction Communication systems. It is quite a challenging task to achieve high accuracy due to several parameters such as different dialects, spontaneous speech, speaker's enrolment, computation power, dataset, and noisy environment that decrease the performance of the speech recognition system. It has motivated various researchers to make innovative contributions to the development of a robust speech recognition system. The study presents a systematic analysis of current state-of-the-art research work done in this field during 2015-2021. The prime focus of the study is to highlight the neural network-based speech recognition techniques, datasets, toolkits, and evaluation metrics utilized in the past seven years. It also synthesizes the evidence from past studies to provide empirical solutions for accuracy improvement. This study highlights the current status of speech recognition systems using neural networks and provides a brief knowledge to the new researchers.
引用
收藏
页码:23367 / 23412
页数:46
相关论文
共 50 条
  • [1] A comprehensive survey on automatic speech recognition using neural networks
    Amandeep Singh Dhanjal
    Williamjeet Singh
    Multimedia Tools and Applications, 2024, 83 : 23367 - 23412
  • [2] Automatic Recognition of Kazakh Speech Using Deep Neural Networks
    Mamyrbayev, Orken
    Turdalyuly, Mussa
    Mekebayev, Nurbapa
    Alimhan, Keylan
    Kydyrbekova, Aizat
    Turdalykyzy, Tolganay
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT II, 2019, 11432 : 465 - 474
  • [3] Automatic Speech Recognition Based on Neural Networks
    Schlueter, Ralf
    Doetsch, Patrick
    Golik, Pavel
    Kitza, Markus
    Menne, Tobias
    Irie, Kazuki
    Tueske, Zoltan
    Zeyer, Albert
    SPEECH AND COMPUTER, 2016, 9811 : 3 - 17
  • [4] Automatic Naturalness Recognition from Acted Speech Using Neural Networks
    Atmaja, Bagus Tris
    Sasou, Akira
    Akagi, Masato
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 731 - 736
  • [5] Automatic speech recognition of Portuguese phonemes using neural networks ensemble
    Nedjah, Nadia
    Bonilla, Alejandra D.
    Mourelle, Luiza de Macedo
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [6] Automatic Speech Recognition with Deep Neural Networks for Impaired Speech
    Espana-Bonet, Cristina
    Fonollosa, Jose A. R.
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 97 - 107
  • [7] ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages
    Singh, Amitoj
    Kadyan, Virender
    Kumar, Munish
    Bassan, Nancy
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (05) : 3673 - 3704
  • [8] ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages
    Amitoj Singh
    Virender Kadyan
    Munish Kumar
    Nancy Bassan
    Artificial Intelligence Review, 2020, 53 : 3673 - 3704
  • [9] DYNAMIC SPARSITY NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION
    Wu, Zhaofeng
    Zhao, Ding
    Liang, Qiao
    Yu, Jiahui
    Gulati, Anmol
    Pang, Ruoming
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6014 - 6018
  • [10] Speech recognition using neural networks
    Khan, SU
    Sharma, G
    Rao, PRK
    PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY 2000, VOLS 1 AND 2, 2000, : 432 - 437