A comprehensive survey on automatic speech recognition using neural networks

Cited by: 20
Authors
Dhanjal, Amandeep Singh [1]
Singh, Williamjeet [2]
Affiliations
[1] Punjabi Univ, Dept Comp Sci, Rajpura Rd, Patiala 147001, Punjab, India
[2] Punjabi Univ, Dept Comp Sci & Engn, Rajpura Rd, Patiala 147001, Punjab, India
Keywords
Speech recognition; Dataset; Tools; Neural network; Deep learning; ARABIC SPEECH; SYSTEM; NOISE; HMM; ARCHITECTURES; SEGMENTATION; PRIMER;
DOI
10.1007/s11042-023-16438-y
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Automatic Speech Recognition has developed continuously and demonstrated enormous potential in human interaction and communication systems. Achieving high accuracy remains challenging because several factors, such as different dialects, spontaneous speech, speaker enrolment, computation power, dataset, and noisy environments, decrease the performance of a speech recognition system. These challenges have motivated researchers to make innovative contributions to the development of robust speech recognition systems. The study presents a systematic analysis of state-of-the-art research in this field during 2015-2021. Its prime focus is to highlight the neural network-based speech recognition techniques, datasets, toolkits, and evaluation metrics used over those seven years. It also synthesizes the evidence from past studies to provide empirical solutions for accuracy improvement. The study summarizes the current status of speech recognition systems using neural networks and offers a brief introduction for new researchers.
Pages: 23367 - 23412
Page count: 46
Related papers
50 in total
  • [41] AUTOMATIC SPEECH RECOGNITION USING HIDDEN CONDITIONAL NEURAL FIELDS
    Fujii, Yasuhisa
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5036 - 5039
  • [42] Emotional Speech Recognition Using Deep Neural Networks
    Trinh Van, Loan
    Dao Thi Le, Thuy
    Le Xuan, Thanh
    Castelli, Eric
    SENSORS, 2022, 22 (04)
  • [43] Recognition and Processing of Speech Signals Using Neural Networks
    O'Shaughnessy, Douglas
    Circuits, Systems, and Signal Processing, 2019, 38 : 3454 - 3481
  • [44] Speech emotion recognition using spiking neural networks
    Buscicchio, Cosimo A.
    Gorecki, Przemyslaw
    Caponetti, Laura
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 38 - 46
  • [45] Arabic speech recognition using recurrent neural networks
    El Choubassi, MM
    El Khoury, HE
    Alagha, CEJ
    Skaf, JA
    Al-Alaoui, MA
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 543 - 547
  • [46] Using Neural Networks for a Discriminant Speech Recognition System
    Schiopu, Daniela
    Oprea, Mihaela
    2014 INTERNATIONAL CONFERENCE ON DEVELOPMENT AND APPLICATION SYSTEMS (DAS), 2014, : 165 - 169
  • [47] Using neural networks and LPCC to improve speech recognition
    Zbancioc, M
    Costin, M
    SCS 2003: INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2003, : 445 - 448
  • [48] Remarks on emotional speech recognition using neural networks
    Takahashi, Kazuhiko
    Nakatsu, Ryohei
    Nippon Kikai Gakkai Ronbunshu, C Hen/Transactions of the Japan Society of Mechanical Engineers, Part C, 2002, 68 (08) : 2339 - 2345
  • [49] Isolated speech recognition using artificial neural networks
    Polur, PD
    Zhou, RB
    Yang, J
    Adnani, F
    Hobson, RS
    PROCEEDINGS OF THE 23RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: BUILDING NEW BRIDGES AT THE FRONTIERS OF ENGINEERING AND MEDICINE, 2001, 23 : 1731 - 1734
  • [50] Recognition and Processing of Speech Signals Using Neural Networks
    O'Shaughnessy, Douglas
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (08) : 3454 - 3481