Speech recognition in real world: Artificial neural networks and robustness

被引:1
|
作者
Kabre, H
机构
来源
WAVELET APPLICATIONS IV | 1997年 / 3078卷
关键词
robustness; artificial neural networks; environment; speech recognition and perception; artificial life;
D O I
10.1117/12.271715
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents an empirical modeling of the role of environment for Automatic Speech Recognition systems in real world, taken in the framework of an Artificial Life (AL) methodology. Environment is modeled as an active system which triggers the shift between the training and testing states of Automatic Speech Recognition Systems (ASRSs) which are built from Artificial Neural Networks. First an initial set of ASRSs are created to recognize speech under the constraints of an unpredictable acoustic world (defined by the different kind of noises present in it). The training of the ASRSs starts and goes on until ASRSs no longer decrease their error of classification in the current acoustic environment because of noises. This moment is detected by the reactive environment and the structure of the ASRSs are changed. The simulation performed with mathematical models of real rooms as environment showed that our system could be used as a prediction tool of ASRSs performances for the study of any speech perceiver based on Artificial Neural Networks or on Hidden Markov Models. Moreover it is shown that on a task of French digits recognition, the new method performs better than conventional ones.
引用
收藏
页码:175 / 181
页数:7
相关论文
共 50 条
  • [11] Speech Stress Recognition by Sound Analysis and Multilayer Artificial Neural Networks
    Balabanova, Ivelina
    Kostadinova, Stela
    Markova, Valentina
    Georgiev, Georgi
    [J]. PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON BIOMEDICAL INNOVATIONS AND APPLICATIONS (BIA 2020), 2020, : 77 - 80
  • [12] IMPROVING ROBUSTNESS OF DEEP NEURAL NETWORKS VIA SPECTRAL MASKING FOR AUTOMATIC SPEECH RECOGNITION
    Li, Bo
    Sim, Khe Chai
    [J]. 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 279 - 284
  • [13] On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks
    Lakomkin, Egor
    Zamani, Mohammad Ali
    Weber, Cornelius
    Magg, Sven
    Wermter, Stefan
    [J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 854 - 860
  • [14] Artificial Neural Networks Based Image Processing & Pattern Recognition: From Concepts to Real-World Applications
    Madani, Kurosh
    [J]. 2008 FIRST INTERNATIONAL WORKSHOPS ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2008, : 19 - 27
  • [15] Defog Artificial Intelligence Glasses: Neural Networks for the Imperfect Real World
    Rojas, Nilton
    [J]. THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23755 - 23756
  • [16] Industrial and real world applications of artificial neural networks - Illusion or reality?
    Madani, Kurosh
    [J]. INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS I, 2006, : 11 - 26
  • [17] Neural networks used for speech recognition
    El-Ramly, SH
    Abdel-Kader, NS
    El-Adawi, R
    [J]. 2002 IEEE PROCEEDINGS OF THE NINETEENTH NATIONAL RADIO SCIENCE CONFERENCE, VOLS 1 AND 2, 2002, : 200 - 207
  • [18] Review of Neural Networks for Speech Recognition
    Lippmann, Richard P.
    [J]. NEURAL COMPUTATION, 1989, 1 (01) : 1 - 38
  • [19] Binary neural networks for speech recognition
    Qian, Yan-min
    Xiang, Xu
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2019, 20 (05) : 701 - 715
  • [20] Residual Neural Networks for Speech Recognition
    Vydana, Hari Krishna
    Vuppala, Anil Kumar
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 543 - 547