Speech recognition in real world: Artificial neural networks and robustness

被引:1
|
作者
Kabre, H
机构
来源
WAVELET APPLICATIONS IV | 1997年 / 3078卷
关键词
robustness; artificial neural networks; environment; speech recognition and perception; artificial life;
D O I
10.1117/12.271715
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents an empirical modeling of the role of environment for Automatic Speech Recognition systems in real world, taken in the framework of an Artificial Life (AL) methodology. Environment is modeled as an active system which triggers the shift between the training and testing states of Automatic Speech Recognition Systems (ASRSs) which are built from Artificial Neural Networks. First an initial set of ASRSs are created to recognize speech under the constraints of an unpredictable acoustic world (defined by the different kind of noises present in it). The training of the ASRSs starts and goes on until ASRSs no longer decrease their error of classification in the current acoustic environment because of noises. This moment is detected by the reactive environment and the structure of the ASRSs are changed. The simulation performed with mathematical models of real rooms as environment showed that our system could be used as a prediction tool of ASRSs performances for the study of any speech perceiver based on Artificial Neural Networks or on Hidden Markov Models. Moreover it is shown that on a task of French digits recognition, the new method performs better than conventional ones.
引用
收藏
页码:175 / 181
页数:7
相关论文
共 50 条
  • [1] Speech recognition with artificial neural networks
    Dede, Guelin
    Sazli, Murat Huesnue
    [J]. DIGITAL SIGNAL PROCESSING, 2010, 20 (03) : 763 - 768
  • [2] Speech recognition using artificial neural networks
    Lim, CP
    Woo, SC
    Loh, AS
    Osman, R
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL I, 2000, : 419 - 423
  • [3] Improving environmental robustness of speech recognition using neural networks
    Sirigos, J
    Fakotakis, N
    Kokkinakis, G
    [J]. DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2: SPECIAL SESSIONS, 1997, : 575 - 578
  • [4] Isolated speech recognition using artificial neural networks
    Polur, PD
    Zhou, RB
    Yang, J
    Adnani, F
    Hobson, RS
    [J]. PROCEEDINGS OF THE 23RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: BUILDING NEW BRIDGES AT THE FRONTIERS OF ENGINEERING AND MEDICINE, 2001, 23 : 1731 - 1734
  • [5] An Effective Speech Emotion Recognition Using Artificial Neural Networks
    Anoop, V.
    Rao, P. V.
    Aruna, S.
    [J]. INTERNATIONAL PROCEEDINGS ON ADVANCES IN SOFT COMPUTING, INTELLIGENT SYSTEMS AND APPLICATIONS, ASISA 2016, 2018, 628 : 393 - 401
  • [6] On the similarities of representations in artificial and brain neural networks for speech recognition
    Wingfield, Cai
    Zhang, Chao
    Devereux, Barry
    Fonteneau, Elisabeth
    Thwaites, Andrew
    Liu, Xunying
    Woodland, Phil
    Marslen-Wilson, William
    Su, Li
    [J]. FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2022, 16
  • [7] EXPERIMENTS IN DYSARTHRIC SPEECH RECOGNITION USING ARTIFICIAL NEURAL NETWORKS
    JAYARAM, G
    ABDELHAMIED, K
    [J]. JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 1995, 32 (02): : 162 - 169
  • [8] Emotion Recognition from Speech using Artificial Neural Networks and. Recurrent Neural Networks
    Sharma, Shambhavi
    [J]. 2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 153 - 158
  • [9] Utilizing latency for object recognition in real and artificial neural networks
    Worgotter, F
    Opara, R
    Funke, K
    Eysel, U
    [J]. NEUROREPORT, 1996, 7 (03) : 741 - 744
  • [10] Fast speaker adaptation of artificial neural networks for automatic speech recognition
    Dupont, S
    Cheboub, L
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1795 - 1798