RECURRENT NEURAL NETWORKS FOR SPEECH RECOGNITION

被引:0
|
作者
VERDEJO, JED
HERREROS, AP
LUNA, JCS
ORTUZAR, MCB
AYUSO, AR
机构
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we present some results from a net-like structure for Hidden Markov Models, applied to speech recognition. Net topology is a Recurrent Neural Network in which each temporary step is identified as a layer. Backpropagation techniques are used to train the RNN-HMM. Two types of training estimations are used: Maximum Likelihood and Competitive Training. Maximum Likelihood estimation algorithm using backpropagation provides the same updating equations as Baum-Welch algorithm used in HMM. Competitive Training is based on the probability of correct labelling the sequences from the Maximum Likelihood measures. Our results have shown that the best procedure is to train first with Maximum Likelihood estimation and then with Competitive Training reestimation.
引用
收藏
页码:361 / 369
页数:9
相关论文
共 50 条
  • [1] SPEECH RECOGNITION WITH HIERARCHICAL RECURRENT NEURAL NETWORKS
    CHEN, WY
    LIAO, YF
    CHEN, SH
    [J]. PATTERN RECOGNITION, 1995, 28 (06) : 795 - 805
  • [2] Visual speech recognition by recurrent neural networks
    Rabi, G
    Lu, SW
    [J]. JOURNAL OF ELECTRONIC IMAGING, 1998, 7 (01) : 61 - 69
  • [3] Visual speech recognition by recurrent neural networks
    Rabi, G
    Lu, SW
    [J]. 1997 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS I AND II: ENGINEERING INNOVATION: VOYAGE OF DISCOVERY, 1997, : 55 - 58
  • [4] Unfolded Recurrent Neural Networks for Speech Recognition
    Saon, George
    Soltau, Hagen
    Emami, Ahmad
    Picheny, Michael
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 343 - 347
  • [5] SPEECH RECOGNITION WITH DEEP RECURRENT NEURAL NETWORKS
    Graves, Alex
    Mohamed, Abdel-rahman
    Hinton, Geoffrey
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6645 - 6649
  • [6] RECURRENT DEEP NEURAL NETWORKS FOR ROBUST SPEECH RECOGNITION
    Weng, Chao
    Yu, Dong
    Watanabe, Shinji
    Juang, Biing-Hwang
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [7] Arabic speech recognition using recurrent neural networks
    El Choubassi, MM
    El Khoury, HE
    Alagha, CEJ
    Skaf, JA
    Al-Alaoui, MA
    [J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 543 - 547
  • [8] A new model of recurrent neural networks for speech recognition
    Xu, W
    Zhu, XY
    [J]. ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 1134 - 1137
  • [9] Chaotic recurrent neural networks and their application to speech recognition
    Ryeu, JK
    Chung, HS
    [J]. NEUROCOMPUTING, 1996, 13 (2-4) : 281 - 294
  • [10] Audio Visual Speech Recognition with Multimodal Recurrent Neural Networks
    Feng, Weijiang
    Guan, Naiyang
    Li, Yuan
    Zhang, Xiang
    Luo, Zhigang
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 681 - 688