RECURRENT NEURAL NETWORKS FOR SPEECH RECOGNITION

被引:0
|
作者
VERDEJO, JED
HERREROS, AP
LUNA, JCS
ORTUZAR, MCB
AYUSO, AR
机构
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we present some results from a net-like structure for Hidden Markov Models, applied to speech recognition. Net topology is a Recurrent Neural Network in which each temporary step is identified as a layer. Backpropagation techniques are used to train the RNN-HMM. Two types of training estimations are used: Maximum Likelihood and Competitive Training. Maximum Likelihood estimation algorithm using backpropagation provides the same updating equations as Baum-Welch algorithm used in HMM. Competitive Training is based on the probability of correct labelling the sequences from the Maximum Likelihood measures. Our results have shown that the best procedure is to train first with Maximum Likelihood estimation and then with Competitive Training reestimation.
引用
收藏
页码:361 / 369
页数:9
相关论文
共 50 条
  • [21] Segmental Recurrent Neural Networks for End-to-end Speech Recognition
    Lu, Liang
    Kong, Lingpeng
    Dyer, Chris
    Smith, Noah A.
    Renals, Steve
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 385 - 389
  • [22] CHARACTER-LEVEL INCREMENTAL SPEECH RECOGNITION WITH RECURRENT NEURAL NETWORKS
    Hwang, Kyuyeon
    Sung, Wonyong
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5335 - 5339
  • [23] Speech Emotion Recognition using Convolutional Recurrent Neural Networks and Spectrograms
    Qamhan, Mustafa A.
    Meftah, Ali H.
    Selouani, Sid-Ahmed
    Alotaibi, Yousef A.
    Zakariah, Mohammed
    Seddiq, Yasser Mohammad
    [J]. 2020 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2020,
  • [24] Continuous mandarin speech recognition using hierarchical recurrent neural networks
    Liao, YF
    Chen, WY
    Chen, SH
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3370 - 3373
  • [25] Temporal Feedback Convolutional Recurrent Neural Networks for Speech Command Recognition
    Kim, Taejun
    Nam, Juhan
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 437 - 441
  • [26] Towards End-to-End Speech Recognition with Recurrent Neural Networks
    Graves, Alex
    Jaitly, Navdeep
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 1764 - 1772
  • [27] SPEECH RECOGNITION WITH PREDICTION-ADAPTATION-CORRECTION RECURRENT NEURAL NETWORKS
    Zhang, Yu
    Yu, Dong
    Seltzer, Michael L.
    Droppo, Jasha
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5004 - 5008
  • [28] Dynamic programming prediction errors of recurrent neural fuzzy networks for speech recognition
    Juang, Chia-Feng
    Lai, Chun-Lung
    Tu, Chiu-Chuan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 6368 - 6374
  • [29] Recent advances in conversational speech recognition using conventional and recurrent neural networks
    Saon, G.
    Picheny, M.
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2017, 61 (4-5)
  • [30] Speech Emotion Recognition: Recurrent Neural Networks Compared to SVM and Linear Regression
    Kerkeni, Leila
    Serrestou, Youssef
    Mbarki, Mohamed
    Mahjoub, Mohamed Ali
    Raoof, Kosai
    Cleder, Catherine
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2017, PT I, 2017, 10613 : 451 - 453