Deep Long Short-Term Memory Networks for Speech Recognition

被引:0
|
作者
Chien, Jen-Tzung [1 ]
Misbullah, Alim [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu, Taiwan
关键词
speech recognition; acoustic modeling; hybrid neural network; long short-term memory;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speech recognition has been significantly improved by applying acoustic models based on deep neural network which could be realized as the feedforward NN (FNN) or the recurrent NN (RNN). In general, FNN is feasible to project the observations onto a deep invariant feature space while RNN is beneficial to capture the temporal information in a sequential data for speech recognition. RNN based on long short-term memory (LSTM) is capable of storing inputs over a long time period and thus exploiting a self-learned mechanism for long-range temporal context. Considering the complimentary FNN and RNN in their modeling capabilities, this paper presents a deep model which is constructed by stacking LSTM and FNN. Through the cascade of LSTM cells and fully-connected feedforward units, we explore the temporal patterns and summarize the long history of previous inputs in a deep learning machine. The experiments on 3rd CHiME challenge and Aurora-4 show that the stacks of hybrid model with FNN post-processor outperform stand-alone FNN and LSTM and the other hybrid models for robust speech recognition.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Detecting Overlapping Speech with Long Short-Term Memory Recurrent Neural Networks
    Geiger, Juergen T.
    Eyben, Florian
    Schuller, Bjoern
    Rigoll, Gerhard
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1667 - 1671
  • [22] Robust Speech Recognition using Long Short-Term Memory Recurrent Neural Networks for Hybrid Acoustic Modelling
    Geiger, Juergen T.
    Zhang, Zixing
    Weninger, Felix
    Schuller, Bjoern
    Rigoll, Gerhard
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 631 - 635
  • [23] Deep neural learning techniques with long short-term memory for gesture recognition
    Deepak Kumar Jain
    Aniket Mahanti
    Pourya Shamsolmoali
    Ramachandran Manikandan
    [J]. Neural Computing and Applications, 2020, 32 : 16073 - 16089
  • [24] ROBOT TASK RECOGNITION USING DEEP CONVOLUTIONAL LONG SHORT-TERM MEMORY
    Midhun, M. S.
    Kurian, James
    [J]. MECHATRONIC SYSTEMS AND CONTROL, 2023, 51 (02): : 106 - 113
  • [25] Deep neural learning techniques with long short-term memory for gesture recognition
    Jain, Deepak Kumar
    Mahanti, Aniket
    Shamsolmoali, Pourya
    Manikandan, Ramachandran
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (20): : 16073 - 16089
  • [26] Applying Deep Bidirectional Long Short-Term Memory to Mandarin Tone Recognition
    Yang, Longfei
    Xie, Yanlu
    Zhang, Jinsong
    [J]. PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 1124 - 1127
  • [27] Recognition of Spontaneous Conversational Speech using Long Short-Term Memory Phoneme Predictions
    Woellmer, Martin
    Eyben, Florian
    Schuller, Bjoern
    Rigoll, Gerhard
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1946 - 1949
  • [28] Speech Dereverberation Using Long Short-Term Memory
    Mimura, Masato
    Sakai, Shinsuke
    Kawahara, Tatsuya
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2435 - 2439
  • [29] A Speech Recognition Method Using Long Short-Term Memory Network in Low Resources
    Shu, Fan
    Qu, Dan
    Zhang, Wenlin
    Zhou, Lili
    Guo, Wu
    [J]. Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2017, 51 (10): : 120 - 127
  • [30] Long Short-Term Memory Based Language Model for Indonesian Spontaneous Speech Recognition
    Putri, Fanda Yuliana
    Lestari, Dessi Puji
    Widyantoro, Dwi Hendratmo
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2018, : 44 - 48