DEEP RECURRENT REGULARIZATION NEURAL NETWORK FOR SPEECH RECOGNITION

Cited by: 0

Authors:

Chien, Jen-Tzung [1]

Lu, Tsai-Wei [1]

Affiliations:

[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 30010, Taiwan

Source:

2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015

Keywords:

Recurrent neural network; model regularization; deep learning; acoustic model;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

This paper presents a deep recurrent regularization neural network (DRRNN) for speech recognition. The idea is to build a regularization neural network acoustic model by applying hybrid Tikhonov and weight-decay regularization, which compensates for variations due to the input speech as well as the model parameters, in the restricted Boltzmann machine used as a pre-training stage for feature learning and structural modeling. In addition, a new backpropagation through time (BPTT) algorithm is developed by extending truncated minibatch training for recurrent neural networks, so that minibatch BPTT is performed not only in the recurrent layer but also in the feedforward layer. The DRRNN acoustic model is accordingly established to capture temporal correlation in a regularization neural network. Experimental results on the RM and Aurora4 tasks show the effectiveness and robustness of DRRNN for speech recognition.
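Two of the ideas named in the abstract, weight-decay regularization and truncated minibatch BPTT, can be illustrated with a minimal NumPy sketch. It assumes a single-layer Elman-style recurrent network with a squared-error loss, which is a generic textbook setup and not the DRRNN itself: the abstract gives no details of the hybrid Tikhonov term, the RBM pre-training, or the extension of minibatch BPTT to the feedforward layer, so none of those are reproduced here. All names (`init_params`, `chunk_loss_and_grads`, `train_truncated_bptt`, `make_toy_data`) and all hyperparameter values are hypothetical.

```python
import numpy as np

def init_params(n_in, n_hid, n_out, seed=0):
    """Small random weights for a toy Elman RNN (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    return {
        "Wx": rng.normal(0, 0.1, (n_in, n_hid)),   # input -> hidden
        "Wh": rng.normal(0, 0.1, (n_hid, n_hid)),  # hidden -> hidden
        "Wo": rng.normal(0, 0.1, (n_hid, n_out)),  # hidden -> output
    }

def chunk_loss_and_grads(p, xs, ts, h0, lam):
    """Loss and gradients for one chunk of k time steps.

    xs: (k, B, n_in), ts: (k, B, n_out), h0: (B, n_hid).
    Gradients are NOT propagated into h0, which is the truncation.
    """
    k = xs.shape[0]
    hs, errs, loss = [h0], [], 0.0
    for t in range(k):  # forward pass over the chunk
        hs.append(np.tanh(xs[t] @ p["Wx"] + hs[-1] @ p["Wh"]))
        errs.append(hs[-1] @ p["Wo"] - ts[t])
        loss += 0.5 * np.sum(errs[-1] ** 2)
    # Weight decay: add 0.5 * lam * ||W||^2 for every weight matrix.
    loss += 0.5 * lam * sum(np.sum(w ** 2) for w in p.values())
    grads = {name: lam * w for name, w in p.items()}
    dh_next = np.zeros_like(h0)
    for t in reversed(range(k)):  # backward pass, limited to this chunk
        grads["Wo"] += hs[t + 1].T @ errs[t]
        dh = errs[t] @ p["Wo"].T + dh_next
        da = dh * (1.0 - hs[t + 1] ** 2)  # derivative of tanh
        grads["Wx"] += xs[t].T @ da
        grads["Wh"] += hs[t].T @ da
        dh_next = da @ p["Wh"].T
    return loss, grads, hs[-1]

def train_truncated_bptt(p, xs, ts, k=4, lam=1e-3, lr=0.01, epochs=100):
    """Plain gradient descent over chunks of length k; updates p in place."""
    T, B, _ = xs.shape
    history = []
    for _ in range(epochs):
        h = np.zeros((B, p["Wh"].shape[0]))
        total = 0.0
        for start in range(0, T, k):
            # The hidden state is carried forward as a value only.
            loss, grads, h = chunk_loss_and_grads(
                p, xs[start:start + k], ts[start:start + k], h, lam)
            for name in p:
                p[name] -= lr * grads[name]
            total += loss
        history.append(total)
    return history

def make_toy_data(T=12, B=3, n_in=2, seed=1):
    """Synthetic minibatch: target mixes the current and previous input."""
    rng = np.random.default_rng(seed)
    xs = rng.normal(size=(T, B, n_in))
    prev = np.concatenate([np.zeros((1, B, 1)), xs[:-1, :, :1]], axis=0)
    return xs, 0.5 * xs[:, :, :1] + 0.5 * prev
```

A possible use is `xs, ts = make_toy_data()` followed by `train_truncated_bptt(init_params(2, 8, 1), xs, ts)`, which returns the per-epoch training loss. Carrying the hidden state across chunk boundaries without backpropagating through it keeps the cost of each update bounded by the chunk length `k`, at the price of ignoring dependencies longer than `k` steps in the gradient.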

Pages: 4560-4564

Page count: 5

