Learning deep hierarchical and temporal recurrent neural networks with residual learning

被引:1
|
作者
Tehseen Zia
Assad Abbas
Usman Habib
Muhammad Sajid Khan
机构
[1] COMSATS University Islamabad,Department of Computer Science
[2] National University of Computer and Emerging Sciences (FAST-NUCES),College of Computer Science
[3] Sichuan University,undefined
关键词
Deep learning; Recurrent neural networks; Residual learning; Long-short term memory; Sequence modeling;
D O I
暂无
中图分类号
学科分类号
摘要
Learning both hierarchical and temporal dependencies can be crucial for recurrent neural networks (RNNs) to deeply understand sequences. To this end, a unified RNN framework is required that can ease the learning of both the deep hierarchical and temporal structures by allowing gradients to propagate back from both ends without being vanished. The residual learning (RL) has appeared as an effective and less-costly method to facilitate backward propagation of gradients. The significance of the RL is exclusively shown for learning deep hierarchical representations and temporal dependencies. Nevertheless, there is lack of efforts to unify these finding into a single framework for learning deep RNNs. In this study, we aim to prove that approximating identity mapping is crucial for optimizing both hierarchical and temporal structures. We propose a framework called hierarchical and temporal residual RNNs, to learn RNNs by approximating identity mappings across hierarchical and temporal structures. To validate the proposed method, we explore the efficacy of employing shortcut connections for training deep RNNs structures for sequence learning problems. Experiments are performed on Penn Treebank, Hutter Prize and IAM-OnDB datasets and results demonstrate the utility of the framework in terms of accuracy and computational complexity. We demonstrate that even for large datasets exploiting parameters for increasing network depth can gain computational benefits with reduced size of the RNN "state".
引用
收藏
页码:873 / 882
页数:9
相关论文
共 50 条
  • [21] Hierarchical deep-learning neural networks: finite elements and beyond
    Lei Zhang
    Lin Cheng
    Hengyang Li
    Jiaying Gao
    Cheng Yu
    Reno Domel
    Yang Yang
    Shaoqiang Tang
    Wing Kam Liu
    Computational Mechanics, 2021, 67 : 207 - 230
  • [22] Hierarchical deep-learning neural networks: finite elements and beyond
    Zhang, Lei
    Cheng, Lin
    Li, Hengyang
    Gao, Jiaying
    Yu, Cheng
    Domel, Reno
    Yang, Yang
    Tang, Shaoqiang
    Liu, Wing Kam
    COMPUTATIONAL MECHANICS, 2021, 67 (01) : 207 - 230
  • [23] MODULAR HIERARCHICAL FEATURE LEARNING WITH DEEP NEURAL NETWORKS FOR FACE VERIFICATION
    Chen, Xue
    Xiao, Baihua
    Wang, Chunheng
    Cai, Xinyuan
    Lv, Zhijian
    Shi, Yanqin
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3690 - 3694
  • [24] Development of residual learning in deep neural networks for computer vision: A survey
    Xu, Guoping
    Wang, Xiaxia
    Wu, Xinglong
    Leng, Xuesong
    Xu, Yongchao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 142
  • [25] High Learning Hierarchical Neural Networks
    Bobrowski, Leon
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT II, ICCCI 2024, 2024, 14811 : 295 - 304
  • [26] Supervised Brain Network Learning Based on Deep Recurrent Neural Networks
    Zhao, Shijie
    Cui, Yan
    Huang, Linwei
    Xie, Li
    Chen, Yaowu
    Han, Junwei
    Guo, Lei
    Zhang, Shu
    Liu, Tianming
    Lv, Jinglei
    IEEE ACCESS, 2020, 8 (08): : 69967 - 69978
  • [27] A Deep Learning Approach for Intrusion Detection Using Recurrent Neural Networks
    Yin, Chuanlong
    Zhu, Yuefei
    Fei, Jinlong
    He, Xinzheng
    IEEE ACCESS, 2017, 5 : 21954 - 21961
  • [28] Hierarchical learning recurrent neural networks for 3D motion synthesis
    Dongsheng Zhou
    Chongyang Guo
    Rui Liu
    Chao Che
    Deyun Yang
    Qiang Zhang
    Xiaopeng Wei
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 2255 - 2267
  • [29] Deep Diabetologist: Learning to Prescribe Hypoglycemic Medications with Recurrent Neural Networks
    Mei, Jing
    Zhao, Shiwan
    Jin, Feng
    Zhang, Lingxiao
    Liu, Haifeng
    Li, Xiang
    Xie, Guotong
    Li, Xuejun
    Xu, Meilin
    MEDINFO 2017: PRECISION HEALTHCARE THROUGH INFORMATICS, 2017, 245 : 1277 - 1277
  • [30] Hierarchical learning recurrent neural networks for 3D motion synthesis
    Zhou, Dongsheng
    Guo, Chongyang
    Liu, Rui
    Che, Chao
    Yang, Deyun
    Zhang, Qiang
    Wei, Xiaopeng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (08) : 2255 - 2267