Learning deep hierarchical and temporal recurrent neural networks with residual learning

Cited by: 1
Authors
Tehseen Zia
Assad Abbas
Usman Habib
Muhammad Sajid Khan
Affiliations
[1] COMSATS University Islamabad, Department of Computer Science
[2] National University of Computer and Emerging Sciences (FAST-NUCES), College of Computer Science
[3] Sichuan University
Keywords
Deep learning; Recurrent neural networks; Residual learning; Long-short term memory; Sequence modeling;
DOI
Not available
Abstract
Learning both hierarchical and temporal dependencies can be crucial for recurrent neural networks (RNNs) to deeply understand sequences. To this end, a unified RNN framework is required that eases the learning of both deep hierarchical and temporal structures by allowing gradients to propagate back along both dimensions without vanishing. Residual learning (RL) has emerged as an effective and inexpensive method to facilitate backward propagation of gradients. Its significance has been shown separately for learning deep hierarchical representations and for learning temporal dependencies. Nevertheless, there has been little effort to unify these findings into a single framework for learning deep RNNs. In this study, we aim to show that approximating identity mappings is crucial for optimizing both hierarchical and temporal structures. We propose a framework, called hierarchical and temporal residual RNNs, that learns RNNs by approximating identity mappings across hierarchical and temporal structures. To validate the proposed method, we explore the efficacy of employing shortcut connections for training deep RNN structures on sequence learning problems. Experiments are performed on the Penn Treebank, Hutter Prize and IAM-OnDB datasets, and the results demonstrate the utility of the framework in terms of accuracy and computational complexity. We demonstrate that, even for large datasets, spending parameters on increasing network depth can yield computational benefits with a reduced size of the RNN "state".
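The idea of identity shortcuts along both depth and time can be illustrated with a minimal sketch. This is not the authors' formulation: the function name `residual_rnn_forward`, the plain tanh cell, and the choice to add both the layer input and the previous state directly to the cell output are assumptions made only for illustration, and all layers are assumed to share one width so the additions are well defined.

```python
import math


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def residual_rnn_forward(inputs, layers):
    """Forward pass of a stacked RNN with two identity shortcuts per cell.

    inputs: list of T vectors, each of size d.
    layers: list of (W_x, W_h, b) tuples with d x d matrices and a size-d bias.
    Returns the top layer's sequence of T hidden vectors.
    Hypothetical illustration, not the published architecture.
    """
    d = len(inputs[0])
    seq = inputs
    for W_x, W_h, b in layers:
        h_prev = [0.0] * d  # initial state of this layer
        out = []
        for x_t in seq:
            a = matvec(W_x, x_t)
            r = matvec(W_h, h_prev)
            # Learned residual function of the layer input and previous state.
            f = [math.tanh(ai + ri + bi) for ai, ri, bi in zip(a, r, b)]
            # Identity shortcuts: x_t comes from the layer below (hierarchy),
            # h_prev comes from the previous time step (time).
            h_t = [xi + hi + fi for xi, hi, fi in zip(x_t, h_prev, f)]
            out.append(h_t)
            h_prev = h_t
        seq = out  # this layer's outputs feed the next layer
    return seq
```

With all weights and biases set to zero the residual term is zero, so each layer reduces to a running sum of its inputs: the signal passes through both shortcuts unchanged, and the cell only has to learn a correction on top of the identity paths. A practical model would likely need gating or normalization to keep the summed state bounded, which this sketch omits.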
Pages: 873-882
Number of pages: 9
Related papers
50 in total
  • [1] Learning deep hierarchical and temporal recurrent neural networks with residual learning
    Zia, Tehseen
    Abbas, Assad
    Habib, Usman
    Khan, Muhammad Sajid
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (04) : 873 - 882
  • [2] Deep Residual Learning in Spiking Neural Networks
    Fang, Wei
    Yu, Zhaofei
    Chen, Yanqi
    Huang, Tiejun
    Masquelier, Timothee
    Tian, Yonghong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] Residual Recurrent Neural Networks for Learning Sequential Representations
    Yue, Boxuan
    Fu, Junwei
    Liang, Jun
    INFORMATION, 2018, 9 (03)
  • [4] Temporal pattern learning in noisy recurrent neural networks
    Das, S
    Olurotimi, O
    ISCAS 96: 1996 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - CIRCUITS AND SYSTEMS CONNECTING THE WORLD, VOL 3, 1996, : 598 - 600
  • [5] Sequential Learning Network With Residual Blocks: Incorporating Temporal Convolutional Information Into Recurrent Neural Networks
    Shan, Dongjing
    Yao, Kun
    Zhang, Xiongwei
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (01) : 396 - 401
  • [6] Learning to Learn and Compositionality with Deep Recurrent Neural Networks
    de Freitas, Nando
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 3 - 3
  • [7] Learning Contextual Dependence With Convolutional Hierarchical Recurrent Neural Networks
    Zuo, Zhen
    Shuai, Bing
    Wang, Gang
    Liu, Xiao
    Wang, Xingxing
    Wang, Bing
    Chen, Yushi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 2983 - 2996
  • [8] Residual Echo State Networks: Residual recurrent neural networks with stable dynamics and fast learning
    Ceni, Andrea
    Gallicchio, Claudio
    NEUROCOMPUTING, 2024, 597
  • [9] Advancing Spiking Neural Networks Toward Deep Residual Learning
    Hu, Yifan
    Deng, Lei
    Wu, Yujie
    Yao, Man
    Li, Guoqi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 2353 - 2367
  • [10] Residual learning of deep convolutional neural networks for image denoising
    Shan, Chuanhui
    Guo, Xirong
    Ou, Jun
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (02) : 2809 - 2818