Recurrent Neural Networks With Finite Memory Length

被引:7
|
作者
Long, Dingkun [1 ,2 ]
Zhang, Richong [1 ,2 ]
Mao, Yongyi [3 ]
机构
[1] Beihang Univ, BDBC, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, SKLSDE Lab, Beijing 100191, Peoples R China
[3] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON KN56N2, Canada
来源
IEEE ACCESS | 2019年 / 7卷
基金
中国国家自然科学基金;
关键词
Recurrent neural networks; memory length;
D O I
10.1109/ACCESS.2018.2890297
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The working of recurrent neural networks has not been well understood to date. The construction of such network models, hence, largely relies on heuristics and intuition. This paper formalizes the notion of "memory length" for recurrent networks and consequently discovers a generic family of recurrent networks having maximal memory lengths. Stacking such networks into multiple layers is shown to result in powerful models, including the gated convolutional networks. We show that the structure of such networks potentially enables a more principled design approach in practice and entails no gradient vanishing or exploding during back-propagation. We also present a new example in this family, termed attentive activation recurrent unit (AARU). Experimentally we demonstrate that the performance of this network family, particularly AARU, is superior to the LSTM and GRU networks.
引用
收藏
页码:12511 / 12520
页数:10
相关论文
共 50 条
  • [21] State-Frequency Memory Recurrent Neural Networks
    Hu, Hao
    Qi, Guo-Jun
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [22] Associative memory by recurrent neural networks with delay elements
    Miyoshi, S
    Yanai, HF
    Okada, M
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 70 - 74
  • [23] Associative memory by recurrent neural networks with delay elements
    Miyoshi, S
    Yanai, HF
    Okada, M
    NEURAL NETWORKS, 2004, 17 (01) : 55 - 63
  • [24] Neural Mechanisms of Working Memory Accuracy Revealed by Recurrent Neural Networks
    Xie, Yuanqi
    Liu, Yichen Henry
    Constantinidis, Christos
    Zhou, Xin
    FRONTIERS IN SYSTEMS NEUROSCIENCE, 2022, 16
  • [25] Learning various length dependence by dual recurrent neural networks
    Zhang, Chenpeng
    Li, Shuai
    Ye, Mao
    Zhu, Ce
    Li, Xue
    NEUROCOMPUTING, 2021, 466 : 1 - 15
  • [26] General transient length upper bound for recurrent neural networks
    Ho, AMCL
    DeWilde, P
    FROM NATURAL TO ARTIFICIAL NEURAL COMPUTATION, 1995, 930 : 202 - 208
  • [27] Learning beyond finite memory in recurrent networks of spiking neurons
    Tino, P
    Mills, AJS
    NEURAL COMPUTATION, 2006, 18 (03) : 591 - 613
  • [28] Learning beyond finite memory in recurrent networks of spiking neurons
    Tino, P
    Mills, A
    ADVANCES IN NATURAL COMPUTATION, PT 2, PROCEEDINGS, 2005, 3611 : 666 - 675
  • [29] Efficient finite word length determination for neural networks implementation
    Emir, Damergi
    Abdellatif, Benrabaa
    Ammar, Bouallegue
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 2, PROCEEDINGS, 2006, : 31 - +
  • [30] Identification of Finite State Automata With a Class of Recurrent Neural Networks
    Won, Sung Hwan
    Song, Iickho
    Lee, Sun Young
    Park, Cheol Hoon
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (09): : 1408 - 1421