Recurrent Neural Networks With Finite Memory Length

被引:7
|
作者
Long, Dingkun [1 ,2 ]
Zhang, Richong [1 ,2 ]
Mao, Yongyi [3 ]
机构
[1] Beihang Univ, BDBC, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, SKLSDE Lab, Beijing 100191, Peoples R China
[3] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON KN56N2, Canada
来源
IEEE ACCESS | 2019年 / 7卷
基金
中国国家自然科学基金;
关键词
Recurrent neural networks; memory length;
D O I
10.1109/ACCESS.2018.2890297
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The working of recurrent neural networks has not been well understood to date. The construction of such network models, hence, largely relies on heuristics and intuition. This paper formalizes the notion of "memory length" for recurrent networks and consequently discovers a generic family of recurrent networks having maximal memory lengths. Stacking such networks into multiple layers is shown to result in powerful models, including the gated convolutional networks. We show that the structure of such networks potentially enables a more principled design approach in practice and entails no gradient vanishing or exploding during back-propagation. We also present a new example in this family, termed attentive activation recurrent unit (AARU). Experimentally we demonstrate that the performance of this network family, particularly AARU, is superior to the LSTM and GRU networks.
引用
收藏
页码:12511 / 12520
页数:10
相关论文
共 50 条
  • [31] Representation and identification of finite state automata by recurrent neural networks
    Kuroe, Y
    NEURAL INFORMATION PROCESSING, 2004, 3316 : 261 - 268
  • [32] Learning Finite State Models from Recurrent Neural Networks
    Muskardin, Edi
    Aichernig, Bernhard K.
    Pill, Ingo
    Tappler, Martin
    INTEGRATED FORMAL METHODS, IFM 2022, 2022, 13274 : 229 - 248
  • [33] Finite-size effects in separable recurrent neural networks
    Castellanos, A
    Coolen, ACC
    Viana, L
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1998, 31 (31): : 6615 - 6634
  • [34] Excitable networks for finite state computation with continuous time recurrent neural networks
    Ashwin, Peter
    Postlethwaite, Claire
    BIOLOGICAL CYBERNETICS, 2021, 115 (05) : 519 - 538
  • [35] Excitable networks for finite state computation with continuous time recurrent neural networks
    Peter Ashwin
    Claire Postlethwaite
    Biological Cybernetics, 2021, 115 : 519 - 538
  • [36] FARM: A Flexible Accelerator for Recurrent and Memory Augmented Neural Networks
    Nagadastagiri Challapalle
    Sahithi Rampalli
    Nicholas Jao
    Akshaykrishna Ramanathan
    John Sampson
    Vijaykrishnan Narayanan
    Journal of Signal Processing Systems, 2020, 92 : 1247 - 1261
  • [37] A Theory of Sequence Indexing and Working Memory in Recurrent Neural Networks
    Frady, E. Paxon
    Kleyko, Denis
    Sommer, Friedrich T.
    NEURAL COMPUTATION, 2018, 30 (06) : 1449 - 1513
  • [38] Bio-inspired memory generation by recurrent neural networks
    Bedia, Manuel G.
    Corchado, Juan M.
    Castillo, Luis F.
    COMPUTATIONAL AND AMBIENT INTELLIGENCE, 2007, 4507 : 55 - +
  • [39] Gating Recurrent Enhanced Memory Neural Networks on Language Identification
    Geng, Wang
    Zhao, Yuanyan
    Wang, Wenfu
    Cai, Xinyuan
    Xu, Bo
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3280 - 3284
  • [40] LightRNN: Memory and Computation-Efficient Recurrent Neural Networks
    Li, Xiang
    Qin, Tao
    Yang, Jian
    Liu, Tie-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29