Improved Generalization in Recurrent Neural Networks Using the Tangent Plane Algorithm

被引:0
|
作者
May, P. [1 ]
Zhou, E. [2 ]
Lee, C. W. [2 ]
机构
[1] K Coll, Brook St, Tonbridge, Kent, England
[2] Univ Bolton, Acad Grp, Appl Engn & Sci, Bolton, England
关键词
real time recurrent learning; tangent plane; generalization; weight elimination; temporal pattern recognition; non-linear process control;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The tangent plane algorithm for real time recurrent learning (TPA-RTRL) is an effective online training method for fully recurrent neural networks. TPA-RTRL uses the method of approaching tangent planes to accelerate the learning processes. Compared to the original gradient descent real time recurrent learning algorithm (GD-RTRL) it is very fast and avoids problems like local minima of the search space. However, the TPA-RTRL algorithm actively encourages the formation of large weight values that can be harmful to generalization. This paper presents a new TPA-RTRL variant that encourages small weight values to decay to zero by using a weight elimination procedure built into the geometry of the algorithm. Experimental results show that the new algorithm gives good generalization over a range of network sizes whilst retaining the fast convergence speed of the TPA-RTRL algorithm.
引用
收藏
页码:118 / 126
页数:9
相关论文
共 50 条
  • [31] Using Recurrent Neural Networks to Build a Stopping Algorithm for an Adaptive Assessment
    Matayoshi, Jeffrey
    Cosyn, Eric
    Uzun, Hasan
    ARTIFICIAL INTELLIGENCE IN EDUCATION, AIED 2019, PT II, 2019, 11626 : 179 - 184
  • [32] A new boosting algorithm for improved time-series forecasting with recurrent neural networks
    Assaad, Mohammad
    Bone, Romuald
    Cardot, Hubert
    INFORMATION FUSION, 2008, 9 (01) : 41 - 55
  • [33] Breast tumor classification in ultrasound images using neural networks with improved generalization methods
    Silva, S. D. de S.
    Costa, M. G. F.
    Pereira, W. C. de A.
    Costa Filho, C. F. F.
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 6321 - 6325
  • [34] An improved tangent search algorithm
    Pachung, Probhat
    Bansal, Jagdish Chand
    METHODSX, 2022, 9
  • [35] Learning in fully recurrent neural networks by approaching tangent planes to constraint surfaces
    May, P.
    Zhou, E.
    Lee, C. W.
    NEURAL NETWORKS, 2012, 34 : 72 - 79
  • [36] Sparse signal reconstruction via recurrent neural networks with hyperbolic tangent function
    Wen, Hongsong
    He, Xing
    Huang, Tingwen
    NEURAL NETWORKS, 2022, 153 : 1 - 12
  • [37] AN EVOLUTIONARY ALGORITHM THAT CONSTRUCTS RECURRENT NEURAL NETWORKS
    ANGELINE, PJ
    SAUNDERS, GM
    POLLACK, JB
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (01): : 54 - 65
  • [38] A training algorithm for partial recurrent neural networks
    Al-Faysale, MSM
    Modelling and Simulation 2004, 2004, : 350 - 354
  • [40] Enhancing the generalization ability of neural networks by using Gram-Schmidt orthogonalization algorithm
    Wan, WS
    Hirasawa, K
    Hu, JL
    Murata, J
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1721 - 1726