Improved Generalization in Recurrent Neural Networks Using the Tangent Plane Algorithm

Cited by: 0
Authors
May, P. [1 ]
Zhou, E. [2 ]
Lee, C. W. [2 ]
Affiliations
[1] K Coll, Brook St, Tonbridge, Kent, England
[2] Univ Bolton, Acad Grp, Appl Engn & Sci, Bolton, England
Keywords
real time recurrent learning; tangent plane; generalization; weight elimination; temporal pattern recognition; non-linear process control
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
The tangent plane algorithm for real time recurrent learning (TPA-RTRL) is an effective online training method for fully recurrent neural networks. TPA-RTRL uses the method of approaching tangent planes to accelerate the learning processes. Compared to the original gradient descent real time recurrent learning algorithm (GD-RTRL) it is very fast and avoids problems like local minima of the search space. However, the TPA-RTRL algorithm actively encourages the formation of large weight values that can be harmful to generalization. This paper presents a new TPA-RTRL variant that encourages small weight values to decay to zero by using a weight elimination procedure built into the geometry of the algorithm. Experimental results show that the new algorithm gives good generalization over a range of network sizes whilst retaining the fast convergence speed of the TPA-RTRL algorithm.
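To make the idea of weights "decaying to zero" concrete, the sketch below shows the classic penalty-term form of weight elimination, in which each weight w contributes lam * (w/w0)^2 / (1 + (w/w0)^2) to the cost. This is an assumption chosen for illustration only: the abstract says the paper builds weight elimination into the geometry of the tangent plane algorithm, and that procedure is not reproduced here. The function names and the parameters `w0`, `lam`, and `lr` are hypothetical.

```python
# Illustrative only: classic penalty-term weight elimination,
# NOT the geometric TPA-RTRL variant described in the abstract.

def penalty(weights, w0=1.0, lam=1e-3):
    """Return lam * sum((w/w0)^2 / (1 + (w/w0)^2)) over all weights."""
    total = 0.0
    for w in weights:
        r = (w / w0) ** 2
        total += r / (1.0 + r)
    return lam * total


def penalty_grad(weights, w0=1.0, lam=1e-3):
    """Gradient of the penalty: lam * (2w / w0^2) / (1 + (w/w0)^2)^2."""
    grads = []
    for w in weights:
        r = (w / w0) ** 2
        grads.append(lam * (2.0 * w / w0 ** 2) / (1.0 + r) ** 2)
    return grads


def decay_step(weights, lr=0.1, w0=1.0, lam=1e-3):
    """One gradient step on the penalty alone (no data error term)."""
    grads = penalty_grad(weights, w0, lam)
    return [w - lr * g for w, g in zip(weights, grads)]


if __name__ == "__main__":
    # Hypothetical weights: two small, one large relative to w0.
    w = [0.05, -0.05, 5.0]
    print(decay_step(w, lr=0.1, lam=1.0))
```

For |w| much smaller than w0 the gradient behaves like ordinary quadratic weight decay, so small weights shrink proportionally fast; for |w| much larger than w0 the gradient falls off roughly as 1/w^3, so large weights are left almost untouched.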
Pages: 118 - 126
Page count: 9
Related papers
50 items in total
  • [21] Fast algorithm for recurrent neural networks
    Wang, Ju
    Liu, Heping
    Beijing Keji Daxue Xuebao/Journal of University of Science and Technology Beijing, 2000, 22 (01): : 89 - 92
  • [22] An incremental parallel tangent learning algorithm for artificial neural networks
    Nezami, AR
    Bhavsar, VC
    Ghorbani, AA
    1997 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS I AND II: ENGINEERING INNOVATION: VOYAGE OF DISCOVERY, 1997, : 301 - 304
  • [23] An Improved Scene-based Nonuniformity Correction Algorithm for Infrared Focal Plane Arrays Using Neural Networks
    隋婧
    金伟其
    董立泉
    王霞
    郭宏
    Journal of China Ordnance, 2006, (02) : 117 - 122
  • [24] Using neural networks for generalization problems
    Dutta, Soumitra
    Shekhar, Shashi
    Neural Networks, 1988, 1 (1 SUPPL)
  • [25] Improved generalization performance of convolutional neural networks with LossDA
    Liu, Juncheng
    Zhao, Yili
    APPLIED INTELLIGENCE, 2023, 53 (11) : 13852 - 13866
  • [26] Improved generalization performance of convolutional neural networks with LossDA
    Juncheng Liu
    Yili Zhao
    Applied Intelligence, 2023, 53 : 13852 - 13866
  • [27] Improved Prediction of Frost Depth Penetration Using Recurrent Neural Networks
    Slone, Scott Michael
    Zody, Zachary
    Ibey, Robert
    Lein, Wade A.
    COLD REGIONS ENGINEERING 2024: SUSTAINABLE AND RESILIENT ENGINEERING SOLUTIONS FOR CHANGING COLD REGIONS, 2024, : 1 - 11
  • [28] Improved Prediction of Soil Thermal Properties Using Recurrent Neural Networks
    Slone, Scott Michael
    Zody, Zachary
    Ibey, Robert
    Lein, Wade A.
    INTERNATIONAL CONFERENCE ON TRANSPORTATION AND DEVELOPMENT 2024: TRANSPORTATION SAFETY AND EMERGING TECHNOLOGIES, ICTD 2024, 2024, : 431 - 441
  • [29] Improved nonlinear predictive control performance using recurrent neural networks
    Kuure-Kinsey, Matthew
    Bequette, B. Wayne
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 4197 - 4202
  • [30] Can SGD Learn Recurrent Neural Networks with Provable Generalization?
    Allen-Zhu, Zeyuan
    Li, Yuanzhi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32