Improved Generalization in Recurrent Neural Networks Using the Tangent Plane Algorithm

被引：0

作者：

May, P. ^{[1
]}

Zhou, E. ^{[2
]}

Lee, C. W. ^{[2
]}

机构：

[1] K Coll, Brook St, Tonbridge, Kent, England

[2] Univ Bolton, Acad Grp, Appl Engn & Sci, Bolton, England

来源：

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2014年 / 5卷 / 03期

关键词：

real time recurrent learning; tangent plane; generalization; weight elimination; temporal pattern recognition; non-linear process control;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The tangent plane algorithm for real time recurrent learning (TPA-RTRL) is an effective online training method for fully recurrent neural networks. TPA-RTRL uses the method of approaching tangent planes to accelerate the learning processes. Compared to the original gradient descent real time recurrent learning algorithm (GD-RTRL) it is very fast and avoids problems like local minima of the search space. However, the TPA-RTRL algorithm actively encourages the formation of large weight values that can be harmful to generalization. This paper presents a new TPA-RTRL variant that encourages small weight values to decay to zero by using a weight elimination procedure built into the geometry of the algorithm. Experimental results show that the new algorithm gives good generalization over a range of network sizes whilst retaining the fast convergence speed of the TPA-RTRL algorithm.

引用

页码：118 / 126

页数：9

共 50 条

[21] Fast algorithm for recurrent neural networks
Wang, Ju
Liu, Heping
Beijing Keji Daxue Xuebao/Journal of University of Science and Technology Beijing, 2000, 22 (01): : 89 - 92
[22] An incremental parallel tangent learning algorithm for artificial neural networks
Nezami, AR
Bhavsar, VC
Ghorbani, AA
1997 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS I AND II: ENGINEERING INNOVATION: VOYAGE OF DISCOVERY, 1997, : 301 - 304
[23] An Improved Scene-based Nonuniformity Correction Algorithm for Infrared Focal Plane Arrays Using Neural Networks
隋婧
金伟其
董立泉
王霞
郭宏
Journal of China Ordnance, 2006, (02) : 117 - 122
[24] Using neural networks for generalization problems
Dutta, Soumitra
Shekhar, Shashi
Neural Networks, 1988, 1 (1 SUPPL)
[25] Improved generalization performance of convolutional neural networks with LossDA
Liu, Juncheng
Zhao, Yili
APPLIED INTELLIGENCE, 2023, 53 (11) : 13852 - 13866
[26] Improved generalization performance of convolutional neural networks with LossDA
Juncheng Liu
Yili Zhao
Applied Intelligence, 2023, 53 : 13852 - 13866
[27] Improved Prediction of Frost Depth Penetration Using Recurrent Neural Networks
Slone, Scott Michael
Zody, Zachary
Ibey, Robert
Lein, Wade A.
COLD REGIONS ENGINEERING 2024: SUSTAINABLE AND RESILIENT ENGINEERING SOLUTIONS FOR CHANGING COLD REGIONS, 2024, : 1 - 11
[28] Improved Prediction of Soil Thermal Properties Using Recurrent Neural Networks
Slone, Scott Michael
Zody, Zachary
Ibey, Robert
Lein, Wade A.
INTERNATIONAL CONFERENCE ON TRANSPORTATION AND DEVELOPMENT 2024: TRANSPORTATION SAFETY AND EMERGING TECHNOLOGIES, ICTD 2024, 2024, : 431 - 441
[29] Improved nonlinear predictive control performance using recurrent neural networks
Kuure-Kinsey, Matthew
Bequette, B. Wayne
2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 4197 - 4202
[30] Can SGD Learn Recurrent Neural Networks with Provable Generalization?
Allen-Zhu, Zeyuan
Li, Yuanzhi
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32

← 1 2 3 4 5 →