Learning to learn using gradient descent

Authors
Hochreiter, S [1]
Younger, AS
Conwell, PR
Affiliations
[1] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[2] Westminster Coll, Dept Phys, Salt Lake City, UT USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper introduces the application of gradient descent methods to meta-learning. The concept of "meta-learning", i.e. of a system that improves or discovers a learning algorithm, has been of interest in machine learning for decades because of its appealing applications. Previous meta-learning approaches have been based on evolutionary methods and, therefore, have been restricted to small models with few free parameters. We make meta-learning in large systems feasible by using recurrent neural networks with their attendant learning routines as meta-learning systems. Our system derived complex, well-performing learning algorithms from scratch. In this paper we also show that our approach performs non-stationary time series prediction.
Pages: 87-94
Page count: 8
Related papers
50 in total
  • [31] Gradient Descent Using Stochastic Circuits for Efficient Training of Learning Machines
    Liu, Siting
    Jiang, Honglan
    Liu, Leibo
    Han, Jie
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (11) : 2530 - 2541
  • [32] Learning weights in genetic programs using gradient descent for object recognition
    Zhang, MJ
    Smart, W
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2005, 3449 : 417 - 427
  • [33] Dual Space Gradient Descent for Online Learning
    Trung Le
    Tu Dinh Nguyen
    Vu Nguyen
    Dinh Phung
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [34] Quantum Shadow Gradient Descent for Quantum Learning
    Heidari, Mohsen
    Naved, Mobasshir A.
    Xie, Wenbo
    Grama, Arjun Jacob
    Szpankowski, Wojciech
    [J]. arXiv, 2023,
  • [35] Online learning via congregational gradient descent
    Kim L. Blackmore
    Robert C. Williamson
    Iven M. Y. Mareels
    William A. Sethares
    [J]. Mathematics of Control, Signals and Systems, 1997, 10 : 331 - 363
  • [36] On the momentum term in gradient descent learning algorithms
    Qian, N
    [J]. NEURAL NETWORKS, 1999, 12 (01) : 145 - 151
  • [37] Robust supervised learning with coordinate gradient descent
    Ibrahim Merad
    Stéphane Gaïffas
    [J]. Statistics and Computing, 2023, 33
  • [38] Online learning via congregational gradient descent
    Blackmore, RL
    Williamson, RC
    Mareels, IMY
    Sethares, WA
    [J]. MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 1997, 10 (04) : 331 - 363
  • [39] Natural gradient descent for on-line learning
    Rattray, M
    Saad, D
    Amari, S
    [J]. PHYSICAL REVIEW LETTERS, 1998, 81 (24) : 5461 - 5464
  • [40] Limited Gradient Descent: Learning With Noisy Labels
    Sun, Yi
    Tian, Yan
    Xu, Yiping
    Li, Jianxiang
    [J]. IEEE ACCESS, 2019, 7 : 168296 - 168306