Learning to learn using gradient descent

Cited by: 0
Authors
Hochreiter, S [1]
Younger, AS
Conwell, PR
Affiliations
[1] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[2] Westminster Coll, Dept Phys, Salt Lake City, UT USA
Keywords: (none listed)
DOI: not available
Chinese Library Classification: TP18 [Artificial Intelligence Theory]
Subject classification codes: 081104; 0812; 0835; 1405
Abstract
This paper introduces the application of gradient descent methods to meta-learning. The concept of "meta-learning", i.e. a system that improves or discovers a learning algorithm, has been of interest in machine learning for decades because of its appealing applications. Previous meta-learning approaches have been based on evolutionary methods and have therefore been restricted to small models with few free parameters. We make meta-learning in large systems feasible by using recurrent neural networks, together with their attendant learning routines, as meta-learning systems. Our system derives complex, well-performing learning algorithms from scratch. We also show that our approach handles non-stationary time series prediction.
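The abstract describes recurrent networks whose ordinary gradient-based training across many tasks produces a learning algorithm embedded in the network's dynamics. Below is a minimal sketch of that idea, assuming PyTorch and a toy family of random linear-regression tasks; the names MetaLearner and sample_task and the task family are illustrative choices, not the authors' original setup. An LSTM receives the current input together with the previous target, and gradient descent on the sequence loss meta-trains the network to adapt to each new task within a sequence.

# Minimal meta-learning sketch (illustrative, not the paper's code), assuming PyTorch.
import torch
import torch.nn as nn

class MetaLearner(nn.Module):
    def __init__(self, in_dim=1, hidden=64):
        super().__init__()
        # Input at step t is (x_t, y_{t-1}); feeding back the previous target
        # gives the recurrent dynamics the error information needed to adapt
        # to the current task without any weight change at test time.
        self.rnn = nn.LSTM(in_dim + 1, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, x, y_prev):
        h, _ = self.rnn(torch.cat([x, y_prev], dim=-1))
        return self.head(h)

def sample_task(batch, steps, in_dim=1):
    # A toy task family: random linear maps y = x @ w + b.
    w = torch.randn(batch, in_dim, 1)
    b = torch.randn(batch, 1, 1)
    x = torch.randn(batch, steps, in_dim)
    return x, x @ w + b

model = MetaLearner()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
for step in range(2000):
    x, y = sample_task(batch=32, steps=20)
    # Shift targets by one step so the network only sees past feedback.
    y_prev = torch.cat([torch.zeros_like(y[:, :1]), y[:, :-1]], dim=1)
    loss = ((model(x, y_prev) - y) ** 2).mean()  # meta-loss over the whole sequence
    opt.zero_grad()
    loss.backward()   # ordinary gradient descent meta-trains the
    opt.step()        # learning algorithm encoded in the LSTM dynamics

After meta-training, the network is evaluated on freshly drawn tasks: within a sequence the prediction error should fall from step to step even though the weights stay fixed, which is the sense in which the recurrent dynamics themselves implement a learning algorithm.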
Pages: 87-94
Page count: 8
Related papers
50 items in total
  • [1] Learning to learn by gradient descent by gradient descent
    Andrychowicz, Marcin
    Denil, Misha
    Colmenarejo, Sergio Gomez
    Hoffman, Matthew W.
    Pfau, David
    Schaul, Tom
    Shillingford, Brendan
    de Freitas, Nando
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [2] Learning to Learn without Gradient Descent by Gradient Descent
    Chen, Yutian
    Hoffman, Matthew W.
    Colmenarejo, Sergio Gomez
    Denil, Misha
    Lillicrap, Timothy P.
    Botvinick, Matt
    de Freitas, Nando
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [3] Learning to Learn Gradient Aggregation by Gradient Descent
    Ji, Jinlong
    Chen, Xuhui
    Wang, Qianlong
    Yu, Lixing
    Li, Pan
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2614 - 2620
  • [4] Learning-to-Learn Stochastic Gradient Descent with Biased Regularization
    Denevi, Giulia
    Ciliberto, Carlo
    Grazzi, Riccardo
    Pontil, Massimiliano
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [5] Learning Functors using Gradient Descent
    Gavranovic, Bruno
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2020, (323): 230 - 245
  • [6] Transformers learn to implement preconditioned gradient descent for in-context learning
    Ahn, Kwangjun
    Cheng, Xiang
    Daneshmand, Hadi
    Sra, Suvrit
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [7] Learning a Single Neuron with Bias Using Gradient Descent
    Vardi, Gal
    Yehudai, Gilad
    Shamir, Ohad
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Learning Fractals by Gradient Descent
    Tu, Cheng-Hao
    Chen, Hong-You
    Carlyn, David
    Chao, Wei-Lun
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2456 - 2464
  • [9] Gradient Descent Learning With Floats
    Sun, Tao
    Tang, Ke
    Li, Dongsheng
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (03) : 1763 - 1771
  • [10] Learning by on-line gradient descent
    Biehl, M
    Schwarze, H
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (03): 643 - 656