Online Meta-Learning

被引:0
|
作者
Finn, Chelsea [1 ]
Rajeswaran, Aravind [2 ]
Kakade, Sham [2 ]
Levine, Sergey [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Univ Washington, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A central capability of intelligent systems is the ability to continuously build upon previous experiences to speed up and enhance learning of new tasks. Two distinct research paradigms have studied this question. Meta-learning views this problem as learning a prior over model parameters that is amenable for fast adaptation on a new task, but typically assumes the tasks are available together as a batch. In contrast, online (regret based) learning considers a setting where tasks are revealed one after the other, but conventionally trains a single model without task-specific adaptation. This work introduces an online meta-learning setting, which merges ideas from both paradigms to better capture the spirit and practice of continual lifelong learning. We propose the follow the meta leader (MIMI) algorithm which extends the MAML, algorithm to this setting. Theoretically, this work provides an O(log T) regret guarantee with one additional higher order smoothness assumption (in comparison to the standard online setting). Our experimental evaluation on three different largescale problems suggest that the proposed algorithm significantly outperforms alternatives based on traditional online learning approaches.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Online Structured Meta-learning
    Yao, Huaxiu
    Zhou, Yingbo
    Mahdavi, Mehrdad
    Li, Zhenhui
    Socher, Richard
    Xiong, Caiming
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Online-Within-Online Meta-Learning
    Denevi, Giulia
    Stamos, Dimitris
    Ciliberto, Carlo
    Pontil, Massimiliano
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [3] Online meta-learning for POI recommendation
    Lv, Yao
    Sang, Yu
    Tai, Chong
    Cheng, Wanjun
    Shang, Jedi S.
    Qu, Jianfeng
    Chu, Xiaomin
    Zhang, Ruoqian
    [J]. GEOINFORMATICA, 2023, 27 (01) : 61 - 76
  • [4] Online meta-learning for POI recommendation
    Yao Lv
    Yu Sang
    Chong Tai
    Wanjun Cheng
    Jedi S. Shang
    Jianfeng Qu
    Xiaomin Chu
    Ruoqian Zhang
    [J]. GeoInformatica, 2023, 27 : 61 - 76
  • [5] Online Optimization Method of Learning Process for Meta-Learning
    Xu, Zhixiong
    Zhang, Wei
    Li, Ailin
    Zhao, Feifei
    Jing, Yuanyuan
    Wan, Zheng
    Cao, Lei
    Chen, Xiliang
    [J]. COMPUTER JOURNAL, 2023, 67 (05): : 1645 - 1651
  • [6] Online Optimization Method of Learning Process for Meta-Learning
    Xu, Zhixiong
    Zhang, Wei
    Li, Ailin
    Zhao, Feifei
    Jing, Yuanyuan
    Wan, Zheng
    Cao, Lei
    Chen, Xiliang
    [J]. Computer Journal, 1600, 67 (05): : 1645 - 1651
  • [7] Online Meta-Learning by Parallel Algorithm Competition
    Elfwing, Stefan
    Uchibe, Eiji
    Doya, Kenji
    [J]. GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 426 - 433
  • [8] Meta-Learning for Online Update of Recommender Systems
    Kim, Minseok
    Song, Hwanjun
    Shin, Yooju
    Park, Dongmin
    Shin, Kijung
    Lee, Jae-Gil
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4065 - 4074
  • [9] Fairness-Aware Online Meta-learning
    Zhao, Chen
    Chen, Feng
    Thuraisingham, Bhavani
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 2294 - 2304
  • [10] Reconciling meta-learning and continual learning with online mixtures of tasks
    Jerfel, Ghassen
    Grant, Erin
    Griffiths, Thomas L.
    Heller, Katherine
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32