MAML2: meta reinforcement learning via meta-learning for task categories

被引:0
|
作者
FU Qiming [1 ]
WANG Zhechao [1 ]
FANG Nengwei [2 ]
XING Bin [2 ]
ZHANG Xiao [3 ]
CHEN Jianping [4 ]
机构
[1] School of Electronics and Information Engineering, Suzhou University of Science and Technology, Suzhou , China
[2] Chongqing Industrial Big Data Innovation Center Co Ltd, Chongqing , China
[3] School of Medical Informatics, Xuzhou Medical University, Xuzhou , China
[4] School of Architecture and Urban Planning, Suzhou University of Science and Technology, Suzhou ,
关键词
meta-learning; reinforcement learning; few-shot learning; negative adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-learning has been widely applied to solving few-shot reinforcement learning problems, where we hope to obtain an agent that can learn quickly in a new task. However, these algorithms often ignore some isolated tasks in pursuit of the average performance, which may result in negative adaptation in these isolated tasks, and they usually need sufficient learning in a stationary task distribution. In this paper, our algorithm presents a hierarchical framework of double meta-learning, and the whole framework includes classification, meta-learning, and re-adaptation. Firstly, in the classification process, we classify tasks into several task subsets, considered as some categories of tasks, by learned parameters of each task, which can separate out some isolated tasks thereafter. Secondly, in the meta-learning process, we learn category parameters in all subsets via meta-learning. Simultaneously, based on the gradient of each category parameter in each subset, we use meta-learning again to learn a new meta-parameter related to the whole task set, which can be used as an initial parameter for the new task. Finally, in the re-adaption process, we adapt the parameter of the new task with two steps, by the meta-parameter and the appropriate category parameter successively. Experimentally, we demonstrate our algorithm prevents the agent from negative adaptation without losing the average performance for the whole task set. Additionally, our algorithm presents a more rapid adaptation process within re-adaptation. Moreover, we show the good performance of our algorithm with fewer samples as the agent is exposed to an online meta-learning setting.
引用
收藏
相关论文
共 50 条
  • [1] MAML2: meta reinforcement learning via meta-learning for task categories
    Fu, Qiming
    Wang, Zhechao
    Fang, Nengwei
    Xing, Bin
    Zhang, Xiao
    Chen, Jianping
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (04)
  • [2] Meta-learning in Reinforcement Learning
    Schweighofer, N
    Doya, K
    NEURAL NETWORKS, 2003, 16 (01) : 5 - 9
  • [3] Multi-Task Reinforcement Meta-Learning in Neural Networks
    Shakah, Ghazi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 263 - 269
  • [4] XB-MAML: Learning Expandable Basis Parameters for Effective Meta-Learning with Wide Task Coverage
    Lee, Jae-Jun
    Yoon, Sung Whan
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [5] Evo-MAML: Meta-Learning with Evolving Gradient
    Chen, Jiaxing
    Yuan, Weilin
    Chen, Shaofei
    Hu, Zhenzhen
    Li, Peng
    ELECTRONICS, 2023, 12 (18)
  • [6] ROBUST MAML: PRIORITIZATION TASK BUFFER WITH ADAPTIVE LEARNING PROCESS FOR MODEL-AGNOSTIC META-LEARNING
    Thanh Nguyen
    Tung Luu
    Trung Pham
    Rakhimkul, Sanzhar
    Yoo, Chang D.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3460 - 3464
  • [7] Improving Generalization in Meta-learning via Task Augmentation
    Yao, Huaxiu
    Huang, Long-Kai
    Zhang, Linjun
    Wei, Ying
    Tian, Li
    Zou, James
    Huang, Junzhou
    Li, Zhenhui
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [8] Towards Task Sampler Learning for Meta-Learning
    Wang, Jingyao
    Qiang, Wenwen
    Su, Xingzhe
    Zheng, Changwen
    Sun, Fuchun
    Xiong, Hui
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (12) : 5534 - 5564
  • [9] ST-MAML : A Stochastic-Task based Method for Task-Heterogeneous Meta-Learning
    Wang, Zhe
    Grigsby, Jake
    Sekhon, Arshdeep
    Qi, Yanjun
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2066 - 2074
  • [10] TASK2VEC: Task Embedding for Meta-Learning
    Achille, Alessandro
    Lam, Michael
    Tewari, Rahul
    Ravichandran, Avinash
    Maji, Subhransu
    Fowlkes, Charless
    Soatto, Stefano
    Perona, Pietro
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6439 - 6448