MAML2: meta reinforcement learning via meta-learning for task categories

被引：0

作者：

FU Qiming ^{[1
]}

WANG Zhechao ^{[1
]}

FANG Nengwei ^{[2
]}

XING Bin ^{[2
]}

ZHANG Xiao ^{[3
]}

CHEN Jianping ^{[4
]}

机构：

[1] School of Electronics and Information Engineering, Suzhou University of Science and Technology, Suzhou , China

[2] Chongqing Industrial Big Data Innovation Center Co Ltd, Chongqing , China

[3] School of Medical Informatics, Xuzhou Medical University, Xuzhou , China

[4] School of Architecture and Urban Planning, Suzhou University of Science and Technology, Suzhou ,

来源：

Frontiers of Computer Science | 2023年 / 17卷 / 04期

关键词：

meta-learning; reinforcement learning; few-shot learning; negative adaptation;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Meta-learning has been widely applied to solving few-shot reinforcement learning problems, where we hope to obtain an agent that can learn quickly in a new task. However, these algorithms often ignore some isolated tasks in pursuit of the average performance, which may result in negative adaptation in these isolated tasks, and they usually need sufficient learning in a stationary task distribution. In this paper, our algorithm presents a hierarchical framework of double meta-learning, and the whole framework includes classification, meta-learning, and re-adaptation. Firstly, in the classification process, we classify tasks into several task subsets, considered as some categories of tasks, by learned parameters of each task, which can separate out some isolated tasks thereafter. Secondly, in the meta-learning process, we learn category parameters in all subsets via meta-learning. Simultaneously, based on the gradient of each category parameter in each subset, we use meta-learning again to learn a new meta-parameter related to the whole task set, which can be used as an initial parameter for the new task. Finally, in the re-adaption process, we adapt the parameter of the new task with two steps, by the meta-parameter and the appropriate category parameter successively. Experimentally, we demonstrate our algorithm prevents the agent from negative adaptation without losing the average performance for the whole task set. Additionally, our algorithm presents a more rapid adaptation process within re-adaptation. Moreover, we show the good performance of our algorithm with fewer samples as the agent is exposed to an online meta-learning setting.

引用

共 50 条

[1] MAML2: meta reinforcement learning via meta-learning for task categories
Fu, Qiming
Wang, Zhechao
Fang, Nengwei
Xing, Bin
Zhang, Xiao
Chen, Jianping
FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (04)
[2] Meta-learning in Reinforcement Learning
Schweighofer, N
Doya, K
NEURAL NETWORKS, 2003, 16 (01) : 5 - 9
[3] Multi-Task Reinforcement Meta-Learning in Neural Networks
Shakah, Ghazi
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 263 - 269
[4] XB-MAML: Learning Expandable Basis Parameters for Effective Meta-Learning with Wide Task Coverage
Lee, Jae-Jun
Yoon, Sung Whan
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
[5] Evo-MAML: Meta-Learning with Evolving Gradient
Chen, Jiaxing
Yuan, Weilin
Chen, Shaofei
Hu, Zhenzhen
Li, Peng
ELECTRONICS, 2023, 12 (18)
[6] ROBUST MAML: PRIORITIZATION TASK BUFFER WITH ADAPTIVE LEARNING PROCESS FOR MODEL-AGNOSTIC META-LEARNING
Thanh Nguyen
Tung Luu
Trung Pham
Rakhimkul, Sanzhar
Yoo, Chang D.
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3460 - 3464
[7] Improving Generalization in Meta-learning via Task Augmentation
Yao, Huaxiu
Huang, Long-Kai
Zhang, Linjun
Wei, Ying
Tian, Li
Zou, James
Huang, Junzhou
Li, Zhenhui
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[8] Towards Task Sampler Learning for Meta-Learning
Wang, Jingyao
Qiang, Wenwen
Su, Xingzhe
Zheng, Changwen
Sun, Fuchun
Xiong, Hui
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (12) : 5534 - 5564
[9] ST-MAML : A Stochastic-Task based Method for Task-Heterogeneous Meta-Learning
Wang, Zhe
Grigsby, Jake
Sekhon, Arshdeep
Qi, Yanjun
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2066 - 2074
[10] TASK2VEC: Task Embedding for Meta-Learning
Achille, Alessandro
Lam, Michael
Tewari, Rahul
Ravichandran, Avinash
Maji, Subhransu
Fowlkes, Charless
Soatto, Stefano
Perona, Pietro
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6439 - 6448

← 1 2 3 4 5 →