SCORE: Simple Contrastive Representation and Reset-Ensemble for offline meta-reinforcement learning

被引:0
|
作者
Yang, Hanjie [1 ]
Lin, Kai [1 ]
Yang, Tao [1 ]
Sun, Guohan [1 ]
机构
[1] Dalian Univ Technol, Dept Comp Sci & Technol, Dalian, Peoples R China
关键词
Offline meta reinforcement learning; Contrastive learning; Reset-Ensemble;
D O I
10.1016/j.knosys.2024.112767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Offline meta-reinforcement learning (OMRL) aims to train agents to quickly adapt to new tasks using only pre-collected data. However, existing OMRL methods often involve numerous ineffective training iterations and may experience performance collapse in the later stages of training. We identify the root cause-shallow memorization problem, where agents overspecialize in specific solutions for encountered states, hindering their generalization performance. This issue arises due to the loss of plasticity and the premature fitting of neural networks, which restricts the exploration of the agents. To address this challenge, we propose S imple CO ntrastive Representation and R eset-Ensemble for OMRL (SCORE), a novel context-based OMRL approach. SCORE introduces an end-to-end contrastive learning framework without negative samples to pre-train a context encoder, enabling more robust task representations. Subsequently, the context encoder is fine-tuned during meta-training. Furthermore, SCORE employs a Reset-Ensemble mechanism that periodically resets and ensembles partial networks to maintain the agents' continual learning ability and enhance their perception of characteristics across diverse tasks. Extensive experiments demonstrate that our SCORE method effectively avoids premature fitting and exhibits excellent generalization performance.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
    Yuan, Haoqi
    Lu, Zongqing
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [2] Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
    Zhou, Renzhe
    Gao, Chen-Xiao
    Zhang, Zongzhang
    Yu, Yang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 17132 - 17140
  • [3] Offline Meta-Reinforcement Learning for Industrial Insertion
    Zhao, Tony Z.
    Luo, Jianlan
    Sushkov, Oleg
    Pevceviciute, Rugile
    Heess, Nicolas
    Scholz, Jon
    Schaal, Stefan
    Levine, Sergey
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6386 - 6393
  • [4] Offline Meta-Reinforcement Learning with Advantage Weighting
    Mitchell, Eric
    Rafailov, Rafael
    Peng, Xue Bin
    Levine, Sergey
    Finn, Chelsea
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [5] PAC-Bayesian offline Meta-reinforcement learning
    Sun, Zheng
    Jing, Chenheng
    Guo, Shangqi
    An, Lingling
    APPLIED INTELLIGENCE, 2023, 53 (22) : 27128 - 27147
  • [6] PAC-Bayesian offline Meta-reinforcement learning
    Zheng Sun
    Chenheng Jing
    Shangqi Guo
    Lingling An
    Applied Intelligence, 2023, 53 : 27128 - 27147
  • [7] Context Shift Reduction for Offline Meta-Reinforcement Learning
    Gao, Yunkai
    Zhang, Rui
    Guo, Jiaming
    Wu, Fan
    Yi, Qi
    Peng, Shaohui
    Lan, Siming
    Chen, Ruizhi
    Du, Zidong
    Hu, Xing
    Guo, Qi
    Li, Ling
    Chen, Yunji
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Meta-reinforcement learning for the tuning of PI controllers: An offline approach
    McClement, Daniel G.
    Lawrence, Nathan P.
    Backstroem, Johan U.
    Loewen, Philip D.
    Forbes, Michael G.
    Gopaluni, R. Bhushan
    JOURNAL OF PROCESS CONTROL, 2022, 118 : 139 - 152
  • [9] Offline Meta-Reinforcement Learning with Online Self-Supervision
    Pong, Vitchyr H.
    Nair, Ashvin
    Smith, Laura
    Huang, Catherine
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [10] Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
    Fu, Haotian
    Tang, Hongyao
    Hao, Jianye
    Chen, Chen
    Feng, Xidong
    Li, Dong
    Liu, Wulong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7457 - 7465