SCORE: Simple Contrastive Representation and Reset-Ensemble for offline meta-reinforcement learning

被引:0
|
作者
Yang, Hanjie [1 ]
Lin, Kai [1 ]
Yang, Tao [1 ]
Sun, Guohan [1 ]
机构
[1] Dalian Univ Technol, Dept Comp Sci & Technol, Dalian, Peoples R China
关键词
Offline meta reinforcement learning; Contrastive learning; Reset-Ensemble;
D O I
10.1016/j.knosys.2024.112767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Offline meta-reinforcement learning (OMRL) aims to train agents to quickly adapt to new tasks using only pre-collected data. However, existing OMRL methods often involve numerous ineffective training iterations and may experience performance collapse in the later stages of training. We identify the root cause-shallow memorization problem, where agents overspecialize in specific solutions for encountered states, hindering their generalization performance. This issue arises due to the loss of plasticity and the premature fitting of neural networks, which restricts the exploration of the agents. To address this challenge, we propose S imple CO ntrastive Representation and R eset-Ensemble for OMRL (SCORE), a novel context-based OMRL approach. SCORE introduces an end-to-end contrastive learning framework without negative samples to pre-train a context encoder, enabling more robust task representations. Subsequently, the context encoder is fine-tuned during meta-training. Furthermore, SCORE employs a Reset-Ensemble mechanism that periodically resets and ensembles partial networks to maintain the agents' continual learning ability and enhance their perception of characteristics across diverse tasks. Extensive experiments demonstrate that our SCORE method effectively avoids premature fitting and exhibits excellent generalization performance.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Meta-Reinforcement Learning via Exploratory Task Clustering
    Chu, Zhendong
    Cai, Renqin
    Wang, Hongning
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 11633 - 11641
  • [42] Taming MAML: Efficient Unbiased Meta-Reinforcement Learning
    Liu, Hao
    Socher, Richard
    Xiong, Caiming
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [43] A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
    Liu, Bo
    Feng, Xidong
    Ren, Jie
    Mai, Luo
    Zhu, Rui
    Zhang, Haifeng
    Wang, Jun
    Yang, Yaodong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [44] GlobalLocal Decomposition of Contextual Representations in Meta-Reinforcement Learning
    Ma, Nelson
    Xuan, Junyu
    Zhang, Guangquan
    Lu, Jie
    IEEE TRANSACTIONS ON CYBERNETICS, 2025, 55 (03) : 1277 - 1287
  • [45] Hyperparameter optimization through context-based meta-reinforcement learning with task-aware representation
    Wu, Jia
    Liu, Xiyuan
    Chen, Senpeng
    KNOWLEDGE-BASED SYSTEMS, 2023, 260
  • [46] Contrastive Learning-Based Bayes-Adaptive Meta-Reinforcement Learning for Active Pantograph Control in High-Speed Railways
    Wang, Hui
    Han, Zhiwei
    Wang, Xufan
    Wu, Yanbo
    Liu, Zhigang
    IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2024, 10 (01): : 2045 - 2056
  • [47] A Federated Meta-Reinforcement Learning Algorithm Based on Gradient Correction
    Qin, Zerui
    Yue, Sheng
    PROCEEDINGS OF THE ACM TURING AWARD CELEBRATION CONFERENCE-CHINA 2024, ACM-TURC 2024, 2024, : 220 - 221
  • [48] Harnessing Meta-Reinforcement Learning for Enhanced Tracking in Geofencing Systems
    Famili, Alireza
    Sun, Shihua
    Atalay, Tolga
    Stavrou, Angelos
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2025, 6 : 944 - 960
  • [49] On First-Order Meta-Reinforcement Learning with Moreau Envelopes
    Toghani, Mohammad Taha
    Perez-Salazar, Sebastian
    Uribe, Cesar A.
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4176 - 4181
  • [50] Meta-Reinforcement Learning by Tracking Task Non-stationarity
    Poiani, Riccardo
    Tirinzoni, Andrea
    Restelli, Marcello
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2899 - 2905