A Federated Meta-Reinforcement Learning Algorithm Based on Gradient Correction

被引:0
|
作者
Qin, Zerui [1 ]
Yue, Sheng [1 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
关键词
D O I
10.1145/3674399.3674473
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To meet the requirement of the decision-making capabilities of IoT devices in dynamic environments, Federated Reinforcement Learning (FRL) has attracted increasing attention in recent years. However, Due to the challenges of heterogeneous environments and sampling limitation in the practical scenarios, it is difficult for clients in different environments to jointly train a personalized policy that can quickly adapt to a new environment. In this paper, we propose a federated meta-reinforcement learning algorithm with gradient correction to extract a meta-policy from heterogeneous environments. Further, we analyze the performance of our proposed algorithm in the experiment section.
引用
收藏
页码:220 / 221
页数:2
相关论文
共 50 条
  • [21] Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning
    Wang, Mingyang
    Bing, Zhenshan
    Yao, Xiangtong
    Wang, Shuai
    Kai, Huang
    Su, Hang
    Yang, Chenguang
    Knoll, Alois
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10157 - 10165
  • [22] Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
    Fu, Haotian
    Tang, Hongyao
    Hao, Jianye
    Chen, Chen
    Feng, Xidong
    Li, Dong
    Liu, Wulong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7457 - 7465
  • [23] Prefrontal cortex as a meta-reinforcement learning system
    Wang, Jane X.
    Kurth-Nelson, Zeb
    Kumaran, Dharshan
    Tirumala, Dhruva
    Soyer, Hubert
    Leibo, Joel Z.
    Hassabis, Demis
    Botvinick, Matthew
    NATURE NEUROSCIENCE, 2018, 21 (06) : 860 - +
  • [24] Context-Based Meta-Reinforcement Learning With Bayesian Nonparametric Models
    Bing, Zhenshan
    Yun, Yuqi
    Huang, Kai
    Knoll, Alois
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (10) : 6948 - 6965
  • [25] Model-Based Meta-Reinforcement Learning for Flight With Suspended Payloads
    Belkhale, Suneel
    Li, Rachel
    Kahn, Gregory
    McAllister, Rowan
    Calandra, Roberto
    Levine, Sergey
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 1471 - 1478
  • [26] Uncertainty-based Meta-Reinforcement Learning for Robust Radar Tracking
    Ott, Julius
    Servadei, Lorenzo
    Mauro, Gianfranco
    Stadelmayer, Thomas
    Santra, Avik
    Wille, Robert
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1476 - 1483
  • [27] Some Considerations on Learning to Explore via Meta-Reinforcement Learning
    Stadie, Bradly C.
    Yang, Ge
    Houthooft, Rein
    Chen, Xi
    Duan, Yan
    Wu, Yuhuai
    Abbeel, Pieter
    Sutskever, Ilya
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [28] Learning Locomotion Skills via Model-based Proximal Meta-Reinforcement Learning
    Xiao, Qing
    Cao, Zhengcai
    Zhou, Mengchu
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 1545 - 1550
  • [29] Meta-Reinforcement Learning with Self-Modifying Networks
    Chalvidal, Mathieu
    Serre, Thomas
    VanRullen, Rufin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [30] Meta-Reinforcement Learning for Robotic Industrial Insertion Tasks
    Schoettler, Gerrit
    Nair, Ashvin
    Ojea, Juan Aparicio
    Levine, Sergey
    Solowjow, Eugen
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 9728 - 9735