Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee

被引:0
|
作者
Fan, Flint Xiaofeng [1 ,3 ]
Ma, Yining [2 ]
Dai, Zhongxiang [1 ]
Jing, Wei [4 ]
Tan, Cheston [3 ]
Low, Bryan Kian Hsiang [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
[2] Natl Univ Singapore, Dept ISEM, Singapore, Singapore
[3] ASTAR, Inst Infocomm Res, Singapore, Singapore
[4] Alibaba DAMO Acad, Hangzhou, Peoples R China
关键词
POLICY-GRADIENT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing literature of Federated Learning (FL) has recently inspired Federated Reinforcement Learning (FRL) to encourage multiple agents to federatively build a better decision-making policy without sharing raw trajectories. Despite its promising applications, existing works on FRL fail to I) provide theoretical analysis on its convergence, and II) account for random system failures and adversarial attacks. Towards this end, we propose the first FRL framework the convergence of which is guaranteed and tolerant to less than half of the participating agents being random system failures or adversarial attackers. We prove that the sample efficiency of the proposed framework is guaranteed to improve with the number of agents and is able to account for such potential failures or attacks. All theoretical results are empirically verified on various RL benchmark tasks.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Learning from Failures: Secure and Fault-Tolerant Aggregation for Federated Learning
    Mansouri, Mohamad
    Onen, Melek
    Ben Jaballah, Wafa
    [J]. PROCEEDINGS OF THE 38TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2022, 2022, : 146 - 158
  • [2] Reinforcement learning decoders for fault-tolerant quantum computation
    Sweke, Ryan
    Kesselring, Markus S.
    van Nieuwenburg, Evert P. L.
    Eisert, Jens
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2021, 2 (02):
  • [3] Reinforcement-Learning Based Fault-Tolerant Control
    Zhang, Dapeng
    Lin, Zhiling
    Gao, Zhiwei
    [J]. 2017 IEEE 15TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2017, : 671 - 676
  • [4] Synthesis of Fault-Tolerant Reliable Broadcast Algorithms With Reinforcement Learning
    Vaz, Diogo
    Matos, David R.
    Pardal, Miguel L.
    Correia, Miguel
    [J]. IEEE ACCESS, 2023, 11 : 62394 - 62408
  • [5] Reinforcement Learning with Randomized Physical Parameters for Fault-Tolerant Robots
    Okamoto, Wataru
    Kawamoto, Kazuhiko
    [J]. 2020 JOINT 11TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 21ST INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS-ISIS), 2020, : 437 - 440
  • [6] Byzantine Fault-Tolerant Federated Learning Based on Trustworthy Data and Historical Information
    Luo, Xujiang
    Tang, Bin
    [J]. ELECTRONICS, 2024, 13 (08)
  • [7] Fault-Tolerant Control of Degrading Systems with On-Policy Reinforcement Learning
    Ahmed, Ibrahim
    Quinones-Grueiro, Marcos
    Biswas, Gautam
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 13733 - 13738
  • [8] Fault-Tolerant Formation Control for Heterogeneous Vehicles Via Reinforcement Learning
    Zhao, Wanbing
    Liu, Hao
    Valavanis, Kimon P.
    Lewis, Frank L.
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (04) : 2796 - 2806
  • [9] Deep reinforcement learning for fault-tolerant workflow scheduling in cloud environment
    Tingting Dong
    Fei Xue
    Hengliang Tang
    Chuangbai Xiao
    [J]. Applied Intelligence, 2023, 53 : 9916 - 9932
  • [10] Deep reinforcement learning for fault-tolerant workflow scheduling in cloud environment
    Dong, Tingting
    Xue, Fei
    Tang, Hengliang
    Xiao, Chuangbai
    [J]. APPLIED INTELLIGENCE, 2023, 53 (09) : 9916 - 9932