Communication-Efficient and Resilient Distributed Q-Learning

被引：2

作者：

Xie, Yijing ^{[1
]}

Mou, Shaoshuai ^{[2
]}

Sundaram, Shreyas ^{[3
]}

机构：

[1] Univ Texas Arlington, Dept Elect Engn, Arlington, TX 76019 USA

[2] Purdue Univ, Sch Aeronaut & Astronaut, W Lafayette, IN 47907 USA

[3] Purdue Univ, Elmore Family Sch Elect & Comp Engn, W Lafayette, IN 47907 USA

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 03期

基金：

美国国家科学基金会;

关键词：

Event-triggered communication; multiagent systems; reinforcement learning; resilience; CONSENSUS;

D O I：

10.1109/TNNLS.2023.3292036

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This article investigates the problem of communication-efficient and resilient multiagent reinforcement learning (MARL). Specifically, we consider a setting where a set of agents are interconnected over a given network, and can only exchange information with their neighbors. Each agent observes a common Markov Decision Process and has a local cost which is a function of the current system state and the applied control action. The goal of MARL is for all agents to learn a policy that optimizes the infinite horizon discounted average of all their costs. Within this general setting, we consider two extensions to existing MARL algorithms. First, we provide an event-triggered learning rule where agents only exchange information with their neighbors if a certain triggering condition is satisfied. We show that this enables learning while reducing the amount of communication. Next, we consider the scenario where some of the agents can be adversarial (as captured by the Byzantine attack model), and arbitrarily deviate from the prescribed learning algorithm. We establish a fundamental trade-off between optimality and resilience when Byzantine agents are present. We then create a resilient algorithm and show almost sure convergence of all reliable agents' value functions to the neighborhood of the optimal value function of all reliable agents, under certain conditions on the network topology. When the optimal Q-values are sufficiently separated for different actions, we show that all reliable agents can learn the optimal policy under our algorithm.

引用

页码：3351 / 3364

页数：14

共 50 条

[1] Communication-Efficient Multi-Robot Exploration Using Coverage-Biased Distributed Q-Learning
Latif, Ehsan
Parasuraman, Ramviyas
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (03) : 2622 - 2629
[2] Communication-Efficient Distributed Learning: An Overview
Cao, Xuanyu
Basar, Tamer
Diggavi, Suhas
Eldar, Yonina C.
Letaief, Khaled B.
Poor, H. Vincent
Zhang, Junshan
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (04) : 851 - 873
[3] More communication-efficient distributed sparse learning
Zhou, Xingcai
Yang, Guang
[J]. INFORMATION SCIENCES, 2024, 668
[4] More communication-efficient distributed sparse learning
Zhou, Xingcai
Yang, Guang
[J]. Information Sciences, 2024, 668
[5] Communication-Efficient Distributed Learning of Discrete Probability Distributions
Diakonikolas, Ilias
Grigorescu, Elena
Li, Jerry
Natarajan, Abhiram
Onak, Krzysztof
Schmidt, Ludwig
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[6] Local Stochastic ADMM for Communication-Efficient Distributed Learning
ben Issaid, Chaouki
Elgabli, Anis
Bennis, Mehdi
[J]. 2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 1880 - 1885
[7] Communication-Efficient Distributed Cooperative Learning With Compressed Beliefs
Toghani, Mohammad Taha
Uribe, Cesar A.
[J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (03): : 1215 - 1226
[8] Ordered Gradient Approach for Communication-Efficient Distributed Learning
Chen, Yicheng
Sadler, Brian M.
Blum, Rick S.
[J]. PROCEEDINGS OF THE 21ST IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (IEEE SPAWC2020), 2020,
[9] Communication-Efficient and Privacy-Aware Distributed Learning
Gogineni, Vinay Chakravarthi
Moradi, Ashkan
Venkategowda, Naveen K. D.
Werner, Stefan
[J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 705 - 720
[10] Communication-efficient Distributed Learning for Large Batch Optimization
Liu, Rui
Mozafari, Barzan
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,

← 1 2 3 4 5 →