Knowledge Reuse of Multi-Agent Reinforcement Learning in Cooperative Tasks

Cited by: 2

Authors:

Shi, Daming [1]

Tong, Junbo [1]

Liu, Yi [1]

Fan, Wenhui [1]

Affiliation:

[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

Source:

ENTROPY | 2022 / Volume 24 / Issue 04

Keywords:

multi-agent; reinforcement learning; cooperative task; adding teammate; knowledge sharing; knowledge transferring;

DOI:

10.3390/e24040470

Chinese Library Classification number:

O4 [Physics];

Discipline classification code:

0702;

Abstract:

With the development and application of multi-agent systems, multi-agent cooperation is becoming an important problem in artificial intelligence. Multi-agent reinforcement learning (MARL) is one of the most effective methods for solving multi-agent cooperative tasks. However, the huge sample complexity of traditional reinforcement learning methods results in two kinds of training waste in MARL for cooperative tasks: all homogeneous agents are trained independently and repetitively, and multi-agent systems need training from scratch when adding a new teammate. To tackle these two problems, we propose knowledge reuse methods for MARL. On the one hand, this paper proposes sharing experience and policy among agents to mitigate training waste. On the other hand, this paper proposes reusing the policies learned by the original team to avoid knowledge waste when adding a new agent. Experimentally, the Pursuit task demonstrates that sharing experience and policy can accelerate training and enhance performance simultaneously. Additionally, transferring the learned policies from the N-agent team enables the (N+1)-agent team to immediately perform cooperative tasks successfully, and only a small amount of additional training is needed for the agents to reach the optimal performance obtained by training from scratch.

Pages: 15
