A review of research on reinforcement learning algorithms for multi-agents

被引:2
|
作者
Hu, Kai [1 ,2 ]
Li, Mingyang [1 ]
Song, Zhiqiang [3 ]
Xu, Keer [1 ]
Xia, Qingfeng [3 ]
Sun, Ning [3 ]
Zhou, Peng [1 ]
Xia, Min [1 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Automat, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, CICAEET, Nanjing 210044, Peoples R China
[3] Wuxi Univ, Sch Automat, Wuxi 214000, Peoples R China
关键词
Agent; Reinforcement learning; Multi-agent reinforcement learning; Multi-agent systems; TIME NONLINEAR-SYSTEMS; RESOURCE-ALLOCATION; GAMES; LEVEL; SARSA; CHALLENGES; ITERATION;
D O I
10.1016/j.neucom.2024.128068
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, multi-agent reinforcement learning techniques have been widely used and evolved in the field of artificial intelligence. However, traditional reinforcement learning methods have limitations such as long training time, large sample data requirements, and highly delayed rewards. Therefore, this paper systematically and specifically studies the MARL algorithm. Firstly, this paper uses Citespace software to visually analyze the existing literature on multi-agent reinforcement learning and briefly indicates the research hotspots and key research directions in this field. Secondly, the applications of traditional reinforcement learning algorithms under two task objects, namely single -agent and multi-agent systems, are described in detail. Then, the paper highlights the diverse applications, challenges, and corresponding solutions of MARL algorithmic techniques in the field of MAS. Finally, the paper points out future research directions based on the existing limitations of the algorithm. Through this paper, readers will gain a systematic and in-depth understanding of MARL algorithms and how they can be utilized to better address the various challenges posed by MAS.
引用
收藏
页数:33
相关论文
共 50 条
  • [1] Optimization algorithm using multi-agents and reinforcement learning
    Kobayashi, Y
    Aiyoshi, E
    [J]. CEC2004: PROCEEDINGS OF THE 2004 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2004, : 63 - 68
  • [2] Deep Reinforcement Learning Approach for Flocking Control of Multi-agents
    Zhang, Han
    Cheng, Jin
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 5002 - 5007
  • [3] Combinatorial optimization algorithm for permutation using multi-agents and reinforcement learning
    Kobayashi, Y
    Aiyoshi, E
    [J]. SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003, : 2916 - 2920
  • [4] Ambient Lighting Controller Based on Reinforcement Learning Components of Multi-Agents
    Bielskis, A. A.
    Guseinoviene, E.
    Dzemydiene, D.
    Drungilas, D.
    Gricius, G.
    [J]. ELEKTRONIKA IR ELEKTROTECHNIKA, 2012, 121 (05) : 79 - 84
  • [5] Cooperative Localization for Multi-Agents Based on Reinforcement Learning Compensated Filter
    Wang, Ran
    Xu, Cheng
    Sun, Jing
    Duan, Shihong
    Zhang, Xiaotong
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2024, 42 (10) : 2820 - 2831
  • [6] Toward competitive multi-agents in Polo game based on reinforcement learning
    Zahra Movahedi
    Azam Bastanfard
    [J]. Multimedia Tools and Applications, 2021, 80 : 26773 - 26793
  • [7] Toward competitive multi-agents in Polo game based on reinforcement learning
    Movahedi, Zahra
    Bastanfard, Azam
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 26773 - 26793
  • [8] Continuous-time path planning for multi-agents with fuzzy reinforcement learning
    Luviano, David
    Yu, Wen
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 33 (01) : 491 - 501
  • [9] Research on intelligent web-learning based on multi-agents
    Feng, Naiqin
    Dong, Yajie
    Zhang, Aili
    Guo, Zhanjie
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2007, 4681 : 190 - +
  • [10] Robust Collaborative Learning by Multi-Agents
    Balasingam, B.
    Pattipati, K.
    Levchuck, G.
    Romano, J. C.
    [J]. 2015 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR SECURITY AND DEFENSE APPLICATIONS (CISDA), 2015, : 183 - 187