Cooperative learning with joint state value approximation for multi-agent systems

被引：4

作者：

Chen X. ^{[1
]}

Chen G. ^{[1
]}

Cao W. ^{[1
]}

Wu M. ^{[1
]}

机构：

[1] School of Information Science and Engineering, Central South University, Changsha Hunan

来源：

Journal of Control Theory and Applications | 2013年 / 11卷 / 2期

基金：

中国博士后科学基金; 高等学校博士学科点专项科研基金; 中国国家自然科学基金;

关键词：

Cooperative system; Curse of dimensionality; Decomposition; Multi-agent system; Q-learning;

D O I：

10.1007/s11768-013-1141-z

中图分类号：

学科分类号：

摘要：

This paper relieves the 'curse of dimensionality' problem, which becomes intractable when scaling reinforcement learning to multi-agent systems. This problem is aggravated exponentially as the number of agents increases, resulting in large memory requirement and slowness in learning speed. For cooperative systems which widely exist in multi-agent systems, this paper proposes a new multi-agent Q-learning algorithm based on decomposing the joint state and joint action learning into two learning processes, which are learning individual action and the maximum value of the joint state approximately. The latter process considers others' actions to insure that the joint action is optimal and supports the updating of the former one. The simulation results illustrate that the proposed algorithm can learn the optimal joint behavior with smaller memory and faster learning speed compared with friend-Q learning and independent learning. © 2013 South China University of Technology, Academy of Mathematics and Systems Science, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.

引用

页码：149 / 155

页数：6

共 50 条

[1] Cooperative learning with joint state value approximation for multi-agent systems
Xin CHEN
Gang CHEN
Weihua CAO
Min WU
[J]. Control Theory and Technology, 2013, 11 (02) : 149 - 155
[2] Multi-Agent Q-Learning with Joint State Value Approximation
Chen Gang
Cao Weihua
Chen Xin
Wu Min
[J]. 2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4878 - 4882
[3] Cooperative Multi-Agent Learning: The State of the Art
Liviu Panait
Sean Luke
[J]. Autonomous Agents and Multi-Agent Systems, 2005, 11 : 387 - 434
[4] Cooperative multi-agent learning: The state of the art
Panait, L
Luke, S
[J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2005, 11 (03) : 387 - 434
[5] Cooperative Learning Based on Multi-Agent Systems
Cheng Xian-yi
Qiu Jian-lin
Liu Ying
[J]. THIRD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING: WKDD 2010, PROCEEDINGS, 2010, : 455 - 457
[6] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
Wang, Xin
Zhao, Chen
Huang, Tingwen
Chakrabarti, Prasun
Kurths, Juergen
[J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
[7] Baselines for joint-action reinforcement learning of coordination in cooperative multi-agent systems
Carpenter, M
Kudenko, D
[J]. ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS II: ADAPTATION AND MULTI-AGENT LEARNING, 2005, 3394 : 55 - 72
[8] Distributed learning and cooperative control for multi-agent systems
Choi, Jongeun
Oh, Songhwai
Horowitz, Roberto
[J]. AUTOMATICA, 2009, 45 (12) : 2802 - 2814
[9] Learning Cooperative Behaviours in Adversarial Multi-agent Systems
Wang, Ni
Das, Gautham P.
Millard, Alan G.
[J]. TOWARDS AUTONOMOUS ROBOTIC SYSTEMS, TAROS 2022, 2022, 13546 : 179 - 189
[10] Reinforcement learning of coordination in cooperative multi-agent systems
Kapetanakis, S
Kudenko, D
[J]. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 326 - 331

← 1 2 3 4 5 →