Multi-Agent Q-Learning with Joint State Value Approximation

被引：0

作者：

Chen Gang ^{[1
]}

Cao Weihua ^{[1
]}

Chen Xin ^{[1
]}

Wu Min ^{[1
]}

机构：

[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Peoples R China

来源：

2011 30TH CHINESE CONTROL CONFERENCE (CCC) | 2011年

关键词：

Multi-agent system; Q-learning; Cooperative systems; Curse of dimensionality; Decomposition;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper relieves the "curse of dimensionality" problem, which becomes intractable when scaling reinforcement learning to multi-agent systems. This problem is aggravated exponentially as the number of agents increases, resulting in large memory requirement and slowness in learning speed. For cooperative systems which are widely existed in multi-agent systems, this paper proposes a new multi-agent Q-learning algorithm based on the decomposing the joint state and joint action learning into two learning processes, which are learning individual action and the maximum value of the joint state approximately. The latter process considers others' actions to insure the joint action is optimal and supports the updating of the former one. The simulation results illustrate that the proposed algorithm can learn the optimal joint behavior with smaller memory and faster speed comparing with Friend-Q learning.

引用

页码：4878 / 4882

页数：5

共 50 条

[1] Cooperative learning with joint state value approximation for multi-agent systems
Xin CHEN
Gang CHEN
Weihua CAO
Min WU
[J]. Control Theory and Technology, 2013, 11 (02) : 149 - 155
[2] Cooperative learning with joint state value approximation for multi-agent systems
Chen X.
Chen G.
Cao W.
Wu M.
[J]. Journal of Control Theory and Applications, 2013, 11 (2): : 149 - 155
[3] DVF:Multi-agent Q-learning with difference value factorization
Huang, Anqi
Wang, Yongli
Sang, Jianghui
Wang, Xiaoli
Wang, Yupeng
[J]. KNOWLEDGE-BASED SYSTEMS, 2024, 286
[4] Q-learning in Multi-Agent Cooperation
Hwang, Kao-Shing
Chen, Yu-Jen
Lin, Tzung-Feng
[J]. 2008 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS, 2008, : 239 - 244
[5] Multi-Agent Advisor Q-Learning
Subramanian, Sriram Ganapathi
Taylor, Matthew E.
Larson, Kate
Crowley, Mark
[J]. Journal of Artificial Intelligence Research, 2022, 74 : 1 - 74
[6] Multi-Agent Advisor Q-Learning
Subramanian, Sriram Ganapathi
Taylor, Matthew E.
Larson, Kate
Crowley, Mark
[J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6884 - 6889
[7] Multi-Agent Advisor Q-Learning
Subramanian, Sriram Ganapathi
Taylor, Matthew E.
Larson, Kate
Crowley, Mark
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1 - 74
[8] Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization
Wang, Jianhao
Ren, Zhizhou
Han, Beining
Ye, Jianing
Zhang, Chongjie
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
[9] Multi-agent dueling Q-learning with mean field and value decomposition
Ding, Shifei
Du, Wei
Ding, Ling
Guo, Lili
Zhang, Jian
An, Bo
[J]. PATTERN RECOGNITION, 2023, 139
[10] Continuous Q-Learning for Multi-Agent Cooperation
Hwang, Kao-Shing
Jiang, Wei-Cheng
Lin, Yu-Hong
Lai, Li-Hsin
[J]. CYBERNETICS AND SYSTEMS, 2012, 43 (03) : 227 - 256

← 1 2 3 4 5 →