Multi-Agent Q-Learning with Joint State Value Approximation

被引:0
|
作者
Chen Gang [1 ]
Cao Weihua [1 ]
Chen Xin [1 ]
Wu Min [1 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Peoples R China
关键词
Multi-agent system; Q-learning; Cooperative systems; Curse of dimensionality; Decomposition;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper relieves the "curse of dimensionality" problem, which becomes intractable when scaling reinforcement learning to multi-agent systems. This problem is aggravated exponentially as the number of agents increases, resulting in large memory requirement and slowness in learning speed. For cooperative systems which are widely existed in multi-agent systems, this paper proposes a new multi-agent Q-learning algorithm based on the decomposing the joint state and joint action learning into two learning processes, which are learning individual action and the maximum value of the joint state approximately. The latter process considers others' actions to insure the joint action is optimal and supports the updating of the former one. The simulation results illustrate that the proposed algorithm can learn the optimal joint behavior with smaller memory and faster speed comparing with Friend-Q learning.
引用
收藏
页码:4878 / 4882
页数:5
相关论文
共 50 条
  • [1] Cooperative learning with joint state value approximation for multi-agent systems
    Xin CHEN
    Gang CHEN
    Weihua CAO
    Min WU
    [J]. Control Theory and Technology, 2013, 11 (02) : 149 - 155
  • [2] Cooperative learning with joint state value approximation for multi-agent systems
    Chen X.
    Chen G.
    Cao W.
    Wu M.
    [J]. Journal of Control Theory and Applications, 2013, 11 (2): : 149 - 155
  • [3] DVF:Multi-agent Q-learning with difference value factorization
    Huang, Anqi
    Wang, Yongli
    Sang, Jianghui
    Wang, Xiaoli
    Wang, Yupeng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 286
  • [4] Q-learning in Multi-Agent Cooperation
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Lin, Tzung-Feng
    [J]. 2008 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS, 2008, : 239 - 244
  • [5] Multi-Agent Advisor Q-Learning
    Subramanian, Sriram Ganapathi
    Taylor, Matthew E.
    Larson, Kate
    Crowley, Mark
    [J]. Journal of Artificial Intelligence Research, 2022, 74 : 1 - 74
  • [6] Multi-Agent Advisor Q-Learning
    Subramanian, Sriram Ganapathi
    Taylor, Matthew E.
    Larson, Kate
    Crowley, Mark
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6884 - 6889
  • [7] Multi-Agent Advisor Q-Learning
    Subramanian, Sriram Ganapathi
    Taylor, Matthew E.
    Larson, Kate
    Crowley, Mark
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1 - 74
  • [8] Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization
    Wang, Jianhao
    Ren, Zhizhou
    Han, Beining
    Ye, Jianing
    Zhang, Chongjie
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [9] Multi-agent dueling Q-learning with mean field and value decomposition
    Ding, Shifei
    Du, Wei
    Ding, Ling
    Guo, Lili
    Zhang, Jian
    An, Bo
    [J]. PATTERN RECOGNITION, 2023, 139
  • [10] Continuous Q-Learning for Multi-Agent Cooperation
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Lin, Yu-Hong
    Lai, Li-Hsin
    [J]. CYBERNETICS AND SYSTEMS, 2012, 43 (03) : 227 - 256