Cooperative learning with joint state value approximation for multi-agent systems

被引:4
|
作者
Chen X. [1 ]
Chen G. [1 ]
Cao W. [1 ]
Wu M. [1 ]
机构
[1] School of Information Science and Engineering, Central South University, Changsha Hunan
来源
基金
中国博士后科学基金; 高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
Cooperative system; Curse of dimensionality; Decomposition; Multi-agent system; Q-learning;
D O I
10.1007/s11768-013-1141-z
中图分类号
学科分类号
摘要
This paper relieves the 'curse of dimensionality' problem, which becomes intractable when scaling reinforcement learning to multi-agent systems. This problem is aggravated exponentially as the number of agents increases, resulting in large memory requirement and slowness in learning speed. For cooperative systems which widely exist in multi-agent systems, this paper proposes a new multi-agent Q-learning algorithm based on decomposing the joint state and joint action learning into two learning processes, which are learning individual action and the maximum value of the joint state approximately. The latter process considers others' actions to insure that the joint action is optimal and supports the updating of the former one. The simulation results illustrate that the proposed algorithm can learn the optimal joint behavior with smaller memory and faster learning speed compared with friend-Q learning and independent learning. © 2013 South China University of Technology, Academy of Mathematics and Systems Science, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:149 / 155
页数:6
相关论文
共 50 条
  • [1] Cooperative learning with joint state value approximation for multi-agent systems
    Xin CHEN
    Gang CHEN
    Weihua CAO
    Min WU
    [J]. Control Theory and Technology, 2013, 11 (02) : 149 - 155
  • [2] Multi-Agent Q-Learning with Joint State Value Approximation
    Chen Gang
    Cao Weihua
    Chen Xin
    Wu Min
    [J]. 2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4878 - 4882
  • [3] Cooperative Multi-Agent Learning: The State of the Art
    Liviu Panait
    Sean Luke
    [J]. Autonomous Agents and Multi-Agent Systems, 2005, 11 : 387 - 434
  • [4] Cooperative multi-agent learning: The state of the art
    Panait, L
    Luke, S
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2005, 11 (03) : 387 - 434
  • [5] Cooperative Learning Based on Multi-Agent Systems
    Cheng Xian-yi
    Qiu Jian-lin
    Liu Ying
    [J]. THIRD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING: WKDD 2010, PROCEEDINGS, 2010, : 455 - 457
  • [6] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    [J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
  • [7] Baselines for joint-action reinforcement learning of coordination in cooperative multi-agent systems
    Carpenter, M
    Kudenko, D
    [J]. ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS II: ADAPTATION AND MULTI-AGENT LEARNING, 2005, 3394 : 55 - 72
  • [8] Distributed learning and cooperative control for multi-agent systems
    Choi, Jongeun
    Oh, Songhwai
    Horowitz, Roberto
    [J]. AUTOMATICA, 2009, 45 (12) : 2802 - 2814
  • [9] Learning Cooperative Behaviours in Adversarial Multi-agent Systems
    Wang, Ni
    Das, Gautham P.
    Millard, Alan G.
    [J]. TOWARDS AUTONOMOUS ROBOTIC SYSTEMS, TAROS 2022, 2022, 13546 : 179 - 189
  • [10] Reinforcement learning of coordination in cooperative multi-agent systems
    Kapetanakis, S
    Kudenko, D
    [J]. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 326 - 331