Bias Estimation Correction in Multi-Agent Reinforcement Learning for Mixed Cooperative-Competitive Environments

被引:0
|
作者
Sarkar T. [1 ,2 ]
Kalita S. [1 ]
机构
[1] Department of Computer Science and Engineering, Tezpur University, Assam, Tezpur
[2] Department of MCA, MS Ramaiah Institute of Technology, Karnataka, Bengaluru
关键词
Bias estimation correction; MADDPG; MARL; MATD3; Weighted critic update;
D O I
10.1007/s42979-023-02326-7
中图分类号
学科分类号
摘要
Multi-agent reinforcement learning (MARL) is a domain that is being actively researched in the current times. The ability of MARL algorithms in finding promising solutions to problems while having limited prior knowledge of the environment has found application in traffic control, unmanned vehicles, routing, and more. Despite the algorithmic advancements over the years, the issue of bias estimation still persists in MARL methods. Multi-agent twin delayed deep deterministic (MATD3) method solves for bias overestimation problem that arises in Multi-agent deep deterministic policy gradient (MADDPG). MATD3 does so by using a double-critic architecture. MATD3 calculates the target values and updates the critic networks by taking a minimum of the Q-values estimated by the target Q-networks. The method solves the overestimation problem but in the process introduces an underestimation. A method is proposed in this paper to correct for both bias over-estimation and under-estimation problem by incorporating a triple critic architecture. The target values are then calculated using a weighted sum of the minimum and the median of the target Q-values. The proposed method is tested in the multi-agent particle environment (MPE). Experimental results show that our method, apart from correcting for the bias estimation, outperforms MATD3 and MADDPG methods in terms of overall returns. We also perform experiments to test the effect of a weighted critic update on our proposed method and have found propitious results. © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [31] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
  • [32] Cooperative Multi-Agent Reinforcement Learning With Approximate Model Learning
    Park, Young Joon
    Lee, Young Jae
    Kim, Seoung Bum
    IEEE ACCESS, 2020, 8 : 125389 - 125400
  • [33] Multi-agent cooperative learning research based on reinforcement learning
    Liu, Fei
    Zeng, Guangzhou
    2006 10TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, PROCEEDINGS, VOLS 1 AND 2, 2006, : 1408 - 1413
  • [34] Multi-agent Cooperative Search based on Reinforcement Learning
    Sun, Yinjiang
    Zhang, Rui
    Liang, Wenbao
    Xu, Cheng
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 891 - 896
  • [35] Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution
    Bai, Yunpeng
    Gong, Chen
    Zhang, Bin
    Fan, Guoliang
    Hou, Xinwen
    Lu, Yu
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [36] A review of cooperative multi-agent deep reinforcement learning
    Oroojlooy, Afshin
    Hajinezhad, Davood
    APPLIED INTELLIGENCE, 2023, 53 (11) : 13677 - 13722
  • [37] Cooperative Multi-agent Reinforcement Learning for Inventory Management
    Khirwar, Madhav
    Gurumoorthy, Karthik S.
    Jain, Ankit Ajit
    Manchenahally, Shantala
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 619 - 634
  • [38] The Cooperative Reinforcement Learning in a Multi-Agent Design System
    Liu, Hong
    Wang, Jihua
    PROCEEDINGS OF THE 2013 IEEE 17TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2013, : 139 - 144
  • [39] Cooperative Multi-Agent Reinforcement Learning in Express System
    Li, Yexin
    Zheng, Yu
    Yang, Qiang
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 805 - 814
  • [40] A review of cooperative multi-agent deep reinforcement learning
    Afshin Oroojlooy
    Davood Hajinezhad
    Applied Intelligence, 2023, 53 : 13677 - 13722