Credit assignment in heterogeneous multi-agent reinforcement learning for fully cooperative tasks

被引:0
|
作者
Kun Jiang
Wenzhang Liu
Yuanda Wang
Lu Dong
Changyin Sun
机构
[1] Southeast University,School of Automation
[2] Peng Cheng Laboratory,School of Artificial Intelligence
[3] Anhui University,School of Cyber Science and Engineering
[4] Southeast University,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Credit assignment; Multi-agent reinforcement learning; Reward decomposition; Heterogeneous agents;
D O I
暂无
中图分类号
学科分类号
摘要
Credit assignment poses a significant challenge in heterogeneous multi-agent reinforcement learning (MARL) when tackling fully cooperative tasks. Existing MARL methods assess the contribution of each agent through value decomposition or agent-wise critic networks. However, value decomposition techniques are not directly applicable to control problems with continuous action spaces. Additionally, agent-wise critic networks struggle to differentiate the distinct contributions from the shared team reward. Moreover, most of these methods assume agent homogeneity, which limits their utility in more diverse scenarios. To address these limitations, we present a novel algorithm that factorizes and reshapes the team reward into agent-wise rewards, enabling the evaluation of the diverse contributions of heterogeneous agents. Specifically, we devise agent-wise local critics that leverage both the team reward and the factorized reward, alongside a global critic for assessing the joint policy. By accounting for the contribution differences resulting from agent heterogeneity, we introduce a power balance constraint that ensures a fairer measurement of each heterogeneous agent’s contribution, ultimately promoting energy efficiency. Finally, we optimize the policies of all agents using deterministic policy gradients. The effectiveness of our proposed algorithm has been validated through simulation experiments conducted in fully cooperative and heterogeneous multi-agent tasks.
引用
收藏
页码:29205 / 29222
页数:17
相关论文
共 50 条
  • [41] Levels of Realism for Cooperative Multi-agent Reinforcement Learning
    Cunningham, Bryan
    Cao, Yong
    [J]. ADVANCES IN SWARM INTELLIGENCE, ICSI 2012, PT I, 2012, 7331 : 573 - 582
  • [42] Centralized reinforcement learning for multi-agent cooperative environments
    Chengxuan Lu
    Qihao Bao
    Shaojie Xia
    Chongxiao Qu
    [J]. Evolutionary Intelligence, 2024, 17 : 267 - 273
  • [43] Centralized reinforcement learning for multi-agent cooperative environments
    Lu, Chengxuan
    Bao, Qihao
    Xia, Shaojie
    Qu, Chongxiao
    [J]. EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 267 - 273
  • [44] Cooperative multi-agent game based on reinforcement learning
    Liu, Hongbo
    [J]. HIGH-CONFIDENCE COMPUTING, 2024, 4 (01):
  • [45] Reinforcement learning of coordination in cooperative multi-agent systems
    Kapetanakis, S
    Kudenko, D
    [J]. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 326 - 331
  • [46] Training Cooperative Agents for Multi-Agent Reinforcement Learning
    Bhalla, Sushrut
    Subramanian, Sriram G.
    Crowley, Mark
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1826 - 1828
  • [47] Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
    Liu, Iou-Jen
    Jain, Unnat
    Yeh, Raymond A.
    Schwing, Alexander G.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [48] Evaluating Multi-Agent Reinforcement Learning on Heterogeneous Platforms
    Wiggins, Samuel
    Meng, Yuan
    Kannan, Rajgopal
    Prasanna, Viktor
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [49] Cooperative output regulation of heterogeneous directed multi-agent systems:a fully distributed model-free reinforcement learning framework
    Xiongtao SHI
    Yanjie LI
    Chenglong DU
    Huiping LI
    Chaoyang CHEN
    Weihua GUI
    [J]. Science China(Information Sciences), 2025, 68 (02) : 170 - 185
  • [50] Credit-of-Q-value for Multi-Agent Reinforcement Learning
    Li, Shuaibin
    Li, Xiu
    Cui, Jinqiang
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6982 - 6988