Diffusion Policies as Multi-Agent Reinforcement Learning Strategies

Cited by: 0
Authors
Geng, Jinkun [1]
Liang, Xiubo [1]
Wang, Hongzhi [1]
Zhao, Yu [1]
Affiliations
[1] Zhejiang Univ, Sch Software Technol, Ningbo, Peoples R China
Keywords
Multi-agent reinforcement learning; Diffusion model; Offline reinforcement learning
DOI
10.1007/978-3-031-44213-1_30
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the realm of multi-agent systems, the application of reinforcement learning algorithms frequently confronts distinct challenges rooted in the non-stationarity and intricate nature of the environment. This paper presents an innovative methodology, denoted as Multi-Agent Diffuser (MA-Diffuser), which leverages diffusion models to encapsulate policies within a multi-agent context, thereby fostering efficient and expressive inter-agent coordination. Our methodology embeds the action-value maximization within the sampling process of the conditional diffusion model, thereby facilitating the detection of optimal actions closely aligned with the behavior policy. This strategy capitalizes on the expressive power of diffusion models, while simultaneously mitigating the prevalent function approximation errors often found in offline reinforcement learning environments. We have validated the efficacy of our approach within the Multi-Agent Particle Environment, and envisage its future extension to a broader range of tasks.
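The abstract's central idea, steering the sampling process of a conditional diffusion model toward high action values while staying close to the behavior policy, can be illustrated with a minimal sketch. This is not the authors' implementation: it uses a single agent with a scalar action, a gradient-guidance step chosen here for illustration, and hypothetical stand-ins (`toy_noise_predictor` for a trained denoiser, `toy_q_gradient` for a learned critic, plus the schedule values and `guidance_scale`), none of which come from the paper.

```python
import math
import random


def make_schedule(num_steps=100, beta_start=1e-4, beta_end=0.1):
    """Linear noise schedule; returns per-step betas and cumulative alpha products."""
    betas = [beta_start + (beta_end - beta_start) * t / (num_steps - 1)
             for t in range(num_steps)]
    alpha_bars, prod = [], 1.0
    for beta in betas:
        prod *= 1.0 - beta
        alpha_bars.append(prod)
    return betas, alpha_bars


def toy_noise_predictor(a_t, state, alpha_bar, behavior_std=0.2):
    """Stand-in for a trained conditional denoiser (ASSUMPTION, not from the paper).

    Returns the exact E[noise | a_t] when the behavior policy is
    N(state, behavior_std^2), so no training is needed for the sketch.
    """
    mean_t = math.sqrt(alpha_bar) * state
    var_t = alpha_bar * behavior_std ** 2 + (1.0 - alpha_bar)
    return math.sqrt(1.0 - alpha_bar) * (a_t - mean_t) / var_t


def toy_q_gradient(state, action, offset=0.5):
    """Action-gradient of a toy critic Q(s, a) = -(a - (s + offset))^2 (ASSUMPTION)."""
    return -2.0 * (action - (state + offset))


def sample_actions(state, guidance_scale, num_samples=500, seed=0):
    """Reverse diffusion from pure noise, nudging each step's mean uphill in Q."""
    rng = random.Random(seed)
    betas, alpha_bars = make_schedule()
    actions = []
    for _ in range(num_samples):
        a = rng.gauss(0.0, 1.0)  # start from standard Gaussian noise
        for t in reversed(range(len(betas))):
            eps_hat = toy_noise_predictor(a, state, alpha_bars[t])
            # Standard DDPM posterior mean for the denoising step.
            mean = (a - betas[t] / math.sqrt(1.0 - alpha_bars[t]) * eps_hat) \
                / math.sqrt(1.0 - betas[t])
            # Value guidance: shift the mean along dQ/da, scaled by the step variance.
            mean += guidance_scale * betas[t] * toy_q_gradient(state, mean)
            noise = rng.gauss(0.0, 1.0) if t > 0 else 0.0  # no noise on the last step
            a = mean + math.sqrt(betas[t]) * noise
        actions.append(a)
    return actions


if __name__ == "__main__":
    for scale in (0.0, 1.0):
        samples = sample_actions(state=0.0, guidance_scale=scale)
        print(scale, sum(samples) / len(samples))
```

With `guidance_scale=0.0` the sampler simply draws from the toy behavior policy; a positive scale moves the sampled actions toward the critic's preferred action, while the denoiser keeps pulling them back toward the behavior data. That tension is the trade-off the abstract describes.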
Pages: 356-364
Page count: 9
Related papers
50 results in total
  • [1] Scalable Reinforcement Learning Policies for Multi-Agent Control
    Hsu, Christopher D.
    Jeong, Heejin
    Pappas, George J.
    Chaudhari, Pratik
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4785 - 4791
  • [2] Learning competitive pricing strategies by multi-agent reinforcement learning
    Kutschinski, E
    Uthmann, T
    Polani, D
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2003, 27 (11-12): : 2207 - 2218
  • [3] Multi-agent Reinforcement Learning using strategies and voting
    Partalas, Ioannis
    Feneris, Ioannis
    Vlahavas, Ioannis
    19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL II, PROCEEDINGS, 2007, : 318 - 324
  • [4] Emergence of chemotactic strategies with multi-agent reinforcement learning
    Tovey, Samuel
    Lohrmann, Christoph
    Holm, Christian
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (03):
  • [5] Multi-Agent Deep Reinforcement Learning with Human Strategies
    Thanh Nguyen
    Ngoc Duy Nguyen
    Nahavandi, Saeid
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2019, : 1357 - 1362
  • [6] Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning
    Zimmer, Matthieu
    Glanois, Claire
    Siddique, Umer
    Weng, Paul
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [7] Reinforcement Learning with Quantitative Verification for Assured Multi-Agent Policies
    Riley, Joshua
    Calinescu, Radu
    Paterson, Colin
    Kudenko, Daniel
    Banks, Alec
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 237 - 245
  • [8] Towards Interpretable Policies in Multi-agent Reinforcement Learning Tasks
    Crespi, Marco
    Custode, Leonardo Lucio
    Iacca, Giovanni
    BIOINSPIRED OPTIMIZATION METHODS AND THEIR APPLICATIONS, 2022, 13627 : 262 - 276
  • [9] Multi-agent reinforcement learning based on policies of global objective
    张化祥
    黄上腾
    Journal of Systems Engineering and Electronics, 2005, (03) : 676 - 681
  • [10] Learning Distinct Strategies for Heterogeneous Cooperative Multi-agent Reinforcement Learning
    Wan, Kejia
    Xu, Xinhai
    Li, Yuan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 544 - 555