Filtered Observations for Model-Based Multi-agent Reinforcement Learning

被引:0
|
作者
Meng, Linghui [1 ,2 ]
Xiong, Xuantang [1 ,2 ]
Zang, Yifan [1 ,2 ]
Zhang, Xi [1 ]
Li, Guoqi [1 ,2 ]
Xing, Dengpeng [1 ,2 ]
Xu, Bo [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
Model-based planning; Multi-agent reinforcement learning; Generative models;
D O I
10.1007/978-3-031-43421-1_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) pursues high sample efficiency in practical environments to avoid costly interactions. Learning to plan with a world model in a compact latent space for policy optimization significantly improves sample efficiency in single-agent RL. Although world model construction methods for single-agent can be naturally extended, existing multi-agent schemes fail to acquire world models effectively as redundant information increases rapidly with the number of agents. To address this issue, we in this paper leverage guided diffusion to filter this noisy information, which harms teamwork. Obtained purified global states are then used to build a unified world model. Based on the learned world model, we denoise each agent observation and plan for multi-agent policy optimization, facilitating efficient cooperation. We name our method UTOPIA, a model-based method for cooperative multi-agent reinforcement learning (MARL). Compared to strong model-free and model-based baselines, our method shows enhanced sample efficiency in various testbeds, including the challenging StarCraft Multi-Agent Challenge tasks.
引用
收藏
页码:540 / 555
页数:16
相关论文
共 50 条
  • [1] Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
    Xu, Zhiwei
    Li, Dapeng
    Zhang, Bin
    Zhan, Yuan
    Bai, Yunpeng
    Fan, Guoliang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Model-Based Reinforcement Learning using Model Mediator in Dynamic Multi-Agent Environment
    Imai S.
    Iwasawa Y.
    Matsuo Y.
    Transactions of the Japanese Society for Artificial Intelligence, 2023, 38 (05)
  • [3] Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation
    Sessa, Pier Giuseppe
    Kamgarpour, Maryam
    Krause, Andreas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19580 - 19597
  • [4] A Model-Based Reinforcement Learning Algorithm for Multi-Agent Cooperation Nash Equilibrium With Unstable Communication
    Jiang, Yuannan
    Jiang, Shengming
    Wang, Xiaofeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (11) : 4743 - 4747
  • [5] Safe multi-agent motion planning via filtered reinforcement learning
    Vinod, Abraham P.
    Safaoui, Sleiman
    Chakrabarty, Ankush
    Quirynen, Rien
    Yoshikawa, Nobuyuki
    Di Cairano, Stefano
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 7270 - 7276
  • [6] Model-Based Self-Advising for Multi-Agent Learning
    Ye, Dayong
    Zhu, Tianqing
    Zhu, Congcong
    Zhou, Wanlei
    Yu, Philip S.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7934 - 7945
  • [7] Model-based learning of interaction strategies in multi-agent systems
    Carmel, D
    Markovitch, S
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 1998, 10 (03) : 309 - 332
  • [8] Exploration Strategies for Model-based Learning in Multi-agent Systems
    Carmel D.
    Markovitch S.
    Autonomous Agents and Multi-Agent Systems, 1999, 2 (2) : 141 - 172
  • [9] MULTI-MODEL FEDERATED LEARNING OPTIMIZATION BASED ON MULTI-AGENT REINFORCEMENT LEARNING
    Atapour, S. Kaveh
    Seyedmohammadi, S. Jamal
    Sheikholeslami, S. Mohammad
    Abouei, Jamshid
    Mohammadi, Arash
    Plataniotis, Konstantinos N.
    2023 IEEE 9TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING, CAMSAP, 2023, : 151 - 155
  • [10] Reinforcement learning based on multi-agent in RoboCup
    Zhang, W
    Li, JG
    Ruan, XG
    ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 967 - 975