Collaborative multi-agent reinforcement learning based on experience propagation

被引:0
|
作者
Min Fang [1 ]
Frans C.A. Groen [2 ]
机构
[1] School of Computer Science and Technology, Xidian University
[2] Informatics Institute, University of Amsterdam
基金
中国国家自然科学基金;
关键词
multi-agent; Q learning; state list extracting; experience sharing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For multi-agent reinforcement learning in Markov games, knowledge extraction and sharing are key research problems. State list extracting means to calculate the optimal shared state path from state trajectories with cycles. A state list extracting algorithm checks cyclic state lists of a current state in the state trajectory, condensing the optimal action set of the current state. By reinforcing the optimal action selected, the action policy of cyclic states is optimized gradually. The state list extracting is repeatedly learned and used as the experience knowledge which is shared by teams. Agents speed up the rate of convergence by experience sharing. Competition games of preys and predators are used for the experiments. The results of experiments prove that the proposed algorithms overcome the lack of experience in the initial stage, speed up learning and improve the performance.
引用
收藏
页码:683 / 689
页数:7
相关论文
共 50 条
  • [1] Collaborative multi-agent reinforcement learning based on experience propagation
    Fang, Min
    Groen, Frans C. A.
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2013, 24 (04) : 683 - 689
  • [2] Constraint-based multi-agent reinforcement learning for collaborative tasks
    Shang, Xiumin
    Xu, Tengyu
    Karamouzas, Ioannis
    Kallmann, Marcelo
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2023, 34 (3-4)
  • [3] Multi-agent Collaborative Fire Rescue Based on Deep Reinforcement Learning
    Feng, Yiming
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 1317 - 1321
  • [4] Experience generalization for multi-agent reinforcement learning
    Pegoraro, R
    Costa, AHR
    Ribeiro, CHC
    [J]. SCCC 2001: XXI INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY, PROCEEDINGS, 2001, : 233 - 239
  • [5] Avoiding collaborative paradox in multi-agent reinforcement learning
    Kim, Hyunseok
    Kim, Seonghyun
    Lee, Donghun
    Jang, Ingook
    [J]. ETRI JOURNAL, 2021, 43 (06) : 1004 - 1012
  • [6] An Optimization Method for Collaborative Radar Antijamming Based on Multi-Agent Reinforcement Learning
    Feng, Cheng
    Fu, Xiongjun
    Wang, Ziyi
    Dong, Jian
    Zhao, Zhichun
    Pan, Teng
    [J]. REMOTE SENSING, 2023, 15 (11)
  • [7] Experience Selection in Multi-Agent Deep Reinforcement Learning
    Wang, Yishen
    Zhang, Zongzhang
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 864 - 870
  • [8] A Multi-agent Reinforcement Learning with Weighted Experience Sharing
    Yu, Lasheng
    Abdulai, Issahaku
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 219 - 225
  • [9] Multi-agent Reinforcement Learning for Collaborative Transportation Management (CTM)
    Okdinawati, Liane
    Simatupang, Togar M.
    Sunitiyoso, Yos
    [J]. AGENT-BASED APPROACHES IN ECONOMICS AND SOCIAL COMPLEX SYSTEMS IX, 2017, 15 : 123 - 136
  • [10] Measuring Collaborative Emergent Behavior in Multi-agent Reinforcement Learning
    Barton, Sean L.
    Waytowich, Nicholas R.
    Zaroukian, Erin
    Asher, Derrik E.
    [J]. HUMAN SYSTEMS ENGINEERING AND DESIGN, IHSED2018, 2019, 876 : 422 - 427