Experimental Evaluation of a Method for Simulation based Learning for a Multi-Agent System Acting in a Physical Environment

被引:0
|
作者
Qian, Kun [1 ]
Brehm, Robert W. [1 ]
Duggen, Lars [1 ]
机构
[1] Univ Southern Denmark, Mads Clausen Inst, SDU Mechatron, Odense, Denmark
关键词
Cooperative Multi-Agent Systems; Multi-Agent Reinforcement Learning; Multi-Agent Actor-Critic; Cooperative Navigation; Simulation Based Learning;
D O I
10.5220/0007250301030109
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A method for simulation based reinforcement learning (RL) for a multi-agent system acting in a physical environment is introduced, which is based on Multi-Agent Actor-Critic (MAAC) reinforcement learning. In the proposed method, avatar agents learn in a simulated model of the physical environment and the learned experience is then used by agents in the actual physical environment. The proposed concept is verified using a laboratory benchmark setup in which multiple agents, acting within the same environment, are required to coordinate their movement actions to prevent collisions. Three state-of-the-art algorithms for multi-agent reinforcement learning (MARL) are evaluated, with respect to their applicability for a predefined benchmark scenario. Based on simulations it is shown that the MAAC method is most applicable for implementation as it provides effective distributed learning and suits well to the concept of learning in simulated environments. Our experimental results, which compare simulated learning and task execution in a simulated environment with that of task execution in a physical environment demonstrate the feasibility of the proposed concept.
引用
收藏
页码:103 / 109
页数:7
相关论文
共 50 条
  • [1] Behavior modeling based on multi-agent and multi-agent simulation environment
    Yin, QJ
    Du, XY
    Huang, K
    SYSTEM SIMULATION AND SCIENTIFIC COMPUTING, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1531 - 1536
  • [2] Simulation and multi-agent environment for aircraft maintenance learning
    Gouardères, G
    Minko, A
    Richard, L
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, APPLICATIONS, PROCEEDINGS, 2000, 1904 : 152 - 166
  • [3] Opponent learning for multi-agent system simulation
    Wu, Ji
    Ye, Chaoqun
    Jin, Shiyao
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2006, 4062 : 643 - 650
  • [4] Meteorological Environment Modeling and Simulation Technology based on Multi-agent System
    Li, Wen-juan
    Chen, Hui-xian
    Ding, Han
    Zhou, Jing-wen
    Gao, Xing-rong
    Li, Ceng
    2015 FIFTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2015, : 1800 - 1802
  • [5] Constructing adaptive individual learning environment based on multi-agent system
    Chen, Peng
    Meng, Anbo
    Zhao, Chunhua
    CIS WORKSHOPS 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY WORKSHOPS, 2007, : 374 - +
  • [6] Software Environment for Simulation of UAV Multi-Agent System
    Obdrzalek, Zbynek
    2016 21ST INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2016, : 720 - 725
  • [7] Research on multi-agent simulation environment based on HLA
    Zhuang, Yan
    Zhang, Zhi-Xiang
    Cheng, Jian-Ming
    Du, Hui
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 154 - +
  • [8] A Method of Trust Evaluation in Multi-agent System
    Guan Chun
    Li Guihan
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 123 - 125
  • [9] Behavior acquisition based on multi-module learning system in multi-agent environment
    Takahashi, Y
    Edazawa, K
    Asada, M
    ROBOCUP 2002: ROBOT SOCCER WORLD CUP VI, 2003, 2752 : 435 - 442
  • [10] An experimental evaluation of communication in an Organization-based Multi-Agent System
    Bouslimi, Issam
    Hanachi, Chihab
    Ghedira, Khaled
    2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2014, : 72 - 78