Multi-Agent Deep Reinforcement Learning for Walker Systems

被引:0
|
作者
Park, Inhee [1 ]
Moh, Teng-Sheng [1 ]
机构
[1] San Jose State Univ, Dept Comp Sci, San Jose, CA 95192 USA
关键词
Deep Reinforcement Learning (DRL); Proximal Policy Optimization (PPO); Multi-agent DRL (MADRL);
D O I
10.1109/ICMLA52953.2021.00082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We applied the state-of-art performance Deep Reinforcement Learning (DRL) algorithm, Proximal Policy Optimization (PPO), to the minimal robot-legs locomotion for the challenging multi-agent, continuous and high-dimensional state-space environments. The main contribution of this work is identifying the potential factors/hyperparameters and their effects on the performance of the multi-agent settings by varying the number of agents. Based on the comprehensive experiments with 2-10 multiwalkers environments, we found that 1) A minibatch size and a sampling reuse ratio (experience replay buffer size containing multiple minibatches) are critical hyperparameters to improve performance of the PPO; 2) Optimal neural network size depends on the number of walkers in the multi-agent environments; and 3) Parameter sharing among multi-agent is a better training strategy than fully independent learning in terms of comparable performance and improved efficiency with reduced parameters consuming less memory.
引用
收藏
页码:490 / 495
页数:6
相关论文
共 50 条
  • [31] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    [J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
  • [32] Resource Allocation in Multi-cell NOMA Systems with Multi-Agent Deep Reinforcement Learning
    Wang, Shichao
    Wang, Xiaoming
    Zhang, Yuhan
    Xu, Youyun
    [J]. 2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
  • [33] Multi-Agent Deep Reinforcement Learning for Uplink Power Control in Multi-Cell Systems
    Jia, Ruibao
    Liu, Liu
    Zheng, Xufei
    Yang, Yuhan
    Wang, Shaoyang
    Huang, Pingmu
    Lv, Tiejun
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2022, : 324 - 330
  • [34] Deep multi-agent reinforcement learning for multi-level preventive maintenance in manufacturing systems
    Su, Jianyu
    Huang, Jing
    Adams, Stephen
    Chang, Qing
    Beling, Peter A.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 192
  • [35] Multi-agent deep reinforcement learning (MADRL) meets multi-user MIMO systems
    Lee, Heunchul
    Jeong, Jaeseong
    [J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [36] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    [J]. 2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [37] Learning to Delay in Ride-Sourcing Systems: A Multi-Agent Deep Reinforcement Learning Framework
    Ke, Jintao
    Xiao, Feng
    Yang, Hai
    Ye, Jieping
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2280 - 2292
  • [38] Distributed reinforcement learning in multi-agent decision systems
    Giráldez, JI
    Borrajo, D
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE-IBERAMIA 98, 1998, 1484 : 148 - 159
  • [39] Shaping multi-agent systems with gradient reinforcement learning
    Olivier Buffet
    Alain Dutech
    François Charpillet
    [J]. Autonomous Agents and Multi-Agent Systems, 2007, 15 : 197 - 220
  • [40] Towards reinforcement learning for holonic multi-agent systems
    Abdoos, Monireh
    Mozayani, Nasser
    Bazzan, Ana L. C.
    [J]. INTELLIGENT DATA ANALYSIS, 2015, 19 (02) : 211 - 232