Mutual Deep Deterministic Policy Gradient Learning

Cited by: 0
Authors
Sun, Zhou [1]
Affiliation
[1] HongWen Sch Qingdao, Dept Sci, Qingdao, Peoples R China
Keywords
Machine Learning; Deep Reinforcement Learning; DDPG; Mutual Learning
DOI
10.1109/BDICN55575.2022.00099
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In deep reinforcement learning (DRL), policy gradient (PG) and actor-critic (AC) based methods are among the most popular and effective methods for training DRL agents. One such method is the state-of-the-art deep deterministic policy gradient (DDPG). In this research, we combine the mutual learning framework with DDPG to present a novel Mutual DDPG (MuDDPG) agent, with the aim of improving the performance and robustness of conventional DDPG. We also propose a simple additional innovation, adaptive reward-based exploration, to further improve the rate of learning. We demonstrate that, by employing these schemes, MuDDPG can converge faster and perform better than vanilla DDPG in two simple simulated tasks while adding significant robustness to the learning process.
Pages: 508 - 513
Number of pages: 6
Related papers
50 in total
  • [11] Learning a Self-driving Bicycle Using Deep Deterministic Policy Gradient
    Le, Tuyen P.
    Quang, Nguyen Dang
    Choi, SeungYoon
    Chung, TaeChoong
    2018 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2018 : 231 - 236
  • [12] Reinforcement Learning for Mobile Robot Obstacle Avoidance with Deep Deterministic Policy Gradient
    Chen, Miao
    Li, Wenna
    Fei, Shihan
    Wei, Yufei
    Tu, Mingyang
    Li, Jiangbo
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT III, 2022, 13457 : 197 - 204
  • [13] Heuristic Gait Learning of Quadruped Robot Based on Deep Deterministic Policy Gradient Algorithm
    Wang, Mingchao
    Ruan, Xiaogang
    Zhu, Xiaoqing
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020 : 1046 - 1049
  • [14] Reinforcement Learning Control with Deep Deterministic Policy Gradient Algorithm for Multivariable pH Process
    Panjapornpon, Chanin
    Chinchalongporn, Patcharapol
    Bardeeniz, Santi
    Makkayatorn, Ratthanita
    Wongpunnawat, Witchaya
    PROCESSES, 2022, 10 (12)
  • [15] Independent Deep Deterministic Policy Gradient Reinforcement Learning in Cooperative Multiagent Pursuit Games
    Zhou, Shiyang
    Ren, Weiya
    Ren, Xiaoguang
    Wang, Yanzhen
    Yi, Xiaodong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 625 - 637
  • [16] Deep Deterministic Policy Gradient to Regulate Feedback Control Systems Using Reinforcement Learning
    Arshad, Jehangir
    Khan, Ayesha
    Aftab, Mariam
    Hussain, Mujtaba
    Rehman, Ateeq Ur
    Ahmad, Shafiq
    Al-Shayea, Adel M.
    Shafiq, Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01) : 1153 - 1169
  • [17] Improvement of PMSM Control Using Reinforcement Learning Deep Deterministic Policy Gradient Agent
    Nicola, Marcel
    Nicola, Claudiu-Ionel
    2021 21ST INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS (EE 2021), 2021,
  • [18] Deep deterministic policy gradient to regulate feedback control systems using reinforcement learning
    Arshad, Jehangir
    Khan, Ayesha
    Aftab, Mariam
    Hussain, Mujtaba
    Rehman, Ateeq Ur
    Ahmad, Shafiq
    Al-Shayea, Adel M.
    Shafiq, Muhammad
    Computers, Materials and Continua, 2022, 71 (01) : 1153 - 1169
  • [19] Multi-robot Cooperation Learning Based on Powell Deep Deterministic Policy Gradient
    Li, Zongyuan
    Xiao, Chuxi
    Liu, Ziyi
    Guo, Xian
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT II, 2022, 13456 : 77 - 87
  • [20] Robotic Visual-Inertial Calibration via Deep Deterministic Policy Gradient Learning
    Zhu, Wenxing
    Wang, Lihui
    Chen, Liangliang
    Xu, Ninghui
    Su, Yuzuwei
    IEEE SENSORS JOURNAL, 2022, 22 (14) : 14448 - 14457