CT-DQN: Control-Tutored Deep Reinforcement Learning

被引:0
|
作者
De Lellis, Francesco [1 ]
Coraggio, Marco [2 ]
Russo, Giovanni [3 ]
Musolesi, Mirco [4 ,5 ]
di Bernardo, Mario [1 ,2 ]
机构
[1] Univ Naples Federico II, Naples, Italy
[2] Scuola Super Meridionale, Naples, Italy
[3] Univ Salerno, Salerno, Italy
[4] UCL, London, England
[5] Univ Bologna, Bologna, Italy
关键词
Reinforcement learning based control; deep reinforcement learning; feedback control;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the major challenges in Deep Reinforcement Learning for control is the need for extensive training to learn a policy. Motivated by this, we present the design of the Control-Tutored Deep Q-Networks (CT-DQN) algorithm, a Deep Reinforcement Learning algorithm that leverages a control tutor, i.e., an exogenous control law, to reduce learning time. The tutor can be designed using an approximate model of the system, without any assumption about the knowledge of the system dynamics. There is no expectation that it will be able to achieve the control objective if used standalone. During learning, the tutor occasionally suggests an action, thus partially guiding exploration. We validate our approach on three scenarios from OpenAI Gym: the inverted pendulum, lunar lander, and car racing. We demonstrate that CT-DQN is able to achieve better or equivalent data efficiency with respect to the classic function approximation solutions.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control
    DeLellis, Francesco
    Coraggio, Marco
    Russo, Giovanni
    Musolesi, Mirco
    di Bernardo, Mario
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168, 2022, 168
  • [2] Deep Reinforcement Learning with DQN vs. PPO in VizDoom
    Zakharenkov, Anton
    Makarov, Ilya
    21ST IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI), 2021, : 137 - 142
  • [3] Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
    Anschel, Oron
    Baram, Nir
    Shimkin, Nahum
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [4] Deep Reinforcement Learning in VizDoom via DQN and Actor-Critic Agents
    Bakhanova, Maria
    Makarov, Ilya
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 138 - 150
  • [5] Multi-robot path planning based on a deep reinforcement learning DQN algorithm
    Yang Yang
    Li Juntao
    Peng Lingling
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2020, 5 (03) : 177 - 183
  • [6] Tuning Apex DQN: A Reinforcement Learning based Deep Q-Network Algorithm
    Ruhela, Dhani
    Ruhela, Amit
    PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING 2024, PEARC 2024, 2024,
  • [7] Autonomous Maneuver Decision of UAV Based on Deep Reinforcement Learning: Comparison of DQN and DDPG
    Wang, Yu
    Ren, Tianjun
    Fan, Zilin
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 4857 - 4860
  • [8] Comparison of MPPT based on Deep Reinforcement Learning by DQN, DDPG and TD3
    Panggabean, Jayandi
    Sutisna, Nana
    Syafalni, Infall
    Adiono, Trio
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 261 - 266
  • [9] A Deep Reinforcement Learning Method for Mobile Robot Collision Avoidance based on Double DQN
    Xue, Xidi
    Li, Zhan
    Zhang, Dongsheng
    Yan, Yingxin
    2019 IEEE 28TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2019, : 2131 - 2136
  • [10] DQN Reinforcement Learning-based Steering Control Strategy for Autonomous Driving
    Lin, Xinyou
    Ye, Zhuoming
    Zhou, Binhao
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2023, 59 (16): : 315 - 324