CT-DQN: Control-Tutored Deep Reinforcement Learning

Cited by: 0

Authors:

De Lellis, Francesco ^{[1]}

Coraggio, Marco ^{[2]}

Russo, Giovanni ^{[3]}

Musolesi, Mirco ^{[4,5]}

di Bernardo, Mario ^{[1,2]}

Affiliations:

[1] Univ Naples Federico II, Naples, Italy

[2] Scuola Super Meridionale, Naples, Italy

[3] Univ Salerno, Salerno, Italy

[4] UCL, London, England

[5] Univ Bologna, Bologna, Italy

Source:

LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211 | 2023 / Vol. 211

Keywords:

Reinforcement learning based control; deep reinforcement learning; feedback control

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Subject classification code:

0812

Abstract:

One of the major challenges in Deep Reinforcement Learning for control is the need for extensive training to learn a policy. Motivated by this, we present the design of the Control-Tutored Deep Q-Networks (CT-DQN) algorithm, a Deep Reinforcement Learning algorithm that leverages a control tutor, i.e., an exogenous control law, to reduce learning time. The tutor can be designed using an approximate model of the system, without any assumption about the knowledge of the system dynamics. There is no expectation that it will be able to achieve the control objective if used standalone. During learning, the tutor occasionally suggests an action, thus partially guiding exploration. We validate our approach on three scenarios from OpenAI Gym: the inverted pendulum, lunar lander, and car racing. We demonstrate that CT-DQN is able to achieve better or equivalent data efficiency with respect to the classic function approximation solutions.
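
The abstract states that the tutor "occasionally suggests an action" during learning but does not say how often or under what condition. As a minimal sketch only, the following assumes the tutor is consulted with a fixed probability and that the agent otherwise falls back to ordinary epsilon-greedy selection over its Q-value estimates. The function name `select_action` and the parameters `tutor_prob` and `epsilon` are hypothetical and are not taken from the paper.

```python
import random
from typing import Sequence


def select_action(
    q_values: Sequence[float],
    tutor_action: int,
    tutor_prob: float,
    epsilon: float,
    rng: random.Random,
) -> int:
    """Pick an action index for one step of tutor-guided exploration.

    Assumption: tutor_prob and epsilon are illustrative knobs, not the
    paper's actual switching rule or hyperparameters.
    """
    u = rng.random()
    if u < tutor_prob:
        # Occasionally follow the control tutor's suggested action.
        return tutor_action
    if u < tutor_prob + epsilon:
        # Otherwise explore uniformly at random with probability epsilon.
        return rng.randrange(len(q_values))
    # Default: act greedily with respect to the current Q-value estimates.
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

In this sketch, setting `tutor_prob` to zero reduces the rule to plain epsilon-greedy exploration, which makes it easy to compare a tutored and an untutored agent under otherwise identical settings.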


Pages: 13
