A Deep Deterministic Policy Gradient Approach for Vehicle Speed Tracking Control With a Robotic Driver

被引：18

作者：

Hao, Gaofeng ^{[1
]}

Fu, Zhuang ^{[1
]}

Feng, Xin ^{[1
]}

Gong, Zening ^{[1
]}

Chen, Peng ^{[2
]}

Wang, Dan ^{[2
]}

Wang, Weibin ^{[2
]}

Si, Yang ^{[2
]}

机构：

[1] Shanghai Jiao Tong Univ, State Key Lab Mech Syst & Vibrat, Shanghai 200240, Peoples R China

[2] Pan Asia Tech Automot Ctr PATAC, Shanghai 201201, Peoples R China

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2022年 / 19卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Robots; Testing; Dynamometers; Training; Automobiles; Aerospace electronics; Resistance; Deep deterministic policy gradient (DDPG); network exploration; reinforcement learning (RL); replay buffer; robotic driver; vehicle speed tracking control;

D O I：

10.1109/TASE.2021.3088004

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In performance tests, replacing humans with robotic drivers has many advantages, such as high efficiency and high security. To realize the vehicle speed tracking control with a robotic driver, this article proposes a novel deep reinforcement learning (DRL) approach based on deep deterministic policy gradient (DDPG). Specifically, the design of the approach includes state space, action space, reward function, and control algorithm. Then, to shorten the training time, the proposed approach utilizes the basic fundamental relationship between vehicle speed and pedal opening to intervene in network exploration. Furthermore, to solve speed fluctuations in low-speed sections, the replay buffer is optimized by adding weighted training samples. Experiments are conducted on fifteen cars, and results show that the proposed algorithm can effectively control the vehicle speed. Generally, it only needs three or four episodes of training to meet the requirements. Compared with the Segment-PID method, the proposed method has a smoother speed and fewer overbound times.

引用

页码：2514 / 2525

页数：12

共 50 条

[1] Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
Berahman, Mehran
Rostami-Shahrbabaki, Majid
Bogenberger, Klaus
[J]. FUTURE TRANSPORTATION, 2022, 2 (04): : 1028 - 1046
[2] An enhanced deep deterministic policy gradient algorithm for intelligent control of robotic arms
Dong, Ruyi
Du, Junjie
Liu, Yanan
Heidari, Ali Asghar
Chen, Huiling
[J]. FRONTIERS IN NEUROINFORMATICS, 2023, 17
[3] Unmanned Surface Vehicle Course Tracking Control Based on Neural Network and Deep Deterministic Policy Gradient Algorithm
Wang, Yan
Tong, Jie
Song, Tian-Yu
Wan, Zhan-Hong
[J]. 2018 OCEANS - MTS/IEEE KOBE TECHNO-OCEANS (OTO), 2018,
[4] Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method
Xu, Zhao
Lyu, Yang
Pan, Quan
Hu, Jinwen
Zhao, Chunhui
Liu, Shuai
[J]. 2018 IEEE 14TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2018, : 306 - 311
[5] Reward adaptive wind power tracking control based on deep deterministic policy gradient
Chen, Peng
Han, Dezhi
[J]. APPLIED ENERGY, 2023, 348
[6] AUTONOMOUS VEHICLE DRIVING VIA DEEP DETERMINISTIC POLICY GRADIENT
Huang, Wenhui
Braghin, Francesco
Arrigoni, Stefano
[J]. PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 3, 2020,
[7] Target tracking strategy using deep deterministic policy gradient
You, Shixun
Diao, Ming
Gao, Lipeng
Zhang, Fulong
Wang, Huan
[J]. APPLIED SOFT COMPUTING, 2020, 95
[8] Robotic-Arm-Based Force Control by Deep Deterministic Policy Gradient in Neurosurgical Practice
Inziarte-Hidalgo, Ibai
Gorospe, Erik
Zulueta, Ekaitz
Lopez-Guede, Jose Manuel
Fernandez-Gamiz, Unai
Etxebarria, Saioa
[J]. MATHEMATICS, 2023, 11 (19)
[9] Deep Recurrent Deterministic Policy Gradient for Physical Control
Zhang, Lei
Han, Shuai
Zhang, Zhiruo
Li, Lefan
Lu, Shuai
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 257 - 268
[10] Deep deterministic policy gradient algorithm for UAV control
Huang X.
Liu J.
Jia C.
Wang Z.
Zhang J.
[J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (11):

← 1 2 3 4 5 →