A Deep Deterministic Policy Gradient Approach for Vehicle Speed Tracking Control With a Robotic Driver

被引：18

作者：

Hao, Gaofeng ^{[1
]}

Fu, Zhuang ^{[1
]}

Feng, Xin ^{[1
]}

Gong, Zening ^{[1
]}

Chen, Peng ^{[2
]}

Wang, Dan ^{[2
]}

Wang, Weibin ^{[2
]}

Si, Yang ^{[2
]}

机构：

[1] Shanghai Jiao Tong Univ, State Key Lab Mech Syst & Vibrat, Shanghai 200240, Peoples R China

[2] Pan Asia Tech Automot Ctr PATAC, Shanghai 201201, Peoples R China

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2022年 / 19卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Robots; Testing; Dynamometers; Training; Automobiles; Aerospace electronics; Resistance; Deep deterministic policy gradient (DDPG); network exploration; reinforcement learning (RL); replay buffer; robotic driver; vehicle speed tracking control;

D O I：

10.1109/TASE.2021.3088004

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In performance tests, replacing humans with robotic drivers has many advantages, such as high efficiency and high security. To realize the vehicle speed tracking control with a robotic driver, this article proposes a novel deep reinforcement learning (DRL) approach based on deep deterministic policy gradient (DDPG). Specifically, the design of the approach includes state space, action space, reward function, and control algorithm. Then, to shorten the training time, the proposed approach utilizes the basic fundamental relationship between vehicle speed and pedal opening to intervene in network exploration. Furthermore, to solve speed fluctuations in low-speed sections, the replay buffer is optimized by adding weighted training samples. Experiments are conducted on fifteen cars, and results show that the proposed algorithm can effectively control the vehicle speed. Generally, it only needs three or four episodes of training to meet the requirements. Compared with the Segment-PID method, the proposed method has a smoother speed and fewer overbound times.

引用

下载

页码：2514 / 2525

页数：12

共 50 条

[41] Optimal Transportation Network Company Vehicle Dispatching via Deep Deterministic Policy Gradient
Shi, Dian
Li, Xuanheng
Li, Ming
Wang, Jie
Li, Pan
Pan, Miao
[J]. WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2019, 2019, 11604 : 297 - 309
[42] Composite deep learning control for autonomous bicycles by using deep deterministic policy gradient
He, Kanghui
Dong, Chaoyang
Yan, An
Zheng, Qingyuan
Liang, Bin
Wang, Qing
[J]. IECON 2020: THE 46TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2020, : 2766 - 2773
[43] A State-Compensated Deep Deterministic Policy Gradient Algorithm for UAV Trajectory Tracking
Wu, Jiying
Yang, Zhong
Liao, Luwei
He, Naifeng
Wang, Zhiyong
Wang, Can
[J]. MACHINES, 2022, 10 (07)
[44] Multi-Agent Distributed Deep Deterministic Policy Gradient for Partially Observable Tracking
Fan, Dongyu
Shen, Haikuo
Dong, Lijing
[J]. ACTUATORS, 2021, 10 (10)
[45] Mutual Deep Deterministic Policy Gradient Learning
Sun, Zhou
[J]. 2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 508 - 513
[46] Deep Deterministic Policy Gradient for Portfolio Management
Khemlichi, Firdaous
Chougrad, Hiba
Khamlichi, Youness Idrissi
El Boushaki, Abdessamad
Ben Ali, Safae Elhaj
[J]. 2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 424 - 429
[47] Flatness Based Velocity Tracking Control of a Vehicle on a Roller Dynamometer Using a Robotic Driver
Sailer, Stefan
Buchholz, Michael
Dietmayer, Klaus
[J]. 2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 7962 - 7967
[48] Deep Deterministic Policy Gradient Virtual Coupling control for thecoordination and manoeuvring of heterogeneous uncertain nonlinearHigh-Speed Trains
Basile, Giacomo
Lu, Dario Giuseppe
Petrillo, Alberto
Santini, Stefania
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[49] Twin Delayed Deep Deterministic Policy Gradient-Based Target Tracking for Unmanned Aerial Vehicle With Achievement Rewarding and Multistage Training
Mosali, Najmaddin Abo
Shamsudin, Syariful Syafiq
Alfandi, Omar
Omar, Rosli
Al-Fadhali, Najib
[J]. IEEE ACCESS, 2022, 10 : 23545 - 23559
[50] Grasping Control of a Vision Robot Based on a Deep Attentive Deterministic Policy Gradient
Ji, Xiangxin
Xiong, Feng
Kong, Weichang
Wei, Dongfei
Shen, Zeyan
[J]. IEEE Access, 2022, 10 : 867 - 878

← 1 2 3 4 5 →