A Deep Deterministic Policy Gradient Approach for Vehicle Speed Tracking Control With a Robotic Driver

被引:18
|
作者
Hao, Gaofeng [1 ]
Fu, Zhuang [1 ]
Feng, Xin [1 ]
Gong, Zening [1 ]
Chen, Peng [2 ]
Wang, Dan [2 ]
Wang, Weibin [2 ]
Si, Yang [2 ]
机构
[1] Shanghai Jiao Tong Univ, State Key Lab Mech Syst & Vibrat, Shanghai 200240, Peoples R China
[2] Pan Asia Tech Automot Ctr PATAC, Shanghai 201201, Peoples R China
基金
中国国家自然科学基金;
关键词
Robots; Testing; Dynamometers; Training; Automobiles; Aerospace electronics; Resistance; Deep deterministic policy gradient (DDPG); network exploration; reinforcement learning (RL); replay buffer; robotic driver; vehicle speed tracking control;
D O I
10.1109/TASE.2021.3088004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In performance tests, replacing humans with robotic drivers has many advantages, such as high efficiency and high security. To realize the vehicle speed tracking control with a robotic driver, this article proposes a novel deep reinforcement learning (DRL) approach based on deep deterministic policy gradient (DDPG). Specifically, the design of the approach includes state space, action space, reward function, and control algorithm. Then, to shorten the training time, the proposed approach utilizes the basic fundamental relationship between vehicle speed and pedal opening to intervene in network exploration. Furthermore, to solve speed fluctuations in low-speed sections, the replay buffer is optimized by adding weighted training samples. Experiments are conducted on fifteen cars, and results show that the proposed algorithm can effectively control the vehicle speed. Generally, it only needs three or four episodes of training to meet the requirements. Compared with the Segment-PID method, the proposed method has a smoother speed and fewer overbound times.
引用
收藏
页码:2514 / 2525
页数:12
相关论文
共 50 条
  • [1] Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
    Berahman, Mehran
    Rostami-Shahrbabaki, Majid
    Bogenberger, Klaus
    [J]. FUTURE TRANSPORTATION, 2022, 2 (04): : 1028 - 1046
  • [2] An enhanced deep deterministic policy gradient algorithm for intelligent control of robotic arms
    Dong, Ruyi
    Du, Junjie
    Liu, Yanan
    Heidari, Ali Asghar
    Chen, Huiling
    [J]. FRONTIERS IN NEUROINFORMATICS, 2023, 17
  • [3] Unmanned Surface Vehicle Course Tracking Control Based on Neural Network and Deep Deterministic Policy Gradient Algorithm
    Wang, Yan
    Tong, Jie
    Song, Tian-Yu
    Wan, Zhan-Hong
    [J]. 2018 OCEANS - MTS/IEEE KOBE TECHNO-OCEANS (OTO), 2018,
  • [4] Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method
    Xu, Zhao
    Lyu, Yang
    Pan, Quan
    Hu, Jinwen
    Zhao, Chunhui
    Liu, Shuai
    [J]. 2018 IEEE 14TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2018, : 306 - 311
  • [5] Reward adaptive wind power tracking control based on deep deterministic policy gradient
    Chen, Peng
    Han, Dezhi
    [J]. APPLIED ENERGY, 2023, 348
  • [6] AUTONOMOUS VEHICLE DRIVING VIA DEEP DETERMINISTIC POLICY GRADIENT
    Huang, Wenhui
    Braghin, Francesco
    Arrigoni, Stefano
    [J]. PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 3, 2020,
  • [7] Target tracking strategy using deep deterministic policy gradient
    You, Shixun
    Diao, Ming
    Gao, Lipeng
    Zhang, Fulong
    Wang, Huan
    [J]. APPLIED SOFT COMPUTING, 2020, 95
  • [8] Robotic-Arm-Based Force Control by Deep Deterministic Policy Gradient in Neurosurgical Practice
    Inziarte-Hidalgo, Ibai
    Gorospe, Erik
    Zulueta, Ekaitz
    Lopez-Guede, Jose Manuel
    Fernandez-Gamiz, Unai
    Etxebarria, Saioa
    [J]. MATHEMATICS, 2023, 11 (19)
  • [9] Deep Recurrent Deterministic Policy Gradient for Physical Control
    Zhang, Lei
    Han, Shuai
    Zhang, Zhiruo
    Li, Lefan
    Lu, Shuai
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 257 - 268
  • [10] Deep deterministic policy gradient algorithm for UAV control
    Huang X.
    Liu J.
    Jia C.
    Wang Z.
    Zhang J.
    [J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (11):