A Deep Deterministic Policy Gradient Approach for Vehicle Speed Tracking Control With a Robotic Driver

被引:18
|
作者
Hao, Gaofeng [1 ]
Fu, Zhuang [1 ]
Feng, Xin [1 ]
Gong, Zening [1 ]
Chen, Peng [2 ]
Wang, Dan [2 ]
Wang, Weibin [2 ]
Si, Yang [2 ]
机构
[1] Shanghai Jiao Tong Univ, State Key Lab Mech Syst & Vibrat, Shanghai 200240, Peoples R China
[2] Pan Asia Tech Automot Ctr PATAC, Shanghai 201201, Peoples R China
基金
中国国家自然科学基金;
关键词
Robots; Testing; Dynamometers; Training; Automobiles; Aerospace electronics; Resistance; Deep deterministic policy gradient (DDPG); network exploration; reinforcement learning (RL); replay buffer; robotic driver; vehicle speed tracking control;
D O I
10.1109/TASE.2021.3088004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In performance tests, replacing humans with robotic drivers has many advantages, such as high efficiency and high security. To realize the vehicle speed tracking control with a robotic driver, this article proposes a novel deep reinforcement learning (DRL) approach based on deep deterministic policy gradient (DDPG). Specifically, the design of the approach includes state space, action space, reward function, and control algorithm. Then, to shorten the training time, the proposed approach utilizes the basic fundamental relationship between vehicle speed and pedal opening to intervene in network exploration. Furthermore, to solve speed fluctuations in low-speed sections, the replay buffer is optimized by adding weighted training samples. Experiments are conducted on fifteen cars, and results show that the proposed algorithm can effectively control the vehicle speed. Generally, it only needs three or four episodes of training to meet the requirements. Compared with the Segment-PID method, the proposed method has a smoother speed and fewer overbound times.
引用
下载
收藏
页码:2514 / 2525
页数:12
相关论文
共 50 条
  • [41] Optimal Transportation Network Company Vehicle Dispatching via Deep Deterministic Policy Gradient
    Shi, Dian
    Li, Xuanheng
    Li, Ming
    Wang, Jie
    Li, Pan
    Pan, Miao
    [J]. WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2019, 2019, 11604 : 297 - 309
  • [42] Composite deep learning control for autonomous bicycles by using deep deterministic policy gradient
    He, Kanghui
    Dong, Chaoyang
    Yan, An
    Zheng, Qingyuan
    Liang, Bin
    Wang, Qing
    [J]. IECON 2020: THE 46TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2020, : 2766 - 2773
  • [43] A State-Compensated Deep Deterministic Policy Gradient Algorithm for UAV Trajectory Tracking
    Wu, Jiying
    Yang, Zhong
    Liao, Luwei
    He, Naifeng
    Wang, Zhiyong
    Wang, Can
    [J]. MACHINES, 2022, 10 (07)
  • [44] Multi-Agent Distributed Deep Deterministic Policy Gradient for Partially Observable Tracking
    Fan, Dongyu
    Shen, Haikuo
    Dong, Lijing
    [J]. ACTUATORS, 2021, 10 (10)
  • [45] Mutual Deep Deterministic Policy Gradient Learning
    Sun, Zhou
    [J]. 2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 508 - 513
  • [46] Deep Deterministic Policy Gradient for Portfolio Management
    Khemlichi, Firdaous
    Chougrad, Hiba
    Khamlichi, Youness Idrissi
    El Boushaki, Abdessamad
    Ben Ali, Safae Elhaj
    [J]. 2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 424 - 429
  • [47] Flatness Based Velocity Tracking Control of a Vehicle on a Roller Dynamometer Using a Robotic Driver
    Sailer, Stefan
    Buchholz, Michael
    Dietmayer, Klaus
    [J]. 2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 7962 - 7967
  • [48] Deep Deterministic Policy Gradient Virtual Coupling control for thecoordination and manoeuvring of heterogeneous uncertain nonlinearHigh-Speed Trains
    Basile, Giacomo
    Lu, Dario Giuseppe
    Petrillo, Alberto
    Santini, Stefania
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [49] Twin Delayed Deep Deterministic Policy Gradient-Based Target Tracking for Unmanned Aerial Vehicle With Achievement Rewarding and Multistage Training
    Mosali, Najmaddin Abo
    Shamsudin, Syariful Syafiq
    Alfandi, Omar
    Omar, Rosli
    Al-Fadhali, Najib
    [J]. IEEE ACCESS, 2022, 10 : 23545 - 23559
  • [50] Grasping Control of a Vision Robot Based on a Deep Attentive Deterministic Policy Gradient
    Ji, Xiangxin
    Xiong, Feng
    Kong, Weichang
    Wei, Dongfei
    Shen, Zeyan
    [J]. IEEE Access, 2022, 10 : 867 - 878