Comparison of Deep Reinforcement Learning and Model Predictive Control for Adaptive Cruise Control

被引:103
|
作者
Lin, Yuan [1 ]
McPhee, John [1 ]
Azad, Nasser L. [1 ]
机构
[1] Univ Waterloo, Syst Design Engn Dept, Waterloo, ON N2L 3G1, Canada
来源
基金
加拿大自然科学与工程研究理事会;
关键词
Learning (artificial intelligence); Mathematical model; Cost function; Testing; Optimal control; Delays; Deep reinforcement learning; Model Predictive Control (MPC); Adaptive Cruise Control (ACC);
D O I
10.1109/TIV.2020.3012947
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study compares Deep Reinforcement Learning (DRL) and Model Predictive Control (MPC) for Adaptive Cruise Control (ACC) design in car-following scenarios. A first-order system is used as the Control-Oriented Model (COM) to approximate the acceleration command dynamics of a vehicle. Based on the equations of the control system and the multi-objective cost function, we train a DRL policy using Deep Deterministic Policy Gradient (DDPG) and solve the MPC problem via Interior-Point Optimization (IPO). Simulation results for the episode costs show that, when there are no modeling errors and the testing inputs are within the training data range, the DRL solution is equivalent to MPC with a sufficiently long prediction horizon. Particularly, the DRL episode cost is only 5.8% higher than the benchmark optimal control solution provided by optimizing the entire episode via IPO. The DRL control performance degrades when the testing inputs are outside the training data range, indicating inadequate machine learning generalization. When there are modeling errors due to control delay, disturbances, and/or testing with a High-Fidelity Model (HFM) of the vehicle, the DRL-trained policy performs better when the modeling errors are large while having similar performances as MPC when the modeling errors are small.
引用
收藏
页码:221 / 231
页数:11
相关论文
共 50 条
  • [31] Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control
    Peake, Ashley
    McCalmon, Joe
    Raiford, Benjamin
    Liu, Tongtong
    Alqahtani, Sarra
    [J]. 2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 15 - 22
  • [32] Optimum trajectory learning in musculoskeletal systems with model predictive control and deep reinforcement learning
    Berat Denizdurduran
    Henry Markram
    Marc-Oliver Gewaltig
    [J]. Biological Cybernetics, 2022, 116 : 711 - 726
  • [33] Predictive Control of a Robot Manipulator with Deep Reinforcement Learning
    Bejar, Eduardo
    Moran, Antonio
    [J]. 2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2021, : 127 - 130
  • [34] Safe Deep Reinforcement Learning for Adaptive Cruise Control by Imposing State-Specific Safe Sets
    Brosowsky, Mathis
    Keck, Florian
    Ketterer, Jakob
    Isele, Simon
    Slieter, Daniel
    Zoellner, Marius
    [J]. 2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, : 488 - 495
  • [35] Adaptive parameterized model predictive control based on reinforcement learning: A synthesis framework
    Sun, Dingshan
    Jamshidnejad, Anahita
    De Schutter, Bart
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [36] Model Predictive Control Guided Reinforcement Learning Control Scheme
    Xie, Huimin
    Xu, Xinghai
    Li, Yuling
    Hong, Wenjing
    Shi, Jia
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [37] Containerized Framework for Building Control Performance Comparisons: Model Predictive Control vs Deep Reinforcement Learning Control
    Fu, Yangyang
    Xu, Shichao
    Zhu, Qi
    O'Neill, Zheng
    [J]. BUILDSYS'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILT ENVIRONMENTS, 2021, : 276 - 280
  • [38] Space control law design in adaptive cruise control vehicles using model predictive control
    Marzbanrad, J.
    Karimian, N.
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2011, 225 (D7) : 870 - 884
  • [39] Model predictive control with constraints for a nonlinear adaptive cruise control vehicle model in transition manoeuvres
    Ali, Zeeshan
    Popov, Atanas A.
    Charles, Guy
    [J]. VEHICLE SYSTEM DYNAMICS, 2013, 51 (06) : 943 - 963
  • [40] Improved Model Predictive Control for Cooperative Adaptive Cruise Control Subject to Actuator Delay
    Liu, Fuchun
    Chen, Yifeng
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 4717 - 4722