A supervised Actor-Critic approach for adaptive cruise control

Cited by: 58
Authors
Zhao, Dongbin [1]
Wang, Bin [1]
Liu, Derong [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China;
Keywords
Supervised reinforcement learning; Actor-Critic; Adaptive cruise control; Uniformly ultimate bounded; Neural networks; FEEDBACK-CONTROL; SYSTEMS; ACC;
DOI
10.1007/s00500-013-1110-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A novel supervised Actor-Critic (SAC) approach for the adaptive cruise control (ACC) problem is proposed in this paper. The key elements required by the SAC algorithm, namely the Actor and the Critic, are each approximated by a feed-forward neural network. The output of the Actor and the state are input to the Critic, which approximates the performance index function. A Lyapunov stability analysis is presented to prove that the estimation errors of the neural networks are uniformly ultimately bounded. Moreover, a supervisory controller is used to pre-train the Actor so that it starts from a basic control policy, which improves training convergence and success rate. We apply this method to learn an approximately optimal control policy for the ACC problem. Experimental results in several driving scenarios show that the SAC algorithm performs well, indicating that it is feasible and effective for the ACC problem.
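The structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two-element state (spacing error, relative speed), the saturated linear `supervisor`, the `TinyNet` class, the network sizes, and the learning rate are all assumptions made for illustration. The sketch shows only two things: pre-training the Actor to imitate a supervisory controller, and feeding the state together with the Actor's output into the Critic. The Critic's learning rule and the stability analysis are not covered.

```python
import math
import random

def supervisor(state, k_d=0.5, k_v=0.8):
    """Hypothetical supervisory controller: saturated linear feedback."""
    d_err, v_rel = state
    return max(-1.0, min(1.0, k_d * d_err + k_v * v_rel))

class TinyNet:
    """One-hidden-layer feed-forward network with tanh hidden units (no biases)."""

    def __init__(self, n_in, n_hidden, rng, squash_output):
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.squash_output = squash_output

    def forward(self, x):
        # Cache activations so sgd_step can reuse them.
        self.x = list(x)
        self.h = [math.tanh(sum(w * xi for w, xi in zip(row, self.x)))
                  for row in self.w1]
        z = sum(w * h for w, h in zip(self.w2, self.h))
        self.y = math.tanh(z) if self.squash_output else z
        return self.y

    def sgd_step(self, target, lr):
        """One gradient step on 0.5 * (y - target)^2 for the last forward pass."""
        dz = self.y - target
        if self.squash_output:
            dz *= 1.0 - self.y ** 2
        for j, h in enumerate(self.h):
            dh = dz * self.w2[j] * (1.0 - h ** 2)  # uses w2[j] before its update
            self.w2[j] -= lr * dz * h
            for i, xi in enumerate(self.x):
                self.w1[j][i] -= lr * dh * xi

def mean_sq_error(actor, states):
    """Average squared gap between the Actor and the supervisor."""
    return sum((actor.forward(s) - supervisor(s)) ** 2 for s in states) / len(states)

rng = random.Random(0)
actor = TinyNet(2, 8, rng, squash_output=True)    # state -> bounded control
critic = TinyNet(3, 8, rng, squash_output=False)  # (state, action) -> scalar
states = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(200)]

# Supervised pre-training: the Actor imitates the supervisor on sampled states.
error_before = mean_sq_error(actor, states)
for _ in range(200):
    for s in states:
        actor.forward(s)
        actor.sgd_step(supervisor(s), lr=0.05)
error_after = mean_sq_error(actor, states)

# The Critic receives the state concatenated with the Actor's output.
state = states[0]
action = actor.forward(state)
value_estimate = critic.forward([*state, action])
```

In this sketch the Critic is left untrained; in an actor-critic scheme it would be updated from the observed cost signal, and its estimate would in turn drive further updates of the pre-trained Actor.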
Pages: 2089-2099
Number of pages: 11
Related papers
50 in total
  • [1] A supervised Actor–Critic approach for adaptive cruise control
    Dongbin Zhao
    Bin Wang
    Derong Liu
    [J]. Soft Computing, 2013, 17 : 2089 - 2099
  • [2] Design and Implementation of an Adaptive Cruise Control System Based on Supervised Actor-Critic Learning
    Wang, Bin
    Zhao, Dongbin
    Li, Chengdong
    Dai, Yujie
    [J]. 2015 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2015, : 243 - 248
  • [3] Supervised Advantage Actor-Critic for Recommender Systems
    Xin, Xin
    Karatzoglou, Alexandros
    Arapakis, Ioannis
    Jose, Joemon M.
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1186 - 1196
  • [4] ADAPTIVE ACTOR-CRITIC BILATERAL FILTER
    Chen, Bo-Hao
    Cheng, Hsiang-Yin
    Yin, Jia-Li
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1675 - 1679
  • [5] Adaptive actor-critic control of robots with integral invariant manifold
    Pantoja-Garcia, Luis
    Garcia-Rodriguez, Rodolfo
    Parra-Vega, Vicente
    [J]. 2021 IEEE CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (IEEE CHILECON 2021), 2021, : 782 - 787
  • [6] Adaptive actor-critic structure for parametrized controllers
    Goehrt, Thomas
    Osinenko, Pavel
    Streif, Stefan
    [J]. IFAC PAPERSONLINE, 2019, 52 (16): : 652 - 657
  • [7] Adaptive Advantage Estimation for Actor-Critic Algorithms
    Chen, Yurou
    Zhang, Fengyi
    Liu, Zhiyong
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [8] Actor-Critic based Adaptive Control Strategy for Effective Energy Management
    Sankaranarayanan, Chandramouli
    Shaju, Sreenath
    Sukhwani, Mohak
    [J]. PROCEEDINGS OF THE 2022 5TH INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND EMERGENT TECHNOLOGIES (IC_ASET'2022), 2022, : 23 - 28
  • [9] SMONAC: Supervised Multiobjective Negative Actor-Critic for Sequential Recommendation
    Zhou, Fei
    Luo, Biao
    Wu, Zhengke
    Huang, Tingwen
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 13
  • [10] An Actor-Critic approach for control of Residential Photovoltaic-Battery Systems
    Joshi, Amit
    Tipaldi, Massimo
    Glielmo, Luigi
    [J]. IFAC PAPERSONLINE, 2021, 54 (07): : 222 - 227