A supervised Actor–Critic approach for adaptive cruise control

被引：3

作者：

Dongbin Zhao

Bin Wang

Derong Liu

机构：

[1] Chinese Academy of Sciences,The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation

来源：

Soft Computing | 2013年 / 17卷

关键词：

Supervised reinforcement learning; Actor–Critic; Adaptive cruise control; Uniformly ultimate bounded; Neural networks;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

A novel supervised Actor–Critic (SAC) approach for adaptive cruise control (ACC) problem is proposed in this paper. The key elements required by the SAC algorithm namely Actor and Critic, are approximated by feed-forward neural networks respectively. The output of Actor and the state are input to Critic to approximate the performance index function. A Lyapunov stability analysis approach has been presented to prove the uniformly ultimate bounded property of the estimation errors of the neural networks. Moreover, we use the supervisory controller to pre-train Actor to achieve a basic control policy, which can improve the training convergence and success rate. We apply this method to learn an approximate optimal control policy for the ACC problem. Experimental results in several driving scenarios demonstrate that the SAC algorithm performs well, so it is feasible and effective for the ACC problem.

引用

页码：2089 / 2099

页数：10

共 50 条

[21] Cooperative Adaptive Cruise Control: A Reinforcement Learning Approach
Desjardins, Charles
Chaib-draa, Brahim
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (04) : 1248 - 1260
[22] A Model Predictive Cooperative Adaptive Cruise Control Approach
Stanger, Thomas
del Re, Luigi
[J]. 2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, : 1374 - 1379
[23] Adaptive actor-critic structure for parametrized controllers
Goehrt, Thomas
Osinenko, Pavel
Streif, Stefan
[J]. IFAC PAPERSONLINE, 2019, 52 (16): : 652 - 657
[24] Adaptive Advantage Estimation for Actor-Critic Algorithms
Chen, Yurou
Zhang, Fengyi
Liu, Zhiyong
[J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[25] SMONAC: Supervised Multiobjective Negative Actor-Critic for Sequential Recommendation
Zhou, Fei
Luo, Biao
Wu, Zhengke
Huang, Tingwen
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 13
[26] SOAC: Supervised Off-Policy Actor -Critic for Recommender Systems
Wu, Shiqing
Xu, Guandong
Wang, Xianzhi
[J]. 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 14121 - 14626
[27] Actor Critic Learning: A Near Set Approach
Anwar, Shamama
Patnaik, K. Sridhar
[J]. ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2008, 5306 : 252 - 261
[28] Adaptive fault-tolerant control for spacecraft: A dynamic Stackelberg game approach with advantage actor-critic reinforcement learning
Meng, Yizhen
Liu, Chun
Liu, Yangyang
Tan, Longyu
[J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 154
[29] Actor Critic Agents for Wind Farm Control
Monroc, Claire Bizon
Busic, Ana
Dubuc, Donatien
Zhu, Jiamin
[J]. 2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 177 - 183
[30] Neural Network Predictive Control Approach Design for Adaptive Cruise Control
Mahadika, Pratama
Subiantoro, Aries
Kusumoputro, Benyamin
[J]. INTERNATIONAL JOURNAL OF TECHNOLOGY, 2020, 11 (07): : 1451 - 1462

← 1 2 3 4 5 →