Policy Iteration Based Approximate Dynamic Programming Toward Autonomous Driving in Constrained Dynamic Environment

被引:16
|
作者
Lin, Ziyu [1 ]
Ma, Jun [2 ,3 ]
Duan, Jingliang [4 ]
Li, Shengbo Eben [1 ]
Ma, Haitong [1 ]
Cheng, Bo [1 ]
Lee, Tong Heng [5 ]
机构
[1] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China
[2] Hong Kong Univ Sci & Technol Guangzhou, Robot & Autonomous Syst Thrust, Guangzhou, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[4] Univ Sci & Technol Beijing, Sch Mech Engn, Beijing 100083, Peoples R China
[5] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117583, Singapore
基金
国家重点研发计划;
关键词
Planning; Autonomous vehicles; Vehicle dynamics; Task analysis; Heuristic algorithms; Approximation algorithms; Roads; Autonomous driving; approximate dynamic programming; motion planning; constrained optimization; reinforcement learning; VEHICLE;
D O I
10.1109/TITS.2023.3237568
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In the area of autonomous driving, it typically brings great difficulty in solving the motion planning problem since the vehicle model is nonlinear and the driving scenarios are complex. Particularly, most of the existing methods cannot be generalized to dynamically changing scenarios with varying surrounding vehicles. To address this problem, this development here investigates the framework of integrated decision and control. As part of the modules, static path planning determines the reference candidates ahead, and then the optimal path-tracking controller realizes the specific autonomous driving task. An innovative and effective constrained finite-horizon approximate dynamic programming (ADP) algorithm is herein presented to generate the desired control policy for effective path tracking. With the generalized policy neural network that maps from the state to the control input, the proposed algorithm preserves the high effectiveness for the motion planning problem towards changing driving environments with varying surrounding vehicles. Moreover, the algorithm attains the noteworthy advantage of alleviating the typically heavy computational loads with the mode of offline training and online execution. As a result of the utilization of multi-layer neural networks in conjunction with the actor-critic framework, the constrained ADP method is capable of handling complex and multidimensional scenarios. Finally, various simulations have been carried out to show that the constrained ADP algorithm is effective.
引用
收藏
页码:5003 / 5013
页数:11
相关论文
共 50 条
  • [41] Dynamic representations for autonomous driving
    Olier, Juan Sebastian
    Marin-Plaza, Pablo
    Martin, David
    Marcenaro, Lucio
    Barakova, Emilia
    Rauterberg, Matthias
    Regazzoni, Carlo
    2017 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2017,
  • [42] Dynamic Energy Management for Hybrid Electric Vehicle Based on Approximate Dynamic Programming
    Li, Weimin
    Xu, Guoqing
    Wang, Zhancheng
    Xu, Yangsheng
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 7864 - +
  • [43] Region-based approximation in approximate dynamic programming
    Sardarmehni, Tohid
    Song, Xingyong
    INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (02) : 306 - 315
  • [44] Microgrid Energy Management based on Approximate Dynamic Programming
    Strelec, Martin
    Berka, Jan
    2013 4TH IEEE/PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE (ISGT EUROPE), 2013,
  • [45] Approximate dynamic programming for constrained linear systems: A piecewise quadratic approximation approach☆
    He, Kanghui
    Shi, Shengling
    van den Boom, Ton
    De Schutter, Bart
    AUTOMATICA, 2024, 160
  • [46] Differential neural network robust constrained controller using approximate dynamic programming
    Noriega-Marquez, Sebastian
    Poznyak, Alexander
    Hernandez-Sanchez, Alejandra
    Chairez, Isaac
    EUROPEAN JOURNAL OF CONTROL, 2024, 78
  • [47] Constrained Unscented Dynamic Programming
    Plancher, Brian
    Manchester, Zachary
    Kuindersma, Scott
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 5674 - 5680
  • [48] State Aggregation based Linear Programming approach to Approximate Dynamic Programming
    Darbha, S.
    Krishnamoorthy, K.
    Pachter, M.
    Chandler, P.
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 935 - 941
  • [49] Constrained discounted dynamic programming
    Feinberg, EA
    Shwartz, A
    MATHEMATICS OF OPERATIONS RESEARCH, 1996, 21 (04) : 922 - 945
  • [50] Path Planning in an Uncertain Environment Using Approximate Dynamic Programming Methods
    Bienkowski, Adam
    Sidoti, David
    Zhang, Lingyi
    Pattipati, Krishna R.
    Sampson, Charles R.
    Hansen, James
    2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2542 - 2547