Neuro-dynamic programming for optimal control of macroscopic fundamental diagram systems

被引:37
|
作者
Su, Z. C. [1 ,4 ]
Chow, Andy H. F. [1 ]
Zheng, N. [2 ]
Huang, Y. P. [3 ,4 ]
Liang, E. M. [4 ]
Zhong, R. X. [4 ]
机构
[1] City Univ Hong Kong, Dept Architecture & Civil Engn, Hong Kong, Peoples R China
[2] Monash Univ, Dept Civil Engn, Melbourne, Vic, Australia
[3] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[4] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou, Peoples R China
基金
国家重点研发计划;
关键词
Macroscopic fundamental diagram; Hamilton-Jacobi-Bellman equation; Neuro-dynamic programming; Policy iteration; Saturated state and input; URBAN ROAD NETWORKS; PERIMETER CONTROL; TRAFFIC CONTROL; NONLINEAR-SYSTEMS; GATING CONTROL; CONGESTION; MODEL; APPROXIMATION; DERIVATIVES; STABILITY;
D O I
10.1016/j.trc.2020.102628
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
The macroscopic fundamental diagram (MFD) can effectively reduce the spatial dimension involved in dynamic optimization of traffic performance for large-scale networks. Solving the Hamilton-Jacobi-Bellman (HJB) equation takes center stage in yielding solutions to the optimal control problem. At the core of solving the HJB equation is the value function that represents choosing a sequence of actions to optimize the system performance. However, this problem generally becomes intractable for possible discontinuities in the solution and the curse of dimensionality for systems with all but modest dimension. To address these challenges, a neural network is used to approximate the value function to obtain the optimal controls through policy iteration. Furthermore, a saturated operator is embedded in the neural network approximator to handle the difficulty caused by the control and state constraints. This policy iteration can be implemented as an iterative data-driven technique that integrates with the model-based optimal design based on real-time observations. Numerical experiments are conducted to show that the neuro-dynamic programming approach can achieve optimization goals while stabilizing the system by regulating the traffic state to the desired uncongested equilibrium.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] A neuro-dynamic programming approach for perimeter control of two urban regions with macroscopic fundamental diagrams
    Su, Z. C.
    Chow, Andy H. F.
    Zheng, N.
    Huang, Y. P.
    Zhong, R. X.
    [J]. 2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2944 - 2949
  • [2] Neuro-dynamic programming
    Volgenant, T
    [J]. INTERFACES, 1997, 27 (06) : 143 - 143
  • [3] Neuro-dynamic programming approach to optimal control of spreading of dengue viruses
    Putri, Anindiati Praminto
    Hartono
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2018, : 169 - 174
  • [4] Optimal etch time control design using neuro-dynamic programming
    Yang, L
    Si, J
    [J]. 2001 IEEE INTERNATIONAL SYMPOSIUM ON SEMICONDUCTOR MANUFACTURING, CONFERENCE PROCEEDINGS, 2001, : 75 - 78
  • [5] Neuro-dynamic programming for cooperative inventory control
    Bauso, D
    Giarré, L
    Pesenti, R
    [J]. PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 5527 - 5532
  • [6] Constrained Online Optimal Control for Continuous-Time Nonlinear Systems Using Neuro-Dynamic Programming
    Yang Xiong
    Liu Derong
    Wang Ding
    Ma Hongwen
    [J]. 2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8717 - 8722
  • [7] Neuro-Dynamic Programming for adaptive fusion complexity control
    Ross, KN
    Chaney, RD
    [J]. SENSOR FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS III, 1999, 3719 : 398 - 409
  • [8] Neuro-Dynamic Programming in Control of the Ball and Beam System
    Burghardt, Andrzej
    Szuster, Marcin
    [J]. MECHATRONIC SYSTEMS, MECHANICS AND MATERIALS II, 2014, 210 : 206 - 214
  • [9] Control of a logistic node via Neuro-Dynamic Programming
    Boccadoro, Mauro
    Martinelli, Francesco
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 4896 - 4901
  • [10] A neuro-dynamic programming approach to the optimal stand management problem
    Comeau, Jules
    Gunn, Eldon
    [J]. CANADIAN JOURNAL OF FOREST RESEARCH, 2017, 47 (06) : 808 - 816