Neuro-dynamic programming for optimal control of macroscopic fundamental diagram systems

被引:37
|
作者
Su, Z. C. [1 ,4 ]
Chow, Andy H. F. [1 ]
Zheng, N. [2 ]
Huang, Y. P. [3 ,4 ]
Liang, E. M. [4 ]
Zhong, R. X. [4 ]
机构
[1] City Univ Hong Kong, Dept Architecture & Civil Engn, Hong Kong, Peoples R China
[2] Monash Univ, Dept Civil Engn, Melbourne, Vic, Australia
[3] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[4] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou, Peoples R China
基金
国家重点研发计划;
关键词
Macroscopic fundamental diagram; Hamilton-Jacobi-Bellman equation; Neuro-dynamic programming; Policy iteration; Saturated state and input; URBAN ROAD NETWORKS; PERIMETER CONTROL; TRAFFIC CONTROL; NONLINEAR-SYSTEMS; GATING CONTROL; CONGESTION; MODEL; APPROXIMATION; DERIVATIVES; STABILITY;
D O I
10.1016/j.trc.2020.102628
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
The macroscopic fundamental diagram (MFD) can effectively reduce the spatial dimension involved in dynamic optimization of traffic performance for large-scale networks. Solving the Hamilton-Jacobi-Bellman (HJB) equation takes center stage in yielding solutions to the optimal control problem. At the core of solving the HJB equation is the value function that represents choosing a sequence of actions to optimize the system performance. However, this problem generally becomes intractable for possible discontinuities in the solution and the curse of dimensionality for systems with all but modest dimension. To address these challenges, a neural network is used to approximate the value function to obtain the optimal controls through policy iteration. Furthermore, a saturated operator is embedded in the neural network approximator to handle the difficulty caused by the control and state constraints. This policy iteration can be implemented as an iterative data-driven technique that integrates with the model-based optimal design based on real-time observations. Numerical experiments are conducted to show that the neuro-dynamic programming approach can achieve optimization goals while stabilizing the system by regulating the traffic state to the desired uncongested equilibrium.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Edge Computing Aided Congestion Control using Neuro-Dynamic Programming in NDN
    Qin, Jin
    Xing, Yitao
    Wei, Wenjia
    Xue, Kaiping
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [22] A neuro-dynamic programming approach to retailer inventory management
    Van Roy, B
    Bertsekas, DP
    Lee, Y
    Tsitsiklis, JN
    [J]. PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 4052 - 4057
  • [23] An Online Energy Management Control for Hybrid Electric Vehicles Based on Neuro-Dynamic Programming
    Qin, Feiyan
    Li, Weimin
    Hu, Yue
    Xu, Guoqing
    [J]. ALGORITHMS, 2018, 11 (03)
  • [24] Call admission control and routing in integrated services networks using neuro-dynamic programming
    Marbach, P
    Mihatsch, O
    Tsitsiklis, JN
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2000, 18 (02) : 197 - 208
  • [25] Optimal Constrained Neuro-Dynamic Programming Based Self-learning Battery Management in Microgrids
    Wei, Qinglai
    Liu, Derong
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III, 2016, 9949 : 193 - 201
  • [26] Neuro-dynamic programming for task allocation to unmanned aerial vehicles
    Kamel, A
    Anwar, MM
    Nygard, K
    [J]. INTELLIGENT AND ADAPTIVE SYSTEMS AND SOFTWARE ENGINEERING, 2004, : 121 - 127
  • [27] A neuro-dynamic programming approach to admission control in ATM networks: The single link case
    Marbach, P
    Tsitsiklis, JN
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 159 - 162
  • [28] Scheduling for Wafer Fabrication System Based on Neuro-Dynamic Programming
    Wang Ying
    Liu Jie
    [J]. PROCEEDINGS OF 2009 CONFERENCE ON COMMUNICATION FACULTY, 2009, : 93 - 98
  • [29] OPTIMIZATION OF A WHEY BIOPROCESS USING NEURO-DYNAMIC PROGRAMMING STRATEGY
    Ilkova, Tatiana
    Petrov, Mitko
    Roeva, Olympia
    [J]. BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2012, 26 (05) : 3249 - 3253
  • [30] Performance Potential-based Neuro-dynamic Programming for SMDPs
    TANG Hao YUAN Ji-Bin LU Yang CHENG Wen-Juan(School of Computer and Information
    [J]. 自动化学报, 2005, (04) : 150 - 153