Adaptive Metro Service Schedule and Train Composition With a Proximal Policy Optimization Approach Based on Deep Reinforcement Learning

被引:25
|
作者
Ying, Cheng-Shuo [1 ]
Chow, Andy H. F. [2 ]
Wang, Yi-Hui [3 ]
Chin, Kwai-Sang [1 ]
机构
[1] City Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Architecture & Civil Engn, Hong Kong, Peoples R China
[3] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Metro service scheduling; train composition; Markov decision process; deep reinforcement learning; proximal policy optimization; TIMETABLE OPTIMIZATION; PASSENGER DEMAND; CIRCULATION; SYSTEM;
D O I
10.1109/TITS.2021.3063399
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This paper presents an integrated metro service scheduling and train unit deployment with a proximal policy optimization approach based on the deep reinforcement learning framework. The optimization problem is formulated as a Markov decision process (MDP) subject to a set of operational constraints. To address the computational complexity, the value function and control policy are parameterized by artificial neural networks (ANNs) with which the operational constraints are incorporated through a devised mask scheme. A proximal policy optimization (PPO) approach is developed for training the ANNs via successive transition simulations. The optimization framework is implemented and tested on a real-world scenario configured with the Victoria Line of London Underground, UK. The results show that the performance of proposed methodology outperforms a set of selected evolutionary heuristics in terms of both solution quality and computational efficiency. Results illustrate the advantages of having flexible train composition in saving operational costs and reducing service irregularities. This study contributes to real time metro operations with limited resources and state-of-art optimization techniques.
引用
收藏
页码:6895 / 6906
页数:12
相关论文
共 50 条
  • [1] Deep reinforcement learning based train door adaptive control in metro tunnel evacuation optimization
    Shen, Yixin
    Ma, Jian
    Fang, Hongqiang
    Lo, S. M.
    Shi, Congling
    [J]. TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2022, 128
  • [2] Multi-agent deep reinforcement learning for adaptive coordinated metro service operations with flexible train composition
    Ying, Cheng-Shuo
    Chow, Andy H. F.
    Nguyen, Hoa T. M.
    Chin, Kwai-Sang
    [J]. TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2022, 161 : 36 - 59
  • [3] Reactive Power Optimization Based on Proximal Policy Optimization of Deep Reinforcement Learning
    Zahng, Pei
    Zhu, Zhujun
    Xie, Hua
    [J]. Dianwang Jishu/Power System Technology, 2023, 47 (02): : 562 - 570
  • [4] An Adaptive Model-Free Control Method for Metro Train Based on Deep Reinforcement Learning
    Lai, Wenzhu
    Chen, Dewang
    Huang, Yunhu
    Huang, Benzun
    [J]. ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 263 - 273
  • [5] PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming
    Naresh, Mandan
    Saxena, Paresh
    Gupta, Manik
    [J]. 2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 199 - 204
  • [6] Adaptive energy management strategy for FCHEV based on improved proximal policy optimization in deep reinforcement learning algorithm
    Lu, Xueqin
    Qian, Shenchen
    Zhai, Xinrui
    Wang, Peiyinquan
    Wu, Tao
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2024, 321
  • [7] Adaptive Service Composition Based on Reinforcement Learning
    Wang, Hongbing
    Zhou, Xuan
    Zhou, Xiang
    Liu, Weihong
    Li, Wenya
    Bouguettaya, Athman
    [J]. SERVICE-ORIENTED COMPUTING - ICSOC 2010, PROCEEDINGS, 2010, 6470 : 92 - +
  • [8] Deep Reinforcement Learning Based Train Driving Optimization
    Huang, Jin
    Zhang, Ende
    Zhang, Jiarui
    Huang, Siguang
    Zhong, Zhihua
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2375 - 2381
  • [9] Large-scale and adaptive service composition based on deep reinforcement learning
    Liu, Jiang-Wen
    Hu, Li-Qiang
    Cai, Zhao-Quan
    Xing, Li-Ning
    Tan, Xu
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 65
  • [10] Adaptive and large-scale service composition based on deep reinforcement learning
    Wang, Hongbing
    Gu, Mingzhu
    Yu, Qi
    Tao, Yong
    Li, Jiajie
    Fei, Huanhuan
    Yan, Jia
    Zhao, Wei
    Hong, Tianjing
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 75 - 90