Adaptive Metro Service Schedule and Train Composition With a Proximal Policy Optimization Approach Based on Deep Reinforcement Learning

Cited by: 25

Authors:

Ying, Cheng-Shuo [1]

Chow, Andy H. F. [2]

Wang, Yi-Hui [3]

Chin, Kwai-Sang [1]

Affiliations:

[1] City Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Peoples R China

[2] City Univ Hong Kong, Dept Architecture & Civil Engn, Hong Kong, Peoples R China

[3] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022 / Vol. 23 / No. 07

Funding:

National Natural Science Foundation of China;

Keywords:

Metro service scheduling; train composition; Markov decision process; deep reinforcement learning; proximal policy optimization; TIMETABLE OPTIMIZATION; PASSENGER DEMAND; CIRCULATION; SYSTEM;

DOI:

10.1109/TITS.2021.3063399

Chinese Library Classification number:

TU [Building Science];

Discipline classification code:

0813;

Abstract:

This paper presents an integrated approach to metro service scheduling and train unit deployment, using proximal policy optimization within a deep reinforcement learning framework. The optimization problem is formulated as a Markov decision process (MDP) subject to a set of operational constraints. To address the computational complexity, the value function and control policy are parameterized by artificial neural networks (ANNs), in which the operational constraints are incorporated through a devised mask scheme. A proximal policy optimization (PPO) approach is developed for training the ANNs via successive transition simulations. The optimization framework is implemented and tested on a real-world scenario configured with the Victoria Line of London Underground, UK. The results show that the proposed methodology outperforms a set of selected evolutionary heuristics in terms of both solution quality and computational efficiency. The results also illustrate the advantages of flexible train composition in saving operational costs and reducing service irregularities. This study contributes to real-time metro operations with limited resources and state-of-the-art optimization techniques.
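The two mechanisms named in the abstract, a mask that keeps infeasible actions out of the policy and the clipped PPO objective, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `masked_softmax` and `ppo_clip_objective`, the example actions, and the clipping value 0.2 are assumptions made only for illustration, and the paper's actual mask scheme and network design are not described in this record.

```python
import numpy as np


def masked_softmax(logits, feasible):
    """Convert raw policy scores into probabilities over feasible actions only.

    Hypothetical helper: infeasible actions (e.g. a train composition that
    would exceed the available fleet) receive probability exactly zero.
    """
    logits = np.asarray(logits, dtype=float)
    feasible = np.asarray(feasible, dtype=bool)
    if not feasible.any():
        raise ValueError("at least one action must be feasible")
    # Infeasible scores become -inf, so exp() maps them to 0.
    masked = np.where(feasible, logits, -np.inf)
    # Subtract the largest feasible score for numerical stability.
    weights = np.exp(masked - masked[feasible].max())
    return weights / weights.sum()


def ppo_clip_objective(new_probs, old_probs, advantages, clip_eps=0.2):
    """Standard PPO clipped surrogate objective (to be maximized).

    clip_eps=0.2 is a commonly used default, assumed here for illustration.
    """
    ratio = np.asarray(new_probs, dtype=float) / np.asarray(old_probs, dtype=float)
    adv = np.asarray(advantages, dtype=float)
    clipped = np.clip(ratio, 1.0 - clip_eps, 1.0 + clip_eps)
    # Take the pessimistic (smaller) of the unclipped and clipped terms.
    return float(np.mean(np.minimum(ratio * adv, clipped * adv)))


# Three hypothetical actions with equal scores; the third is infeasible.
probs = masked_softmax([0.0, 0.0, 0.0], [True, True, False])
print(probs)  # [0.5 0.5 0. ]
```

Masking before normalization means the policy never samples an infeasible action, so constraint violations do not need to be penalized in the reward. The clipping step limits how far a single update can move the policy away from the one that generated the simulated transitions.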


Pages: 6895-6906

Page count: 12

50 items in total

[1] Deep reinforcement learning based train door adaptive control in metro tunnel evacuation optimization
Shen, Yixin
Ma, Jian
Fang, Hongqiang
Lo, S. M.
Shi, Congling
[J]. TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2022, 128
[2] Multi-agent deep reinforcement learning for adaptive coordinated metro service operations with flexible train composition
Ying, Cheng-Shuo
Chow, Andy H. F.
Nguyen, Hoa T. M.
Chin, Kwai-Sang
[J]. TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2022, 161 : 36 - 59
[3] Reactive Power Optimization Based on Proximal Policy Optimization of Deep Reinforcement Learning
Zahng, Pei
Zhu, Zhujun
Xie, Hua
[J]. Dianwang Jishu/Power System Technology, 2023, 47 (02): : 562 - 570
[4] An Adaptive Model-Free Control Method for Metro Train Based on Deep Reinforcement Learning
Lai, Wenzhu
Chen, Dewang
Huang, Yunhu
Huang, Benzun
[J]. ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 263 - 273
[5] PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming
Naresh, Mandan
Saxena, Paresh
Gupta, Manik
[J]. 2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 199 - 204
[6] Adaptive energy management strategy for FCHEV based on improved proximal policy optimization in deep reinforcement learning algorithm
Lu, Xueqin
Qian, Shenchen
Zhai, Xinrui
Wang, Peiyinquan
Wu, Tao
[J]. ENERGY CONVERSION AND MANAGEMENT, 2024, 321
[7] Adaptive Service Composition Based on Reinforcement Learning
Wang, Hongbing
Zhou, Xuan
Zhou, Xiang
Liu, Weihong
Li, Wenya
Bouguettaya, Athman
[J]. SERVICE-ORIENTED COMPUTING - ICSOC 2010, PROCEEDINGS, 2010, 6470 : 92 - +
[8] Deep Reinforcement Learning Based Train Driving Optimization
Huang, Jin
Zhang, Ende
Zhang, Jiarui
Huang, Siguang
Zhong, Zhihua
[J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2375 - 2381
[9] Large-scale and adaptive service composition based on deep reinforcement learning
Liu, Jiang-Wen
Hu, Li-Qiang
Cai, Zhao-Quan
Xing, Li-Ning
Tan, Xu
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 65
[10] Adaptive and large-scale service composition based on deep reinforcement learning
Wang, Hongbing
Gu, Mingzhu
Yu, Qi
Tao, Yong
Li, Jiajie
Fei, Huanhuan
Yan, Jia
Zhao, Wei
Hong, Tianjing
[J]. KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 75 - 90
