A reinforcement learning-based approach for online bus scheduling

被引：4

作者：

Liu, Yingzhuo ^{[1
,2
]}

Zuo, Xingquan ^{[1
,2
]}

Ai, Guanqun ^{[1
,2
]}

Liu, Yahong ^{[1
,2
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing, Peoples R China

[2] Minist Educ, Key Lab Trustworthy Distributed Comp & Serv, Beijing, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2023年 / 271卷

关键词：

Bus scheduling; Online scheduling; Deep reinforcement learning; OPTIMIZATION MODEL; VEHICLE; ALGORITHM;

D O I：

10.1016/j.knosys.2023.110584

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Bus Scheduling Problem (BSP) is vital to save operational cost and ensure service quality. Existing approaches typically generate a bus scheduling scheme in an offline manner and then schedule vehicles according to the scheme. In practice, uncertain events such as traffic congestion occur frequently, which may make the originally planned bus scheduling scheme infeasible. This study proposes a Reinforcement Learning-based Bus Scheduling Approach (RL-BSA) for online bus scheduling. In RL-BSA, each departure time in a bus timetable is regarded as a decision point, and an agent makes a decision at the departure time to select a vehicle to depart at the time. The BSP is modeled as a Markov Decision Process (MDP) for the first time in literature. The state features are devised, which consist of real-time information of vehicles, including remaining working time, remaining driving time, rest time, number of executed trips and vehicle type. A reward function combining a final reward and a step-wise reward is devised. An invalid action masking approach is used to avoid the agent from selecting vehicles not meeting constraints. The agent is trained by interacting with a simulation environment and then the trained agent can schedule vehicles in an online manner. Experiments on real-world BSP instances show that RL-BSA can significantly reduce the number of vehicles used compared with the manual scheduling approach and Adaptive Large Neighborhood Search (ALNS). Under uncertain environment, RL-BSA can cover all departure times in the timetable without increasing the number of vehicles used.(c) 2023 Elsevier B.V. All rights reserved.

引用

页数：12

共 50 条

[1] Reinforcement Learning-Based Online Scheduling of Multiple Workflows in Edge Environment
Huang, Binbin
Wang, Lingbin
Liu, Xiao
Huang, Zixin
Yin, Yuyu
Zhu, Fujin
Wang, Shangguang
Deng, Shuiguang
[J]. IEEE Transactions on Network and Service Management, 2024, 21 (05): : 5691 - 5706
[2] A deep reinforcement learning-based approach for the residential appliances scheduling
Li, Sichen
Cao, Di
Huang, Qi
Zhang, Zhenyuan
Chen, Zhe
Blaabjerg, Frede
Hu, Weihao
[J]. ENERGY REPORTS, 2022, 8 : 1034 - 1042
[3] A Reinforcement Learning-based Approach to Dynamic Job-shop Scheduling
WEI Ying-Zi~(1
[J]. 自动化学报, 2005, (05) : 113 - 119
[4] A Generic Spatiotemporal Scheduling for Autonomous UAVs: A Reinforcement Learning-Based Approach
Bouhamed, Omar
Ghazzai, Hakim
Besbes, Hichem
Massoud, Yehia
[J]. IEEE OPEN JOURNAL OF VEHICULAR TECHNOLOGY, 2020, 1 : 93 - 106
[5] Beyond Max-weight Scheduling: A Reinforcement Learning-based Approach
Bae, Jeongmin
Lee, Joohyun
Chong, Song
[J]. 17TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2019), 2019, : 92 - 99
[6] DEEP REINFORCEMENT LEARNING-BASED IRRIGATION SCHEDULING
Yang, Y.
Hu, J.
Porter, D.
Marek, T.
Heflin, K.
Kong, H.
Sun, L.
[J]. TRANSACTIONS OF THE ASABE, 2020, 63 (03) : 549 - 556
[7] An improved deep reinforcement learning-based scheduling approach for dynamic task scheduling in cloud manufacturing
Wang, Xiaohan
Zhang, Lin
Liu, Yongkui
Laili, Yuanjun
[J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024, 62 (11) : 4014 - 4030
[8] Online Dispatching and Fair Scheduling of Edge Computing Tasks: A Learning-Based Approach
Yuan, Hao
Tang, Guoming
Li, Xinyi
Guo, Deke
Luo, Lailong
Luo, Xueshan
[J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (19): : 14985 - 14998
[9] Optimal Scheduling of Battery Energy Storage Systems Using a Reinforcement Learning-based Approach
Selim, Alaa
Mo, Huadong
Pota, Hemanshu
Dong, Daoyi
[J]. IFAC PAPERSONLINE, 2023, 56 (02): : 11741 - 11747
[10] Online EVs Vehicle-to-Grid Scheduling Coordinated with Multi-Energy Microgrids: A Deep Reinforcement Learning-Based Approach
Pan, Weiqi
Yu, Xiaorong
Guo, Zishan
Qian, Tao
Li, Yang
[J]. ENERGIES, 2024, 17 (11)

← 1 2 3 4 5 →