A reinforcement learning-based approach for online bus scheduling

被引:4
|
作者
Liu, Yingzhuo [1 ,2 ]
Zuo, Xingquan [1 ,2 ]
Ai, Guanqun [1 ,2 ]
Liu, Yahong [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing, Peoples R China
[2] Minist Educ, Key Lab Trustworthy Distributed Comp & Serv, Beijing, Peoples R China
关键词
Bus scheduling; Online scheduling; Deep reinforcement learning; OPTIMIZATION MODEL; VEHICLE; ALGORITHM;
D O I
10.1016/j.knosys.2023.110584
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bus Scheduling Problem (BSP) is vital to save operational cost and ensure service quality. Existing approaches typically generate a bus scheduling scheme in an offline manner and then schedule vehicles according to the scheme. In practice, uncertain events such as traffic congestion occur frequently, which may make the originally planned bus scheduling scheme infeasible. This study proposes a Reinforcement Learning-based Bus Scheduling Approach (RL-BSA) for online bus scheduling. In RL-BSA, each departure time in a bus timetable is regarded as a decision point, and an agent makes a decision at the departure time to select a vehicle to depart at the time. The BSP is modeled as a Markov Decision Process (MDP) for the first time in literature. The state features are devised, which consist of real-time information of vehicles, including remaining working time, remaining driving time, rest time, number of executed trips and vehicle type. A reward function combining a final reward and a step-wise reward is devised. An invalid action masking approach is used to avoid the agent from selecting vehicles not meeting constraints. The agent is trained by interacting with a simulation environment and then the trained agent can schedule vehicles in an online manner. Experiments on real-world BSP instances show that RL-BSA can significantly reduce the number of vehicles used compared with the manual scheduling approach and Adaptive Large Neighborhood Search (ALNS). Under uncertain environment, RL-BSA can cover all departure times in the timetable without increasing the number of vehicles used.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Reinforcement Learning-Based Online Scheduling of Multiple Workflows in Edge Environment
    Huang, Binbin
    Wang, Lingbin
    Liu, Xiao
    Huang, Zixin
    Yin, Yuyu
    Zhu, Fujin
    Wang, Shangguang
    Deng, Shuiguang
    [J]. IEEE Transactions on Network and Service Management, 2024, 21 (05): : 5691 - 5706
  • [2] A deep reinforcement learning-based approach for the residential appliances scheduling
    Li, Sichen
    Cao, Di
    Huang, Qi
    Zhang, Zhenyuan
    Chen, Zhe
    Blaabjerg, Frede
    Hu, Weihao
    [J]. ENERGY REPORTS, 2022, 8 : 1034 - 1042
  • [3] A Reinforcement Learning-based Approach to Dynamic Job-shop Scheduling
    WEI Ying-Zi~(1
    [J]. 自动化学报, 2005, (05) : 113 - 119
  • [4] A Generic Spatiotemporal Scheduling for Autonomous UAVs: A Reinforcement Learning-Based Approach
    Bouhamed, Omar
    Ghazzai, Hakim
    Besbes, Hichem
    Massoud, Yehia
    [J]. IEEE OPEN JOURNAL OF VEHICULAR TECHNOLOGY, 2020, 1 : 93 - 106
  • [5] Beyond Max-weight Scheduling: A Reinforcement Learning-based Approach
    Bae, Jeongmin
    Lee, Joohyun
    Chong, Song
    [J]. 17TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2019), 2019, : 92 - 99
  • [6] DEEP REINFORCEMENT LEARNING-BASED IRRIGATION SCHEDULING
    Yang, Y.
    Hu, J.
    Porter, D.
    Marek, T.
    Heflin, K.
    Kong, H.
    Sun, L.
    [J]. TRANSACTIONS OF THE ASABE, 2020, 63 (03) : 549 - 556
  • [7] An improved deep reinforcement learning-based scheduling approach for dynamic task scheduling in cloud manufacturing
    Wang, Xiaohan
    Zhang, Lin
    Liu, Yongkui
    Laili, Yuanjun
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024, 62 (11) : 4014 - 4030
  • [8] Online Dispatching and Fair Scheduling of Edge Computing Tasks: A Learning-Based Approach
    Yuan, Hao
    Tang, Guoming
    Li, Xinyi
    Guo, Deke
    Luo, Lailong
    Luo, Xueshan
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (19): : 14985 - 14998
  • [9] Optimal Scheduling of Battery Energy Storage Systems Using a Reinforcement Learning-based Approach
    Selim, Alaa
    Mo, Huadong
    Pota, Hemanshu
    Dong, Daoyi
    [J]. IFAC PAPERSONLINE, 2023, 56 (02): : 11741 - 11747
  • [10] Online EVs Vehicle-to-Grid Scheduling Coordinated with Multi-Energy Microgrids: A Deep Reinforcement Learning-Based Approach
    Pan, Weiqi
    Yu, Xiaorong
    Guo, Zishan
    Qian, Tao
    Li, Yang
    [J]. ENERGIES, 2024, 17 (11)