Data-driven train delay prediction incorporating dispatching commands: An XGBoost-metaheuristic framework

被引:1
|
作者
Gao, Tianze [1 ]
Chen, Junhua [1 ,2 ]
Xu, Huizhang [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing 100044, Peoples R China
关键词
rail transportation; train delay; prediction theory; data mining; feature extraction; MODEL; OPTIMIZATION; PROPAGATION; TIME;
D O I
10.1049/itr2.12461
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Train delays can significantly impact the punctuality and service quality of high-speed trains, which also play a crucial role in affecting dispatchers with their decision-making. In this study, a data-driven train delay prediction framework was proposed and strengthened by considering the impact of dispatching commands and the mechanisms of train delay propagation using XGBoost. Four metaheuristic algorithms were utilized to fine-tune its hyperparameters. A vast dataset comprising 1.9 million records spanning 38 months of train operation data was utilized for feature extraction and model training. The model's accuracy was evaluated using three statistical metrics, and a comparison of the four tuning frameworks was performed. To emphasize the model's interpretability and its practical guidance for train rescheduling, the relationship of dispatching commands, delay propagation and delay prediction was validated by combining the theory and practical results, and a SHAP (SHapley Additive exPlanations) analysis was used for a clearer model explanation. The results revealed that distinct XGBoost-Metaheuristic models exhibit unique effects in different criteria, yet they all demonstrated high accuracy and low prediction errors, thereby revealing the potential of using machine learning for train delay prediction, which is valuable for decision-making and rescheduling. This paper proposed and strengthened the data-driven train delay prediction framework by considering the impact of dispatching commands and the mechanisms of train delay propagation using XGBoost-Metaheuristic framework. Using a vast volume of training dataset, the models' performance was enhanced and compared under different criteria or scenarios, which can provide valuable guidance for railway dispatching and scheduling.image
引用
下载
收藏
页码:1777 / 1796
页数:20
相关论文
共 50 条
  • [11] Data-driven XGBoost model for maximum stress prediction of additive manufactured lattice structures
    Zhiwei Zhang
    Yuyan Zhang
    Yintang Wen
    Yaxue Ren
    Complex & Intelligent Systems, 2023, 9 : 5881 - 5892
  • [12] Data-driven prognostic framework for remaining useful life prediction
    Motrani A.
    Noureddine R.
    International Journal of Industrial and Systems Engineering, 2023, 43 (02) : 210 - 221
  • [13] A hybrid data-driven framework for loss prediction of MCA airfoils
    Zeinalzadeh, A.
    Kamakoli, G. Hosseinzadeh
    Pakatchian, MR.
    ENGINEERING ANALYSIS WITH BOUNDARY ELEMENTS, 2024, 163 : 394 - 405
  • [14] A data-driven framework for conceptual cost estimation of infrastructure projects using XGBoost and Bayesian optimization
    Zhang, Jiashu
    Yuan, Jingfeng
    Mahmoudi, Amin
    Ji, Wenying
    Fang, Qiushi
    JOURNAL OF ASIAN ARCHITECTURE AND BUILDING ENGINEERING, 2023,
  • [15] The Prediction of Flight Delay: Big Data-driven Machine Learning Approach
    Huo, Jiage
    Keung, K. L.
    Lee, C. K. M.
    Ng, Kam K. H.
    Li, K. C.
    2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2020, : 190 - 194
  • [16] A GTFS data acquisition and processing framework and its application to train delay prediction
    Wu, Jianqing
    Du, Bo
    Gong, Zengyang
    Wu, Qiang
    Shen, Jun
    Zhou, Luping
    Cai, Chen
    INTERNATIONAL JOURNAL OF TRANSPORTATION SCIENCE AND TECHNOLOGY, 2023, 12 (01) : 201 - 216
  • [17] A Hybrid Data-Driven Metaheuristic Framework to Optimize Strain of Lattice Structures Proceeded by Additive Manufacturing
    Zhang, Tao
    Sajjad, Uzair
    Sengupta, Akash
    Ali, Mubasher
    Sultan, Muhammad
    Hamid, Khalid
    MICROMACHINES, 2023, 14 (10)
  • [18] Data-Driven Metro Train Crowding Prediction Based on Real-Time Load Data
    Jenelius, Erik
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (06) : 2254 - 2265
  • [19] Data-driven models for the steady thermal performance prediction of energy piles optimized by metaheuristic algorithms
    Hu, Shuaijun
    Kong, Gangqiang
    Zhang, Changsen
    Fu, Jinghui
    Li, Shiyao
    Yang, Qing
    Energy, 2024, 313
  • [20] Data-Driven Framework for Electrode Wear Prediction in Resistance Spot Welding
    Panza, Luigi
    Bruno, Giulia
    De Maddis, Manuela
    Lombardi, Franco
    Spena, Pasquale Russo
    Traini, Emiliano
    PRODUCT LIFECYCLE MANAGEMENT: GREEN AND BLUE TECHNOLOGIES TO SUPPORT SMART AND SUSTAINABLE ORGANIZATIONS, PT I, 2022, 639 : 239 - 252