Data-driven train delay prediction incorporating dispatching commands: An XGBoost-metaheuristic framework

Cited by: 1
Authors
Gao, Tianze [1]
Chen, Junhua [1,2]
Xu, Huizhang [1]
Affiliations
[1] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing 100044, Peoples R China
Keywords
rail transportation; train delay; prediction theory; data mining; feature extraction; MODEL; OPTIMIZATION; PROPAGATION; TIME
DOI
10.1049/itr2.12461
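Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract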
Train delays can significantly degrade the punctuality and service quality of high-speed trains, and they strongly influence dispatchers' decision-making. In this study, a data-driven train delay prediction framework based on XGBoost was proposed and strengthened by considering the impact of dispatching commands and the mechanisms of train delay propagation. Four metaheuristic algorithms were used to fine-tune its hyperparameters. A dataset of 1.9 million records spanning 38 months of train operation data was used for feature extraction and model training. The model's accuracy was evaluated using three statistical metrics, and the four tuning frameworks were compared. To emphasize the model's interpretability and its practical guidance for train rescheduling, the relationship among dispatching commands, delay propagation and delay prediction was validated by combining theory with practical results, and a SHAP (SHapley Additive exPlanations) analysis was used to explain the model more clearly. The results revealed that the different XGBoost-Metaheuristic models perform differently under different criteria, yet all demonstrated high accuracy and low prediction errors, indicating the potential of machine learning for train delay prediction, which is valuable for decision-making and rescheduling.
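The abstract does not name the four metaheuristic algorithms or the hyperparameters that were tuned, so the following is only a minimal sketch of the general idea of metaheuristic hyperparameter search. It assumes particle swarm optimization as the metaheuristic and uses a hypothetical synthetic function, `validation_error`, in place of training and scoring an XGBoost model. The hyperparameter names, bounds and swarm settings are illustrative assumptions, not values from the paper.

```python
import random


def validation_error(params):
    """Hypothetical stand-in for the validation error of a trained model.

    A real pipeline would train XGBoost with `params` and return an error
    metric. This synthetic bowl has its minimum at learning_rate=0.1 and
    max_depth=6 (both values are assumptions chosen for illustration).
    """
    learning_rate, max_depth = params
    return (learning_rate - 0.1) ** 2 + ((max_depth - 6.0) / 10.0) ** 2


def particle_swarm_search(objective, bounds, n_particles=20, n_iters=60, seed=0):
    """Minimize `objective` inside the box `bounds` with a basic particle swarm."""
    rng = random.Random(seed)
    dim = len(bounds)
    # Random initial positions inside the bounds, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    # Each particle's personal best and the swarm-wide best seen so far.
    best_pos = [p[:] for p in pos]
    best_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: best_val[i])
    g_pos, g_val = best_pos[g][:], best_val[g]
    w, c1, c2 = 0.7, 1.5, 1.5  # inertia, cognitive and social weights
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (best_pos[i][d] - pos[i][d])
                             + c2 * r2 * (g_pos[d] - pos[i][d]))
                lo, hi = bounds[d]
                # Move the particle and clip it back into the search box.
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = objective(pos[i])
            if val < best_val[i]:
                best_val[i], best_pos[i] = val, pos[i][:]
                if val < g_val:
                    g_val, g_pos = val, pos[i][:]
    return g_pos, g_val


# Assumed search ranges: learning_rate in [0.01, 0.5], max_depth in [2, 12].
BOUNDS = [(0.01, 0.5), (2.0, 12.0)]
best_params, best_error = particle_swarm_search(validation_error, BOUNDS)
print(best_params, best_error)
```

In practice the objective would wrap model training and a held-out error metric, and integer-valued hyperparameters such as tree depth would be rounded before use. Swapping in a different metaheuristic only changes the search routine, which is what makes a side-by-side comparison of tuning frameworks straightforward.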
Pages: 1777 - 1796
Page count: 20
Related papers
50 entries in total
  • [21] A purely data-driven framework for prediction, optimization, and control of networked processes
    Tavasoli, Ali
    Henry, Teague
    Shakeri, Heman
    ISA TRANSACTIONS, 2023, 138 : 491 - 503
  • [22] Data-driven prediction of product yields and control framework of hydrocracking unit
    Pang, Zheyuan
    Huang, Pan
    Lian, Cheng
    Peng, Chong
    Fang, Xiangcheng
    Liu, Honglai
    CHEMICAL ENGINEERING SCIENCE, 2024, 283
  • [23] Data-driven framework for the prediction of cutting force in turning
    Chatterjee, Kaustabh
    Zhang, Jian
    Dixit, Uday Shanker
    IET COLLABORATIVE INTELLIGENT MANUFACTURING, 2020, 2 (02) : 87 - 95
  • [24] Coupled Dynamic Data-Driven Framework for Forest Fire Spread Prediction
    Brun, Carlos
    Cortes, Ana
    Margalef, Tomas
    DYNAMIC DATA-DRIVEN ENVIRONMENTAL SYSTEMS SCIENCE, DYDESS 2014, 2015, 8964 : 54 - 67
  • [25] An Efficient Data-Driven Traffic Prediction Framework for Network Digital Twin
    Nan, Haihan
    Li, Ruidong
    Zhu, Xiaoyan
    Ma, Jianfeng
    Niyato, Dusit
    IEEE NETWORK, 2024, 38 (01) : 22 - 29
  • [26] A Data-Driven Prediction Framework for Analyzing and Monitoring Business Process Performances
    Bevacqua, Antonio
    Carnuccio, Marco
    Folino, Francesco
    Guarascio, Massimo
    Pontieri, Luigi
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2013, 2014, 190 : 100 - 117
  • [27] On the Prediction of Aerosol-Cloud Interactions Within a Data-Driven Framework
    Li, Xiang-Yu
    Wang, Hailong
    Chakraborty, T.C.
    Sorooshian, Armin
    Ziemba, Luke D.
    Voigt, Christiane
    Thornhill, Kenneth Lee
    Yuan, Emma
    Geophysical Research Letters, 2024, 51 (24)
  • [28] A Data-Driven Expectation Prediction Framework Based on Social Exchange Theory
    Cao, Enguo
    Jiang, Jinzhi
    Duan, Yanjun
    Peng, Hui
    FRONTIERS IN PSYCHOLOGY, 2022, 12
  • [29] Research on the dynamic data-driven application system architecture for flight delay prediction
    Chen, Haiyan
    Wang, Jiandong
    Feng, Lirong
    Journal of Software, 2012, 7 (02) : 263 - 268
  • [30] Data-Driven Materials Modeling with XGBoost Algorithm and Statistical Inference Analysis for Prediction of Fatigue Strength of Steels
    Deok-Kee Choi
    International Journal of Precision Engineering and Manufacturing, 2019, 20 : 129 - 138