Data-driven train delay prediction incorporating dispatching commands: An XGBoost-metaheuristic framework

Cited by: 1
Authors
Gao, Tianze [1]
Chen, Junhua [1, 2]
Xu, Huizhang [1]
Affiliations
[1] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing 100044, Peoples R China
Keywords
rail transportation; train delay; prediction theory; data mining; feature extraction; MODEL; OPTIMIZATION; PROPAGATION; TIME;
DOI
10.1049/itr2.12461
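Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809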
Abstract
Train delays can significantly affect the punctuality and service quality of high-speed trains, and they also weigh heavily on dispatchers' decision-making. In this study, a data-driven train delay prediction framework based on XGBoost was proposed and strengthened by considering the impact of dispatching commands and the mechanisms of train delay propagation. Four metaheuristic algorithms were used to fine-tune the XGBoost hyperparameters. A dataset of 1.9 million records spanning 38 months of train operation was used for feature extraction and model training. The model's accuracy was evaluated using three statistical metrics, and the four tuning frameworks were compared. To emphasize the model's interpretability and its practical guidance for train rescheduling, the relationship among dispatching commands, delay propagation and delay prediction was validated by combining theory with practical results, and a SHAP (SHapley Additive exPlanations) analysis was used to explain the model more clearly. The results show that the different XGBoost-Metaheuristic models perform differently under different criteria, yet all achieve high accuracy and low prediction errors, indicating the potential of machine learning for train delay prediction, which is valuable for decision-making and rescheduling. This paper proposed and strengthened a data-driven train delay prediction framework by considering the impact of dispatching commands and the mechanisms of train delay propagation within an XGBoost-Metaheuristic framework. With a large volume of training data, the models' performance was enhanced and compared under different criteria or scenarios, which can provide valuable guidance for railway dispatching and scheduling.
Pages: 1777-1796
Page count: 20