Data-driven train delay prediction incorporating dispatching commands: An XGBoost-metaheuristic framework

Cited by: 1
Authors
Gao, Tianze [1]
Chen, Junhua [1, 2]
Xu, Huizhang [1]
Affiliations
[1] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing 100044, Peoples R China
Keywords
rail transportation; train delay; prediction theory; data mining; feature extraction; MODEL; OPTIMIZATION; PROPAGATION; TIME;
DOI
10.1049/itr2.12461
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
Train delays can significantly affect the punctuality and service quality of high-speed trains, and they also play a crucial role in dispatchers' decision-making. In this study, a data-driven train delay prediction framework based on XGBoost was proposed and strengthened by considering the impact of dispatching commands and the mechanisms of train delay propagation. Four metaheuristic algorithms were used to fine-tune its hyperparameters. A dataset of 1.9 million records spanning 38 months of train operation was used for feature extraction and model training. The model's accuracy was evaluated using three statistical metrics, and the four tuning frameworks were compared. To emphasize the model's interpretability and its practical guidance for train rescheduling, the relationship among dispatching commands, delay propagation and delay prediction was validated by combining theory with practical results, and a SHAP (SHapley Additive exPlanations) analysis was used to explain the model more clearly. The results revealed that the different XGBoost-Metaheuristic models perform differently under different criteria, yet all of them demonstrated high accuracy and low prediction errors, which indicates the potential of machine learning for train delay prediction and its value for decision-making and rescheduling. This paper proposed and strengthened a data-driven train delay prediction framework by considering the impact of dispatching commands and the mechanisms of train delay propagation within an XGBoost-Metaheuristic framework. Using a large training dataset, the models' performance was enhanced and compared under different criteria or scenarios, which can provide valuable guidance for railway dispatching and scheduling.
Pages: 1777-1796
Page count: 20