Train delays prediction based on feature selection and random forest

被引:1
|
作者
Ji, Yuanyuan [1 ]
Zheng, Wei [1 ,2 ]
Dong, Hairong [3 ]
Gao, Pengfei [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Natl Res Ctr Railway Safety Assessment, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Intelligent Traff Data Secur & Pr, Beijing 100044, Peoples R China
[3] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/itsc45102.2020.9294653
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although trains are more efficient and convenient than other transportation, delays often occur. Accurately predicting the delay time of trains is of great significance to both dispatchers and passengers. The method for predicting the arrival delay time of trains is based on feature selection algorithm and machine learning. First, we collect train delay cases to sort out the delay factors. In addition to internal factors, external factors such as weather and signal failure are also considered. Then, an improved max-relevance and min-redundancy method (mRMR) is used for feature selection. Finally, we apply the method of weighted random forest (wRF) to predict the delay time. The results demonstrate that the feature selection algorithm has a prominent effect on improving the accuracy of the model, and the mean square error based on the weighted random forest has an improvement potential in forecast precision.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] TEHRAN AIR POLLUTANTS PREDICTION BASED ON RANDOM FOREST FEATURE SELECTION METHOD
    Shamsoddini, A.
    Aboodi, M. R.
    Karami, J.
    [J]. ISPRS INTERNATIONAL JOINT CONFERENCES OF THE 2ND GEOSPATIAL INFORMATION RESEARCH (GI RESEARCH 2017); THE 4TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING (SMPR 2017); THE 6TH EARTH OBSERVATION OF ENVIRONMENTAL CHANGES (EOEC 2017), 2017, 42-4 (W4): : 483 - 488
  • [2] Feature selection algorithm based on random forest
    Yao, Deng-Ju
    Yang, Jing
    Zhan, Xiao-Juan
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2014, 44 (01): : 137 - 141
  • [3] Prediction of Protein Cleavage Site with Feature Selection by Random Forest
    Li, Bi-Qing
    Cai, Yu-Dong
    Feng, Kai-Yan
    Zhao, Gui-Jun
    [J]. PLOS ONE, 2012, 7 (09):
  • [4] Prediction with Random Forest Involving Sampling and Feature Selection Strategies
    Cao, Min
    Zhang, Xiaolong
    Li, Bo
    Zhao, Jiafu
    [J]. PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 600 - 605
  • [5] Research on Feature Selection Methods based on Random Forest
    Wang, Zhuo
    [J]. TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2023, 30 (02): : 623 - 633
  • [6] Software Defect Prediction using Feature Selection and Random Forest Algorithm
    Ibrahim, Dyana Rashid
    Ghnemat, Rawan
    Hudaib, Amjad
    [J]. 2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 252 - 257
  • [7] A New Noisy Random Forest Based Method for Feature Selection
    Akhiat, Yassine
    Manzali, Youness
    Chahhou, Mohamed
    Zinedine, Ahmed
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2021, 21 (02) : 10 - 28
  • [8] Distance Correlation-Based Feature Selection in Random Forest
    Ratnasingam, Suthakaran
    Munoz-Lopez, Jose
    [J]. ENTROPY, 2023, 25 (09)
  • [9] Microgrid fault classification based on random forest feature selection
    Wang, Changhong
    Gao, Yanjie
    Tang, Min
    [J]. REVIEWS OF ADHESION AND ADHESIVES, 2023, 11 (02): : 220 - 237
  • [10] Intrusion Detection Model Based on Feature Selection and Random Forest
    Dong, Rui Hong
    Shui, Yong Li
    Zhang, Qiu Yu
    [J]. International Journal of Network Security, 2021, 23 (06) : 985 - 996