Ranking Risk Factors in Financial Losses From Railroad Incidents: A Machine Learning Approach

被引:5
|
作者
Dhingra, Neeraj [1 ]
Bridgelall, Raj [2 ]
Lu, Pan [1 ]
Szmerekovsky, Joseph [1 ]
Bhardwaj, Bhavana [3 ]
机构
[1] North Dakota State Univ, Dept Transportat Logist & Finance, Fargo, ND 58105 USA
[2] North Dakota State Univ, Dept Transportat Logist & Finance, Plano, TX USA
[3] Valley City State Univ, Dept Comp Syst & Software Engn, Valley City, ND USA
关键词
rail; rail safety; machine learning; human factors in crashes; system safety; data science; class I rail; freight train; INJURY SEVERITY; ALGORITHMS; REGRESSION; ACCIDENTS; ENSEMBLE; RULES; MODEL;
D O I
10.1177/03611981221133085
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The reported financial losses from railroad accidents since 2009 have been more than US$4.11 billion dollars. This considerable loss is a major concern for the industry, society, and the government. Therefore, identifying and ranking the factors that contribute to financial losses from railroad accidents would inform strategies to minimize them. To achieve that goal, this paper evaluates and compares the results of applying different non-parametric statistical and regression methods to 15 years of railroad Class I freight train accident data. The models compared are random forest, k-nearest neighbors, support vector machines, stochastic gradient boosting, extreme gradient boosting, and stepwise linear regression. The results indicate that these methods are all suitable for analyzing non-linear and heterogeneous railroad incident data. However, the extreme gradient boosting method provided the best performance. Therefore, the analysis used that model to identify and rank factors that contribute to financial losses, based on the gain percentage of the prediction accuracy. The number of derailed freight cars and the absence of territory signalization dominated as contributing factors in more than 57% and 20% of the accidents, respectively. Partial-dependence plots further explore the complex non-linear dependencies of each factor to better visualize and interpret the results.
引用
收藏
页码:299 / 309
页数:11
相关论文
共 50 条
  • [1] Machine Learning Approach to Task Ranking
    Ricky, Michael Yoseph
    Hendric, Spits Warnars Harco Leslie
    Budiharto, Widodo
    Abbas, Bahtiar Saleh
    [J]. 2017 14TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS AND NETWORKS & 2017 11TH INTERNATIONAL CONFERENCE ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY & 2017 THIRD INTERNATIONAL SYMPOSIUM OF CREATIVE COMPUTING (ISPAN-FCST-ISCC), 2017, : 507 - 513
  • [2] A Machine Learning Approach for Detecting Traffic Incidents from Video Cameras
    Gabrielli, Guillermo
    Ferreira, Ignacio
    Dalchiele, Pablo
    Tchernykh, Andrei
    Nesmachnow, Sergio
    [J]. SMART CITIES (ICSC-CITIES 2021), 2022, 1555 : 162 - 177
  • [3] Revisiting the Risk Factors for Endometriosis: A Machine Learning Approach
    Blass, Ido
    Sahar, Tali
    Shraibman, Adi
    Ofer, Dan
    Rappoport, Nadav
    Linial, Michal
    [J]. JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (07):
  • [4] Factors affecting the quality of financial statements from an audit point of view: A machine learning approach
    Hung, Dang Ngoc
    Van, Vu Thi Thuy
    Archer, Lan
    [J]. COGENT BUSINESS & MANAGEMENT, 2023, 10 (01):
  • [5] A Machine Learning Approach for Ranking in Question Answering
    Amato, Alba
    Coronato, Antonio
    [J]. ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC-2017), 2018, 13 : 89 - 98
  • [6] A MACHINE LEARNING APPROACH FOR THE IDENTIFICATION OF RISK FACTORS FOR CARDIOVASCULAR DISEASE
    Coelho, J. R.
    Gaspar, I. M.
    Silva, A. M.
    Freitas, A. T.
    [J]. CARDIOLOGY, 2013, 126 : 272 - 272
  • [7] Gene ranking from microarray data for cancer classification -: A machine learning approach
    Ruiz, Roberto
    Pontes, Beatriz
    Giraldez, Raul
    Aguilar-Ruiz, Jesus S.
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2006, 4252 : 1272 - 1280
  • [8] Identification of IT Incidents for Improved Risk Analysis by Using Machine Learning
    Sulaman, Sardar Muhammad
    Weyns, Kim
    Host, Martin
    [J]. PROCEEDINGS 41ST EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS SEAA 2015, 2015, : 369 - 373
  • [9] WEB PAGE RANKING USING MACHINE LEARNING APPROACH
    Chauhan, Vijay
    Jaiswal, Arunima
    Khan, Junaid Khalid
    [J]. 2015 5TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION TECHNOLOGIES ACCT 2015, 2015, : 575 - 580
  • [10] A machine-learning approach to ranking RDF properties
    Dessi, Andrea
    Atzori, Maurizio
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 54 : 366 - 377