Prediction model of crash severity in imbalanced dataset using data leveling methods and metaheuristic optimization algorithms

被引:14
|
作者
Danesh, Akbar [1 ]
Ehsani, Mehrdad [1 ]
Nejad, Fereidoon Moghadas [1 ]
Zakeri, Hamzeh [1 ]
机构
[1] Amirkabir Univ Technol, Dept Civil & Environm Engn, Tehran, Iran
关键词
Crash injury severity; imbalanced dataset; machine learning algorithm; prediction model; sensitivity analysis; data leveling methods; SUPPORT VECTOR MACHINE; DRIVER INJURY SEVERITY; INVASIVE WEED OPTIMIZATION; MULTINOMIAL LOGIT MODEL; BICYCLE CRASHES; HYBRID APPROACH; DECISION RULES; CLASSIFICATION; FREQUENCY; ACCIDENTS;
D O I
10.1080/13588265.2022.2028471
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Road accident is one of the important problems in the world which caused large number of deaths. In a road crash dataset, the fatal crash samples, often constitute very small proportion in comparison with non-fatal crash samples. Accurate prediction of fatal crashes, as a minority class, is one of the important challenges in such imbalanced sample distribution in the most of machine learning algorithms. This study introduced data leveling methods based on two metaheuristic optimization algorithms (biogeography-based optimization and invasive weed optimization) to obtain more balanced data. Then, three machine learning algorithms including decision tree, support vector machine (SVM) and k-nearest neighbor were applied for obtained balanced dataset. Performances of the prepared models were evaluated by improving the accuracy of the models in detecting the fatal crashes. It is found that data leveling methods of imbalanced dataset with metaheuristic algorithms improve the performance of crash prediction models in detecting fatal crashes especially in SVM algorithm up to 100% compared to previous studies. Also, results of sensitivity analysis on the developed model represented that head-on crashes, curved roads, and large type vehicles can increase the probability of fatal crashes up to 27.2%, 29%, and 36.8% at high posted speed limit, respectively. Also, two-vehicle crashes are much more likely to be involved in fatal crashes than single-vehicle crashes.
引用
收藏
页码:1869 / 1882
页数:14
相关论文
共 50 条
  • [1] Handling Imbalanced Data in Road Crash Severity Prediction by Machine Learning Algorithms
    Fiorentini, Nicholas
    Losa, Massimo
    [J]. INFRASTRUCTURES, 2020, 5 (07)
  • [2] Enhancing Crash Injury Severity Prediction on Imbalanced Crash Data by Sampling Technique with Variable Selection
    Yahaya, Mahama
    Jiang, Xinguo
    Fu, Chuanyun
    Bashir, Kamal
    Fan, Wenbo
    [J]. 2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 363 - 368
  • [3] Modeling highly imbalanced crash severity data by ensemble methods and global sensitivity analysis
    Jiang, Liming
    Xie, Yuanchang
    Wen, Xiao
    Ren, Tianzhu
    [J]. JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2022, 14 (04) : 562 - 584
  • [4] A Combination of Metaheuristic Optimization Algorithms and Machine Learning Methods Improves the Prediction of Groundwater Level
    Kayhomayoon, Zahra
    Babaeian, Faezeh
    Milan, Sami Ghordoyee
    Azar, Naser Arya
    Berndtsson, Ronny
    [J]. WATER, 2022, 14 (05)
  • [5] Prediction of the Survival of Kidney Transplantation with imbalanced Data Using Intelligent Algorithms
    Hassani, Zeinab
    Emami, Nasibeh
    [J]. COMPUTER SCIENCE JOURNAL OF MOLDOVA, 2018, 26 (02) : 163 - 181
  • [6] Predicting child occupant crash injury severity in the United Arab Emirates using machine learning models for imbalanced dataset
    Abdulazeez, Muhammad Uba
    Khan, Wasif
    Abdullah, Kassim Abdulrahman
    [J]. IATSS RESEARCH, 2023, 47 (02) : 134 - 159
  • [7] A multivariate Poisson-lognormal regression model for prediction of crash counts by severity, using Bayesian methods
    Ma, Jianming
    Kockelman, Kara M.
    Damien, Paul
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2008, 40 (03): : 964 - 975
  • [8] Fuzzy Finite Element Model Updating Using Metaheuristic Optimization Algorithms
    Boulkaibet, I.
    Marwala, T.
    Friswell, M. I.
    Khodaparast, H. H.
    Adhikari, S.
    [J]. SPECIAL TOPICS IN STRUCTURAL DYNAMICS, VOL 6, 2017, : 91 - 101
  • [9] Metaheuristic Optimization Algorithms Hybridized With Artificial Intelligence Model for Soil Temperature Prediction: Novel Model
    Liu Penghui
    Ewees, Ahmed A.
    Beyaztas, Beste Hamiye
    Qi, Chongchong
    Salih, Sinan Q.
    Al-Ansari, Nadhir
    Bhagat, Suraj Kumar
    Yaseen, Zaher Mundher
    Singh, Vijay P.
    [J]. IEEE ACCESS, 2020, 8 : 51884 - 51904
  • [10] Assessment of Imbalanced Dataset in Alzheimer's disease Prediction using Data Mining Techniques
    Bonab, F. Rahbari
    Dezaje, M.
    Nourazarian, A. R.
    Kkhatoni, M. Asghari
    Asl, M. R. Kandovani
    [J]. INTERNATIONAL JOURNAL OF ADVANCED BIOTECHNOLOGY AND RESEARCH, 2016, 7 (04): : 1969 - 1975