Prediction model of crash severity in imbalanced dataset using data leveling methods and metaheuristic optimization algorithms

Cited by: 14
Authors
Danesh, Akbar [1]
Ehsani, Mehrdad [1]
Nejad, Fereidoon Moghadas [1]
Zakeri, Hamzeh [1]
Affiliations
[1] Amirkabir Univ Technol, Dept Civil & Environm Engn, Tehran, Iran
Keywords
Crash injury severity; imbalanced dataset; machine learning algorithm; prediction model; sensitivity analysis; data leveling methods; SUPPORT VECTOR MACHINE; DRIVER INJURY SEVERITY; INVASIVE WEED OPTIMIZATION; MULTINOMIAL LOGIT MODEL; BICYCLE CRASHES; HYBRID APPROACH; DECISION RULES; CLASSIFICATION; FREQUENCY; ACCIDENTS;
DOI
10.1080/13588265.2022.2028471
Chinese Library Classification (CLC) number
T [Industrial Technology];
Discipline classification code
08 ;
Abstract
Road accidents are a major problem worldwide and cause a large number of deaths. In a road crash dataset, fatal crash samples often constitute a very small proportion compared with non-fatal crash samples. For most machine learning algorithms, accurately predicting fatal crashes, the minority class, is one of the main challenges posed by such an imbalanced sample distribution. This study introduces data leveling methods based on two metaheuristic optimization algorithms (biogeography-based optimization and invasive weed optimization) to obtain more balanced data. Three machine learning algorithms, decision tree, support vector machine (SVM), and k-nearest neighbor, were then applied to the resulting balanced dataset. The models were evaluated by how much their accuracy in detecting fatal crashes improved. The results show that leveling the imbalanced dataset with metaheuristic algorithms improves the performance of crash prediction models in detecting fatal crashes, especially for the SVM algorithm, up to 100% compared with previous studies. In addition, sensitivity analysis of the developed model indicated that head-on crashes, curved roads, and large vehicles can increase the probability of fatal crashes up to 27.2%, 29%, and 36.8%, respectively, at high posted speed limits. Two-vehicle crashes are also much more likely to be fatal than single-vehicle crashes.
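The imbalance problem described in the abstract can be illustrated with a minimal sketch. It does not reproduce the paper's metaheuristic data leveling (biogeography-based optimization or invasive weed optimization); plain random undersampling is swapped in as a simpler stand-in. The toy labels, the 5/95 class split, and the helper names `undersample_majority` and `recall` are all assumptions made for illustration.

```python
import random
from collections import Counter


def undersample_majority(samples, labels, seed=0):
    """Randomly keep, for every class, as many samples as the rarest class has.

    This is plain random undersampling, NOT the paper's metaheuristic leveling.
    """
    rng = random.Random(seed)
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(x)
    target = min(len(xs) for xs in by_class.values())
    balanced = []
    for y, xs in by_class.items():
        for x in rng.sample(xs, target):
            balanced.append((x, y))
    rng.shuffle(balanced)
    xs, ys = zip(*balanced)
    return list(xs), list(ys)


def recall(y_true, y_pred, positive):
    """Fraction of truly positive samples that were predicted positive."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    return tp / (tp + fn) if tp + fn else 0.0


# Hypothetical toy dataset: 5 fatal crashes among 100 records.
labels = ["fatal"] * 5 + ["non-fatal"] * 95
samples = list(range(100))  # stand-in feature: just the record index

# A trivial model that always predicts the majority class looks accurate
# overall but never detects a fatal crash.
always_non_fatal = ["non-fatal"] * len(labels)
accuracy = sum(t == p for t, p in zip(labels, always_non_fatal)) / len(labels)
fatal_recall = recall(labels, always_non_fatal, "fatal")

# After undersampling, both classes are equally represented.
bal_x, bal_y = undersample_majority(samples, labels, seed=42)
counts = Counter(bal_y)
```

In this toy setting the always-majority predictor has high overall accuracy yet zero recall on the fatal class, which is why the abstract judges models by fatal-crash detection rather than by overall accuracy.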
Pages: 1869-1882
Page count: 14
Related papers
50 in total
  • [41] An improved adaptive neuro fuzzy inference system model using conjoined metaheuristic algorithms for electrical conductivity prediction
    Ahmadianfar, Iman
    Shirvani-Hosseini, Seyedehelham
    He, Jianxun
    Samadi-Koucheksaraee, Arvin
    Yaseen, Zaher Mundher
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [42] Using metaheuristic algorithms to optimize a mixed model-based ground-motion prediction model and associated variance components
    Mohsen Akhani
    Shahram Pezeshk
    [J]. Journal of Seismology, 2022, 26 : 483 - 498
  • [43] Using metaheuristic algorithms to optimize a mixed model-based ground-motion prediction model and associated variance components
    Akhani, Mohsen
    Pezeshk, Shahram
    [J]. JOURNAL OF SEISMOLOGY, 2022, 26 (03) : 483 - 498
  • [44] A Solution of Implicit Model of Series-Parallel Photovoltaic Arrays by Using Deterministic and Metaheuristic Global Optimization Algorithms
    Perez Archila, Luis Miguel
    David Bastidas-Rodriguez, Juan
    Correa, Rodrigo
    Trejos Grisales, Luz Adriana
    Gonzalez-Montoya, Daniel
    [J]. ENERGIES, 2020, 13 (04)
  • [45] Leveraging metaheuristic algorithms with improved hybrid prediction model framework for enhancing surface roughness optimization in CNC turning AISI 316
    Bennett, Kristin S.
    DePaiva, Jose Mario
    Lazar, Eden
    Veldhuis, Stephen C.
    [J]. International Journal of Advanced Manufacturing Technology, 2024, 135 (5-6) : 1955 - 1983
  • [46] Crash severity analysis: A data-enhanced double layer stacking model using semantic understanding
    Yang, Di
    Dong, Tao
    Wang, Peng
    [J]. HELIYON, 2024, 10 (09)
  • [47] Crash Severity Prediction Using Two-Layer Ensemble Machine Learning Model for Proactive Emergency Management
    Mansoor, Umer
    Ratrout, Nedal T.
    Rahman, Seyd Masiur
    Assi, Khaled
    [J]. IEEE ACCESS, 2020, 8 : 210750 - 210762
  • [48] Evaluation of Widely Used Hydroplaning Risk Prediction Methods Using Florida's Past Crash Data
    Jayasooriya, Waruna
    Gunaratne, Manjriker
    [J]. TRANSPORTATION RESEARCH RECORD, 2014, (2457) : 140 - 150
  • [49] Development of crash prediction models by assessing the role of perpetrators and victims: a comparison of ANN & logistic model using historical crash data
    Mohanty, Malaya
    Panda, Rachita
    Gandupalli, Srinivasa Rao
    Sonowal, Didriksha
    Muskan, Muskan
    Chakraborty, Riya
    Dangeti, Mukund R.
    [J]. INTERNATIONAL JOURNAL OF INJURY CONTROL AND SAFETY PROMOTION, 2023, 30 (02) : 155 - 171
  • [50] A drone-based data management and optimization using metaheuristic algorithms and blockchain smart contracts in a secure fog environment
    Khan, Abdullah Ayub
    Laghari, Asif Ali
    Gadekallu, Thippa Reddy
    Shaikh, Zaffar Ahmed
    Javed, Abdul Rehman
    Rashid, Mamoon
    V. Estrela, Vania
    Mikhaylov, Alexey
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102