Prediction model of crash severity in imbalanced dataset using data leveling methods and metaheuristic optimization algorithms

被引:14
|
作者
Danesh, Akbar [1 ]
Ehsani, Mehrdad [1 ]
Nejad, Fereidoon Moghadas [1 ]
Zakeri, Hamzeh [1 ]
机构
[1] Amirkabir Univ Technol, Dept Civil & Environm Engn, Tehran, Iran
关键词
Crash injury severity; imbalanced dataset; machine learning algorithm; prediction model; sensitivity analysis; data leveling methods; SUPPORT VECTOR MACHINE; DRIVER INJURY SEVERITY; INVASIVE WEED OPTIMIZATION; MULTINOMIAL LOGIT MODEL; BICYCLE CRASHES; HYBRID APPROACH; DECISION RULES; CLASSIFICATION; FREQUENCY; ACCIDENTS;
D O I
10.1080/13588265.2022.2028471
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Road accident is one of the important problems in the world which caused large number of deaths. In a road crash dataset, the fatal crash samples, often constitute very small proportion in comparison with non-fatal crash samples. Accurate prediction of fatal crashes, as a minority class, is one of the important challenges in such imbalanced sample distribution in the most of machine learning algorithms. This study introduced data leveling methods based on two metaheuristic optimization algorithms (biogeography-based optimization and invasive weed optimization) to obtain more balanced data. Then, three machine learning algorithms including decision tree, support vector machine (SVM) and k-nearest neighbor were applied for obtained balanced dataset. Performances of the prepared models were evaluated by improving the accuracy of the models in detecting the fatal crashes. It is found that data leveling methods of imbalanced dataset with metaheuristic algorithms improve the performance of crash prediction models in detecting fatal crashes especially in SVM algorithm up to 100% compared to previous studies. Also, results of sensitivity analysis on the developed model represented that head-on crashes, curved roads, and large type vehicles can increase the probability of fatal crashes up to 27.2%, 29%, and 36.8% at high posted speed limit, respectively. Also, two-vehicle crashes are much more likely to be involved in fatal crashes than single-vehicle crashes.
引用
收藏
页码:1869 / 1882
页数:14
相关论文
共 50 条
  • [21] Optimization of a Transit Services Model with a Feeder Bus and Rail System Using Metaheuristic Algorithms
    Almasi, Mohammad Hadi
    Sadollah, Ali
    Mounes, Sina Mirzapour
    Karim, Mohamed Rehan
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2015, 29 (06)
  • [22] Crash Density and Severity Prediction Using Recurrent Neural Networks Combined with Particle Swarm Optimization
    Xu, Xinxin
    Zeng, Ziqiang
    Wang, Yinhai
    Ash, John
    [J]. PROCEEDINGS OF THE THIRTEENTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, VOL 1, 2020, 1001 : 566 - 580
  • [23] An NLP-Inspired Data Augmentation Method for Adverse Event Prediction Using an Imbalanced Healthcare Dataset
    Ishikawa, Tomoki
    Yakoh, Takahiro
    Urushihara, Hisashi
    [J]. IEEE ACCESS, 2022, 10 : 81166 - 81176
  • [24] Nature inspired optimization model for classification and severity prediction in COVID-19 clinical dataset
    Suma, L. S.
    Anand, H. S.
    Vinod Chandra, S. S.
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (3) : 1699 - 1711
  • [25] Nature inspired optimization model for classification and severity prediction in COVID-19 clinical dataset
    L. S. Suma
    H. S. Anand
    S. S. Vinod chandra
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 1699 - 1711
  • [26] Optimized prediction models for faulting failure of Jointed Plain concrete pavement using the metaheuristic optimization algorithms
    Ehsani, Mehrdad
    Hamidian, Pouria
    Hajikarimi, Pouria
    Nejad, Fereidoon Moghadas
    [J]. CONSTRUCTION AND BUILDING MATERIALS, 2023, 364
  • [27] Analysis of Breast Cancer Dataset Using Big Data Algorithms for Accuracy of Diseases Prediction
    Sinha, Ankita
    Sahoo, Bhaswati
    Rautaray, Siddharth Swarup
    Pandey, Manjusha
    [J]. SECOND INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND COMMUNICATION TECHNOLOGIES, ICCNCT 2019, 2020, 44 : 271 - 277
  • [28] Offset Well Design Optimization Using a Surrogate Model and Metaheuristic Algorithms: A Bakken Case Study
    Merzoug, Ahmed
    Rasouli, Vamegh
    [J]. ENG, 2023, 4 (02): : 1290 - 1305
  • [29] Investigating the Role of Clustering in Construction-Accident Severity Prediction Using a Heterogeneous and Imbalanced Data Set
    Salarian, Ali Akbar
    Etemadfard, Hossein
    Rahimzadegan, Ali
    Ghalehnovi, Mansour
    [J]. JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2023, 149 (02)
  • [30] Advancement of weather-related crash prediction model using nonparametric machine learning algorithms
    Amit Ranjan Mondal
    Md Abul Ehsan Bhuiyan
    Feifei Yang
    [J]. SN Applied Sciences, 2020, 2