Road Crashes Analysis and Prediction using Gradient Boosted and Random Forest Trees

被引:0
|
作者
Elyassami, Sanaa [1 ]
Hamid, Yasir [1 ]
Habuza, Tetiana [2 ]
机构
[1] Abu Dhabi Polytech, Dept Informat Secur Engn Technol, Abu Dhabi, U Arab Emirates
[2] UAE Univ, Coll Informat Technol, Dept Comp Sci & Software Engn, Al Ain, U Arab Emirates
关键词
Crash Data Analysis; Decision Tree; Machine Learning; Random Forest; Gradient Boosted Tree;
D O I
10.1109/CIST49399.2021.9357298
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
People lose their lives every day due to road traffic crashes. The problem is so humongous globally that the World Health Organization, in its Sustainable Development Agenda 2030, is inviting the coordinates efforts across nations towards it and aspiring to cut down the deaths and injuries to half. Taking a clue from that, the proposed work is undertaken to build machine learning-based models for analyzing the crash data, identifying the important risk factors, and predict the injury severity of drivers. The proposed work studied and analyzed several factors of road accidents to create an accurate and interpretable model that predicts the occurrence and severity of car accidents by investigating crash causal factors and crash severity factors. In the proposed work, we employed three machine learning algorithms to vis-a-vis Decision Tree, Random Forest, and Gradient Boosted tree on Statewide Vehicle Crashes Dataset provided by Maryland State Police. The gradient boosted-based model reported the highest prediction accuracy and provided the most influencing factors in the predictive model. The findings showed that disregarding traffic signals and stop signs, road design problems, poor visibility, and bad weather conditions are the most important variables in the predictive road traffic crash model. Using the identified risk factors is crucial in establishing actions that may reduce the risks related to those factors.
引用
收藏
页码:520 / 525
页数:6
相关论文
共 50 条
  • [1] Waist circumference prediction for epidemiological research using gradient boosted trees
    Zhou, Weihong
    Eckler, Spencer
    Barszczyk, Andrew
    Waese-Perlman, Alex
    Wang, Yingjie
    Gu, Xiaoping
    Feng, Zhong-Ping
    Peng, Yuzhu
    Lee, Kang
    BMC MEDICAL RESEARCH METHODOLOGY, 2021, 21 (01)
  • [2] Waist circumference prediction for epidemiological research using gradient boosted trees
    Weihong Zhou
    Spencer Eckler
    Andrew Barszczyk
    Alex Waese-Perlman
    Yingjie Wang
    Xiaoping Gu
    Zhong-Ping Feng
    Yuzhu Peng
    Kang Lee
    BMC Medical Research Methodology, 21
  • [4] Analysis and prediction of injury severity in single micromobility crashes with Random Forest
    Sanjurjo-de-No, Almudena
    Perez-Zuriaga, Ana Maria
    Garcia, Alfredo
    HELIYON, 2023, 9 (12)
  • [5] Comparative Analysis of Logistic Regression, Gradient Boosted Trees, SVM, and Random Forest Algorithms for Prediction of Acute Kidney Injury Requiring Dialysis After Cardiac Surgery
    Omar, Evi Diana
    Mat, Hasnah
    Abd Karim, Ainil Zafirah
    Sanaudi, Ridwan
    Ibrahim, Fairol H.
    Omar, Mohd Azahadi
    Ismail, Muhd Zulfadli Hafiz
    Jayaraj, Vivek Jason
    Goh, Bak Leong
    INTERNATIONAL JOURNAL OF NEPHROLOGY AND RENOVASCULAR DISEASE, 2024, 17 : 197 - 204
  • [6] Comparison of gradient boosted decision trees and random forest for groundwater potential mapping in Dholpur (Rajasthan), India
    Sachdeva, Shruti
    Kumar, Bijendra
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2021, 35 (02) : 287 - 306
  • [7] Random Forest and Gradient Boosted Trees for Patient Individualized Contrast Agent Dose Reduction in CT Angiography
    Pallenberg, Rene
    Fleitmann, Marja
    Stroth, Andreas Martin
    Gerlach, Jan
    Fuerschke, Alexander
    Barkhausen, Joerg
    Bischof, Arpad
    Handels, Heinz
    CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 952 - 956
  • [8] Comparison of gradient boosted decision trees and random forest for groundwater potential mapping in Dholpur (Rajasthan), India
    Shruti Sachdeva
    Bijendra Kumar
    Stochastic Environmental Research and Risk Assessment, 2021, 35 : 287 - 306
  • [9] Counting People using Gradient Boosted Trees
    Zhou, Bingyin
    Lu, Ming
    Wang, Yonggang
    2016 IEEE INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2016, : 391 - 395
  • [10] Cyanotoxin level prediction in a reservoir using gradient boosted regression trees: a case study
    Paulino José García Nieto
    Esperanza García-Gonzalo
    Fernando Sánchez Lasheras
    José Ramón Alonso Fernández
    Cristina Díaz Muñiz
    Francisco Javier de Cos Juez
    Environmental Science and Pollution Research, 2018, 25 : 22658 - 22671