Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country

被引:0
|
作者
Hanif Bhuiyan
Jinat Ara
Khan Md. Hasib
Md Imran Hossain Sourav
Faria Benta Karim
Cecilia Sik-Lanyi
Guido Governatori
Andry Rakotonirainy
Shamsunnahar Yasmin
机构
[1] Queensland University of Technology,Data61
[2] CARRS-Q,Department of Electrical Engineering and Information System
[3] CSIRO,Department of CSE
[4] University of Pannonia,Department of EEE
[5] Ahsanullah University of Science and Technology,undefined
[6] Memorial University of Newfoundland,undefined
[7] University of Asia Pacific,undefined
[8] Centre for Computational Law,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Road traffic injuries are one of the primary reasons for death, especially in developing countries like Bangladesh. Safety in land transport is one of the major concerns for road safety authorities and other policymakers. For this reason, contributory factors identification associated with crashes is necessary for reducing road crashes and ensuring transportation safety. This paper presents an analytical approach to identifying significant contributing factors of Bangladesh road crashes by evaluating the road crash data, considering three different severity levels (non-fetal, severe, and extremely severe). Generally, official crash databases are compiled from police-reported crash records. Though the official datasets are focusing on compiling a wide array of attributes, an assorted number of unreported issues can be observed that demands an alternative source of crash data. Therefore, this proposed approach considers compiling crash data from newspapers in Bangladesh which could be complimentary to the official crash database. To conduct the analysis, first, we filtered the useful features from compiled crash data using three popular feature selection techniques: chi-square, Two-way ANOVA, and Regression analysis. Then, we employed three machine learning classifiers: Decision Tree, Random Forest, and Naïve Bayes over the extracted features. A confusion matrix was considered to evaluate the proposed model, including classification accuracy, sensitivity, and specificity. The predictive machine learning model, namely, Random Forest using Label Encoder with chi-square and Two-way ANOVA feature selection process, seems the best option for crash severity prediction that provides high prediction accuracy. The resulting model highlights nine out of fourteen independent features as responsible factors. Significant features associated with crash severities include driver characteristics (gender, license type, seat belts), vehicle characteristics (vehicle type), road characteristics (road surface type, road classification), environmental conditions (day of crash occurred, time of crash), and injury localization. This outcome may contribute to improving traffic safety of Bangladesh.
引用
收藏
相关论文
共 50 条
  • [31] A Study of Pathologic Risk Factors in Postchemoreduced, Enucleated Specimens of Advanced Retinoblastomas in a Developing Country
    Suryawanshi, Pallavi
    Ramadwar, Mukta
    Dikshit, Rajesh
    Kane, Shubhada V.
    Kurkure, Purna
    Banavali, Shripad
    Viswanathan, Seethalakshmi
    ARCHIVES OF PATHOLOGY & LABORATORY MEDICINE, 2011, 135 (08) : 1017 - 1023
  • [32] Property Damage Crash Equivalency Factors to Solve Crash Frequency-Severity Dilemma Case Study on South Korean Rural Roads
    Oh, Jutaek
    Washington, Simon
    Lee, Dongmin
    TRANSPORTATION RESEARCH RECORD, 2010, (2148) : 83 - 92
  • [33] Comprehensive Study of Risk Factors for Fatal Pedestrian Crashes in Urban Setup in a Developing Country
    Mukherjee, Dipanjan
    Mitra, Sudeshna
    TRANSPORTATION RESEARCH RECORD, 2020, 2674 (08) : 100 - 118
  • [34] The prevalence of crash risk factors in a population-based study of motorcycle riders
    de Rome, Liz
    Fitzharris, Michael
    Baldock, Matthew
    Fernandes, Ralston
    Ma, Alice
    Brown, Julie
    INJURY-INTERNATIONAL JOURNAL OF THE CARE OF THE INJURED, 2016, 47 (09): : 2025 - 2033
  • [35] Work environment factors and provider performance in health houses: a case study of a developing country
    Yusefzadeh, Hasan
    Nabilou, Bahram
    BMC RESEARCH NOTES, 2020, 13 (01)
  • [36] Work environment factors and provider performance in health houses: a case study of a developing country
    Hasan Yusefzadeh
    Bahram Nabilou
    BMC Research Notes, 13
  • [37] Use of Multicriteria Risk Ranking of Zoonotic Diseases in a Developing Country: Case Study of Mongolia
    McFadden, A. M. J.
    Muellner, P.
    Baljinnyam, Z.
    Vink, D.
    Wilson, N.
    ZOONOSES AND PUBLIC HEALTH, 2016, 63 (02) : 138 - 151
  • [38] Tunnel driving risk identification based on multi-source data
    Jin, Sheng
    Jiang, Yang
    Liu, Bokun
    Bai, Congcong
    Zhou, Mengtao
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2024, 54 (06): : 1511 - 1519
  • [39] Determination of disease risk factors using binary data envelopment analysis and logistic regression analysis (case study: a stroke risk factors)
    Gholamazad, Maedeh
    Pourmahmoud, Jafar
    Atashi, Alireza
    Farhoudi, Mehdi
    Anvari, Reza Deljavan
    JOURNAL OF MODELLING IN MANAGEMENT, 2024, 19 (02) : 693 - 714
  • [40] Waste management barriers in developing country hospitals: Case study and AHP analysis
    de Godoy Delmonico, Diego V.
    dos Santos, Hugo H.
    Pinheiro, Marco A. P.
    de Castro, Rosani
    de Souza, Regiane M.
    WASTE MANAGEMENT & RESEARCH, 2018, 36 (01) : 48 - 58