Comparison of Statistical and Machine-Learning Models on Road Traffic Accident Severity Classification

Cited by: 14
Authors
Infante, Paulo [1, 2]
Jacinto, Goncalo [1, 2]
Afonso, Anabela [1, 2]
Rego, Leonor [2]
Nogueira, Vitor [3, 4]
Quaresma, Paulo [3, 4]
Saias, Jose [3, 4]
Santos, Daniel [4]
Nogueira, Pedro [5, 6]
Silva, Marcelo [5, 6]
Costa, Rosalina Pisco [7, 8]
Gois, Patricia [9]
Manuel, Paulo Rebelo [1]
Affiliations
[1] Univ Evora, IIFA, CIMA, P-7000671 Evora, Portugal
[2] Univ Evora, Dept Matemat, ECT, P-7000671 Evora, Portugal
[3] Univ Evora, Algoritmi Res Ctr, P-7000671 Evora, Portugal
[4] Univ Evora, Dept Informat, ECT, P-7000671 Evora, Portugal
[5] Univ Evora, IIFA, ICT, P-7000671 Evora, Portugal
[6] Univ Evora, Dept Geosci, P-7000671 Evora, Portugal
[7] Univ Evora, IIFA, CICS NOVA UEVORA, P-7000208 Evora, Portugal
[8] Univ Evora, Dept Sociol, ECS, P-7000803 Evora, Portugal
[9] Univ Evora, Dept Visual Arts & Design, EA, P-7000208 Evora, Portugal
Keywords
injury; logistic regression; machine learning; road traffic accidents; severity of victims; CRASHES; PREDICTION; VEHICLE; SINGLE
DOI
10.3390/computers11050080
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Portugal has the sixth-highest road fatality rate among European Union members. This is a multidimensional problem with serious consequences for people's lives. This study analyses daily data from police and government authorities on road traffic accidents that occurred between 2016 and 2019 in a district of Portugal. It looks for the determinants that contribute to the existence of victims in road traffic accidents, as well as the determinants of fatalities and/or serious injuries in accidents with victims. We use logistic regression models and compare their results with those of machine-learning models. For the severity model, where the response variable indicates whether the traffic accident resulted in property damage only or in casualties, we used a large sample with a small imbalance. For the serious injuries model, where the response variable indicates whether or not the accident with victims involved serious injuries and/or fatalities, we used a small sample with very imbalanced data. Empirical analysis supports the conclusion that, with a small sample of imbalanced data, machine-learning models generally do not perform better than statistical models; however, the two perform similarly when the sample is large and has a small imbalance.
Pages: 12