Overfitting Prevention in Accident Prediction Models: Bayesian Regularization of Artificial Neural Networks

被引:22
|
作者
Fiorentini, Nicholas [1 ]
Pellegrini, Diletta [1 ]
Losa, Massimo [1 ]
机构
[1] Univ Pisa, Dept Civil & Ind Engn, Pisa, Italy
关键词
accident prediction models; crash analysis; artificial neural networks; Bayesian regularization backpropagation algorithm; overfitting Issue; road safety; four-lane divided roads; NEGATIVE BINOMIAL MODELS; LEARNING-METHODS; SAFETY;
D O I
10.1177/03611981221111367
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In the present paper, we implemented the Bayesian regularization (BR) backpropagation algorithm for calibrating an artificial neural network (ANN) as an accident prediction model (APM) to be used on Italian four-lane divided roads. We chose the BR-ANN since it efficiently allows for dealing with small sample size and avoiding overfitting issues by adding a regularization term in the objective function to be minimized during training. Moreover, BR-ANNs are sparsely employed in road safety analyses, and their peculiarities deserve to be emphasized. In our work, the BR-ANN aims to predict the number of fatal and injury (FI) crashes across 236 road elements, for a total length of 78 km. The input features are road element length, horizontal and vertical alignment, cross-section geometry, operating speed, traffic flow, sight distance, and road area type (i.e., a categorical predictor accounting for the potential influence of merge and diverge influence areas). Training and test phases of the BR-ANN have been evaluated by determination coefficient (R-2), root mean square error (RMSE), overfitting ratio (OR), scatterplots, residuals analysis, and by the same ANN architecture trained with the gradient descent (GD) with momentum and adaptive learning rate backpropagation algorithm (GD-ANN). Results demonstrate that the BR-ANN markedly outperforms the GD-ANN, which suffers severe overfitting issues. Furthermore, BR-ANN does not overfit data (OR close to the unity), reports a satisfactory R-2 (0.726), and shows a Gaussian residual distribution with zero mean. Therefore, road authorities could consider regularized ANNs for performing appropriate safety analyses, especially when dealing with small road sample sizes.
引用
收藏
页码:1455 / 1470
页数:16
相关论文
共 50 条
  • [1] Prediction of accident severity using artificial neural networks
    Moghaddam, F. Rezaie
    Afandizadeh, Sh.
    Ziyadi, M.
    INTERNATIONAL JOURNAL OF CIVIL ENGINEERING, 2011, 9 (01) : 41 - 48
  • [2] Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
    Goncalves Dos Santos, Claudio Filipi
    Papa, Joao Paulo
    ACM COMPUTING SURVEYS, 2022, 54 (10S)
  • [3] Regularization of Deep Neural Networks for EEG Seizure Detection to Mitigate Overfitting
    Saqib, Mohammed
    Zhu, Yuanda
    Wang, May
    Beaulieu-Jones, Brett
    2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 664 - 673
  • [4] A Comparative Analysis of Various Regularization Techniques to Solve Overfitting Problem in Artificial Neural Network
    Gupta, Shrikant
    Gupta, Rajat
    Ojha, Muneendra
    Singh, K. P.
    DATA SCIENCE AND ANALYTICS, 2018, 799 : 363 - 371
  • [6] Interpretable artificial neural networks incorporating Bayesian alphabet models for genome-wide prediction and association studies
    Zhao, Tianjing
    Fernando, Rohan
    Cheng, Hao
    G3-GENES GENOMES GENETICS, 2021, 11 (10):
  • [7] Prediction of Loan Redemption: Logit Models and Artificial Neural Networks
    Dorota Witkowska
    Mariola Chrzanowska
    International Advances in Economic Research, 2005, 11 (3) : 343 - 343
  • [8] Artificial Neural Networks for Surface Ozone Prediction: Models and Analysis
    Faris, Hossam
    Alkasassbeh, Mouhammd
    Rodan, Ali
    POLISH JOURNAL OF ENVIRONMENTAL STUDIES, 2014, 23 (02): : 341 - 348
  • [9] Bayesian Neural Networks with Maximum Mean Discrepancy regularization
    Pomponi, Jary
    Scardapane, Simone
    Uncini, Aurelio
    NEUROCOMPUTING, 2021, 453 : 428 - 437
  • [10] Choosing Between Artificial Neural Networks and Bayesian Inference in Stroke Risk Prediction
    Albu, Adriana
    Stanciu, Loredana
    Pasca, Madalina-Sofia
    Zimbru, Cristian-Grigore
    2019 E-HEALTH AND BIOENGINEERING CONFERENCE (EHB), 2019,