Comprehensive evaluation of multiple machine learning classifiers for predicting freeway incident duration

被引:3
|
作者
Hamad, Khaled [1 ,2 ]
Obaid, Lubna [1 ,2 ]
Nassif, Ali Bou [3 ]
Abu Dabous, Saleh [1 ,2 ]
Al-Ruzouq, Rami [1 ,2 ]
Zeiada, Waleed [1 ,2 ]
机构
[1] Univ Sharjah, Dept Civil & Environm Engn, Sharjah, U Arab Emirates
[2] Univ Sharjah, Res Inst Sci & Engn, Sustainable Civil Infrastruct Syst Res Grp, POB 27272, Sharjah, U Arab Emirates
[3] Univ Sharjah, Comp Engn Dept, Sharjah City, U Arab Emirates
关键词
Incident duration prediction; Feature selection; Machine learning classifiers; Classifiers comparative analysis; Incident classification; CLEARANCE TIME; INFLUENTIAL FACTORS; NEURAL-NETWORK; RESPONSE-TIME; DECISION TREE; TEXT ANALYSIS; M5P TREE; MODEL; CLASSIFICATION; FORECAST;
D O I
10.1007/s41062-023-01138-1
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This study compares the accuracy and complexity of eleven machine learning classifiers for the problem of incident duration prediction. The proposed framework integrates feature selection and modeling techniques to evaluate the effect of multiple influencing factors and choose the best model for predicting incident durations. Models were developed and tested using an incident dataset collected from the Houston TranStar incidents archive, including more than 110,000 records. Features were selected based on integrating information gain, correlation-based, and relief-based evaluators' results. The developed and fine-tuned classifiers were compared in terms of multiple accuracy measures (precision, recall, F-1 score, and AUC) and complexity measures (memory storage, training time, and testing times). Overall, results showed that among the developed models, the support vector machines (SVM), K-Nearest Neighborhoods, and Gaussian processes classification outperformed other classifiers with a prediction accuracy of 97%. The Decision Tree classifier recorded the lowest performance with a prediction accuracy of 82%. Considering a trade-off between the model's accuracy and complexity, the classifier with higher accuracy associated with low training time complexity was the K-Nearest Neighborhoods achieving an accuracy of 97%, 0.024 s of training time, 0.042 s of testing time, and a memory storage of 0.04 megabytes. Nevertheless, the SVM achieved the same accuracy of 97% yet consumed much lower memory storage of 0.004 megabytes and a testing time of 0.01 s. Although the K-NN recorded the lowest training time, the SVM can be considered the best model for the ID-prediction classification problem.
引用
下载
收藏
页数:24
相关论文
共 50 条
  • [1] Comprehensive evaluation of multiple machine learning classifiers for predicting freeway incident duration
    Khaled Hamad
    Lubna Obaid
    Ali Bou Nassif
    Saleh Abu Dabous
    Rami Al-Ruzouq
    Waleed Zeiada
    Innovative Infrastructure Solutions, 2023, 8
  • [2] Predicting Freeway Incident Duration Using Machine Learning
    Khaled Hamad
    Mohamad Ali Khalil
    Abdul Razak Alozi
    International Journal of Intelligent Transportation Systems Research, 2020, 18 : 367 - 380
  • [3] Predicting Freeway Incident Duration Using Machine Learning
    Hamad, Khaled
    Khalil, Mohamad Ali
    Alozi, Abdul Razak
    INTERNATIONAL JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS RESEARCH, 2020, 18 (02) : 367 - 380
  • [4] Evaluation of Machine Learning Classifiers for Predicting Deep Convection
    Ukkonen, Peter
    Makela, Antti
    JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2019, 11 (06) : 1784 - 1802
  • [5] Simple time sequential procedure for predicting freeway incident duration
    Khattak, Asad J.
    Schofer, Joseph L.
    Wang, Mu-Han
    IVHS Journal, 2 (02):
  • [6] Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers
    Orzechowski, Patryk
    Moore, Jason H.
    SCIENCE ADVANCES, 2022, 8 (47)
  • [7] A SIMPLE TIME-SEQUENTIAL PROCEDURE FOR PREDICTING FREEWAY INCIDENT DURATION
    KHATTAK, AJ
    SCHOFER, JL
    WANG, MH
    IVHS JOURNAL, 1995, 2 (02): : 113 - 138
  • [8] Effect of feature optimization on performance of machine learning models for predicting traffic incident duration
    Obaid, Lubna
    Hamad, Khaled
    Khalil, Mohamad Ali
    Nassif, Ali Bou
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
  • [9] Improved Phishing Attack Detection with Machine Learning: A Comprehensive Evaluation of Classifiers and Features
    Kapan, Sibel
    Sora Gunal, Efnan
    APPLIED SCIENCES-BASEL, 2023, 13 (24):
  • [10] Evaluation of machine learning classifiers for predicting essential genes in Mycobacterium tuberculosis strains
    Das, Monish Mukul
    Sarkar, Keka
    BIOINFORMATION, 2022, 18 (12) : 1126 - 1130