PCA-based missing information imputation for real-time crash likelihood prediction under imbalanced data

被引:34
|
作者
Ke, Jintao [1 ]
Zhang, Shuaichao [2 ]
Yang, Hai [1 ]
Chen, Xiqun [2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Civil & Environm Engn, Kowloon, Hong Kong, Peoples R China
[2] Zhejiang Univ, Coll Civil Engn & Architecture, Hangzhou 310058, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Real-time crash likelihood prediction; PCA-based missing data imputation; cost-sensitive learning; SMOTE; support vector machine; adaboost; URBAN EXPRESSWAYS; INJURY SEVERITY; HYBRID APPROACH; RISK-ASSESSMENT; SAFETY; MODEL; WEATHER; SHANGHAI; IMPROVE;
D O I
10.1080/23249935.2018.1542414
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
As an important research topic, real-time crash likelihood prediction has been studied for many years. However, few research focuses on the missing data imputation in real-time crash likelihood prediction, although missing values are commonly observed due to breakdown of sensors or external interference. Besides, classifying imbalanced data is also a critical issue in real-time crash likelihood prediction, since the number of crash-prone cases is much smaller than that of non-crash cases. In this paper, three principal component analysis (PCA) based approaches are established for imputing missing values, while two kinds of solutions are developed to tackle the issue of imbalanced data. The results show that the proposed methods can help the classifiers achieve better predictive performance under situations with missing data. The two solutions, i.e. cost-sensitive learning, and synthetic minority oversampling technique (SMOTE), can help improve the sensitivity by adjusting the classifiers to pay more attention to the minority class.
引用
收藏
页码:872 / 895
页数:24
相关论文
共 50 条
  • [1] Chunk-wise regularised PCA-based imputation of missing data
    A. Iodice D’Enza
    A. Markos
    F. Palumbo
    [J]. Statistical Methods & Applications, 2022, 31 : 365 - 386
  • [2] Chunk-wise regularised PCA-based imputation of missing data
    Iodice D'Enza, A.
    Markos, A.
    Palumbo, F.
    [J]. STATISTICAL METHODS AND APPLICATIONS, 2022, 31 (02): : 365 - 386
  • [3] Real-time Traffic Prediction: A Novel Imputation Optimization Algorithm with Missing Data
    Liu, Anqi
    Li, Changle
    Yue, Wenwei
    Zhou, Xun
    [J]. 2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [4] Handling Imbalanced Data for Real-Time Crash Prediction: Application of Boosting and Sampling Techniques
    Ariannezhad, Amin
    Karimpour, Abolfazl
    Qin, Xiao
    Wu, Yao-Jan
    Salmani, Yasamin
    [J]. JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2021, 147 (03)
  • [5] Class-imbalanced crash prediction based on real-time traffic and weather data: A driving simulator study
    Elamrani Abou Elassad, Zouhair
    Mousannif, Hajar
    Al Moatassime, Hassan
    [J]. TRAFFIC INJURY PREVENTION, 2020, 21 (03) : 201 - 208
  • [6] Wasserstein Generative Adversarial Network to Address the Imbalanced Data Problem in Real-Time Crash Risk Prediction
    Man, Cheuk Ki
    Quddus, Mohammed
    Theofilatos, Athanasios
    Yu, Rongjie
    Imprialou, Marianna
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 23002 - 23013
  • [7] Empirical likelihood-based inference under imputation for missing response data
    Wang, QH
    Rao, JNK
    [J]. ANNALS OF STATISTICS, 2002, 30 (03): : 896 - 924
  • [8] A Direct PCA-Based Approach for Real-Time Description of Physiological Organ Deformations
    de Senneville, Baudouin Denis
    El Hamidi, Abdallah
    Moonen, Chrit
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2015, 34 (04) : 974 - 982
  • [9] Predicting crash likelihood and severity on freeways with real-time loop detector data
    Xu, Chengcheng
    Tarko, Andrew P.
    Wang, Wei
    Liu, Pan
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2013, 57 : 30 - 39
  • [10] Real-time crash prediction in an urban expressway using disaggregated data
    Basso, Franco
    Basso, Leonardo J.
    Bravo, Francisco
    Pezoa, Raul
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 86 : 202 - 219