Cost-sensitive learning for semi-supervised hit-and-run analysis

被引:10
|
作者
Zhu, Siying [1 ]
Wan, Jianwu [1 ]
机构
[1] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore, Singapore
来源
关键词
Hit-and-run; Cost-sensitive; Semi-supervised learning; Imbalanced dataset; Unlabelled data; CRASHES; ACCIDENTS; VEHICLE; BARRIERS; NETWORK; MODEL; ROAD;
D O I
10.1016/j.aap.2021.106199
中图分类号
TB18 [人体工程学];
学科分类号
1201 ;
摘要
Hit-and-run crashes not only degrade the morality, but also result in delays of medical services provided to victims. However, class imbalance problem exists as the number of hit-and-run crashes is much smaller than that of non-hit-and-run crashes. The missing label problem also exists in the crash analysis due to reasons like data barrier such that the information hidden in the unlabelled samples has not been effectively utilised. In this paper, a cost-sensitive semi-supervised logistic regression (CS3LR) model is proposed for hit-and-run analysis, in order to tackle class-imbalanced data distribution and missing label problem, based on the crash dataset of Victorian, Australia (2013-2019). By performing label estimation with logistic regression jointly utilising both labelled and unlabelled data with pseudo labels in a well-designed cost-sensitive semi-supervised maximum likelihood framework, the proposed model can obtain an unbiased likelihood parameter for hit-and-run prediction and analysis. Comparing the experimental results of CS3LR model with two logistic regression models and seven machine learning methods, better performance of CS3LR model is demonstrated. The most significant contributing factors to hit-and-run crashes extracted by CS3LR with only 10% labelled data show a high degree of consistency with the true contributing factors obtained by the supervised cost-sensitive logistic regression with complete hit-and-run labels. The effects of class-weighted ratio and hyper-parameter lambda on the performance of hitand-run crash prediction model have also been analysed. The results can further provide recommendations and implications on the policies and counter-measures for preventing hit-and-run collisions and crimes. The methodology proposed in this paper can also be employed to analyse crash data with other types of missing labels, such as crash severity.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Cost-Sensitive Graph Convolutional Network With Self-Paced Learning for Hit-and-Run Analysis
    Wan, Jianwu
    Zhu, Siying
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1675 - 1690
  • [2] A cost-sensitive semi-supervised learning modelbased on uncertainty
    Zhu, Hongyu
    Wang, Xizhao
    NEUROCOMPUTING, 2017, 251 : 106 - 114
  • [3] Cost-Sensitive Support Vector Machine for Semi-Supervised Learning
    Qi, Zhiquan
    Tian, Yingjie
    Shi, Yong
    Yu, Xiaodan
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 1684 - 1689
  • [4] Cost-Sensitive Semi-Supervised Discriminant Analysis for Face Recognition
    Lu, Jiwen
    Zhou, Xiuzhuang
    Tan, Yap-Peng
    Shang, Yuanyuan
    Zhou, Jie
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2012, 7 (03) : 944 - 953
  • [5] Cost-Sensitive Canonical Correlation Analysis for Semi-Supervised Multi-View Learning
    Wan, Jianwu
    Zhu, Feng
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1330 - 1334
  • [6] Cost-Sensitive Semi-supervised Classification for Fraud Applications
    Elshaar, Sulaf
    Sadaoui, Samira
    AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2020, 2021, 12613 : 173 - 187
  • [7] Cost-Sensitive Semi-Supervised Support Vector Machine
    Li, Yu-Feng
    Kwok, James T.
    Zhou, Zhi-Hua
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 500 - 505
  • [8] A Cost-Sensitive Semi-Supervised Support Vector Machine Algorithm
    Han, Min
    Wang, Zhao
    Sun, Zhaoxu
    Xu, Yongli
    Jiang, Nan
    2012 THIRD INTERNATIONAL CONFERENCE ON THEORETICAL AND MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE (ICTMF 2012), 2013, 38 : 238 - 244
  • [9] Cost-Sensitive Label Propagation for Semi-Supervised Face Recognition
    Wan, Jianwu
    Wang, Yi
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (07) : 1729 - 1743
  • [10] Semi-supervised Feature Selection Based on Cost-Sensitive and Structural Information
    Tao, Yiling
    Lu, Guangquan
    Ma, Chaoqun
    Su, Zidong
    Hu, Zehui
    DATABASES THEORY AND APPLICATIONS (ADC 2021), 2021, 12610 : 23 - 36