Cost-sensitive learning for semi-supervised hit-and-run analysis

被引:10
|
作者
Zhu, Siying [1 ]
Wan, Jianwu [1 ]
机构
[1] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore, Singapore
来源
关键词
Hit-and-run; Cost-sensitive; Semi-supervised learning; Imbalanced dataset; Unlabelled data; CRASHES; ACCIDENTS; VEHICLE; BARRIERS; NETWORK; MODEL; ROAD;
D O I
10.1016/j.aap.2021.106199
中图分类号
TB18 [人体工程学];
学科分类号
1201 ;
摘要
Hit-and-run crashes not only degrade the morality, but also result in delays of medical services provided to victims. However, class imbalance problem exists as the number of hit-and-run crashes is much smaller than that of non-hit-and-run crashes. The missing label problem also exists in the crash analysis due to reasons like data barrier such that the information hidden in the unlabelled samples has not been effectively utilised. In this paper, a cost-sensitive semi-supervised logistic regression (CS3LR) model is proposed for hit-and-run analysis, in order to tackle class-imbalanced data distribution and missing label problem, based on the crash dataset of Victorian, Australia (2013-2019). By performing label estimation with logistic regression jointly utilising both labelled and unlabelled data with pseudo labels in a well-designed cost-sensitive semi-supervised maximum likelihood framework, the proposed model can obtain an unbiased likelihood parameter for hit-and-run prediction and analysis. Comparing the experimental results of CS3LR model with two logistic regression models and seven machine learning methods, better performance of CS3LR model is demonstrated. The most significant contributing factors to hit-and-run crashes extracted by CS3LR with only 10% labelled data show a high degree of consistency with the true contributing factors obtained by the supervised cost-sensitive logistic regression with complete hit-and-run labels. The effects of class-weighted ratio and hyper-parameter lambda on the performance of hitand-run crash prediction model have also been analysed. The results can further provide recommendations and implications on the policies and counter-measures for preventing hit-and-run collisions and crimes. The methodology proposed in this paper can also be employed to analyse crash data with other types of missing labels, such as crash severity.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Semi-Supervised Incremental Learning
    Bouchachia, Abdelhamid
    Prossegger, Markus
    Duman, Hakan
    2010 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2010), 2010,
  • [32] Semi-supervised learning by disagreement
    Zhi-Hua Zhou
    Ming Li
    Knowledge and Information Systems, 2010, 24 : 415 - 439
  • [33] Cost Sensitive Semi-Supervised Canonical Correlation Analysis for Multi-view Dimensionality Reduction
    Wan, Jianwu
    Wang, Hongyuan
    Yang, Ming
    NEURAL PROCESSING LETTERS, 2017, 45 (02) : 411 - 430
  • [34] Deep Semi-Supervised Learning
    Hailat, Zeyad
    Komarichev, Artem
    Chen, Xue-Wen
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2154 - 2159
  • [35] Semi-Supervised Learning by Disagreement
    Zhou, Zhi-Hua
    2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 93 - 93
  • [36] Reliable Semi-supervised Learning
    Shao, Junming
    Huang, Chen
    Yang, Qinli
    Luo, Guangchun
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1197 - 1202
  • [37] Semi-supervised Learning with Transfer Learning
    Zhou, Huiwei
    Zhang, Yan
    Huang, Degen
    Li, Lishuang
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, 2013, 8208 : 109 - 119
  • [38] Cost Sensitive Semi-Supervised Canonical Correlation Analysis for Multi-view Dimensionality Reduction
    Jianwu Wan
    Hongyuan Wang
    Ming Yang
    Neural Processing Letters, 2017, 45 : 411 - 430
  • [39] Semi-supervised learning with dropouts
    Abhishek
    Yadav, Rakesh Kumar
    Verma, Shekhar
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [40] PRIVILEGED SEMI-SUPERVISED LEARNING
    Chen, Xingyu
    Gong, Chen
    Ma, Chao
    Huang, Xiaolin
    Yang, Jie
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2999 - 3003