Recognition Model of Highway Toll Evasion Behavior Considering Cost-Sensitivity

被引:0
|
作者
Zhao J. [1 ,2 ]
Xu H. [1 ]
Lü X. [1 ]
Li P. [3 ]
Huang S. [3 ]
机构
[1] School of Traffic and Transportation, Beijing Jiaotong University, Beijing
[2] Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, Beijing Jiaotong University, Beijing
[3] TransChina(Beijing)Technology Co.,Ltd., Beijing
来源
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science) | 2024年 / 52卷 / 05期
基金
中国国家自然科学基金;
关键词
cost-sensitivity; ensemble learning; feature selection; highway transport; machine learning;
D O I
10.12141/j.issn.1000-565X.230078
中图分类号
学科分类号
摘要
In order to effectively improve the efficiency of highway vehicle toll evasion inspection, based on ETC (Electronic Toll Collection) toll data, this paper proposed a highway vehicle evasion recognition model by combining KNN (K-Nearest Neighbor), adaptive boosting (Adaboost) algorithm and cost-sensitive learning mechanism. Firstly, in view of the large volume and redundancy of the original ETC toll flow data, data discretization and standardization processing rules were developed to repair and standardize the data form, and then two types of toll evasion features were extracted. Secondly, seven types of toll evasion, such as large vehicles with small tags, were selected as the main research objects by analyzing the ETC data set. Thirdly, to address the problem of inefficient model classification due to the“high-dimensional”characteristics of the evasion data, the best subset of features showing the evasion characteristics was selected by Pearson and Spearman correlation analysis and ReliefF importance analysis. Fourthly, to address the model overfitting problem caused by the class “imbalance”between toll evasion vehicles and normal vehicles, KNN was used as the base classifier in the Adaboost algorithm, and the boundary ambiguity of different categories was alleviated through TomekLinks undersampling, then a cost-sensitive learning mechanism was introduced to improve the model’s emphasis on the minority class (toll evasion vehicles) to alleviate the tendency to discriminate the majority class (normal vehicles). Finally, the performance of the KNN-Adaboost model incorporating cost-sensitive learning mechanisms was verified by comparing the recognition effects of different classification models for various types of evasion events. The results show that the precision of the proposed model is 0. 98, Recall is 0. 96, F1-score is 0. 97, and Kappa coefficient is 0. 95, indicating that the proposed model can better solve the sample class imbalance problem than other models and has higher recognition accuracy for minority class,and it can be a reference for improving the efficiency of highway toll inspection. © 2024 South China University of Technology. All rights reserved.
引用
收藏
页码:10 / 19
页数:9
相关论文
共 18 条
  • [1] CHEN Hailiang, WU Xuming, Discussion on the scheme of ETC anti-evasion discriminating system for Guangdong expressway [J], China ITS Journal, 12, pp. 62-65, (2014)
  • [2] (2014)
  • [3] (2019)
  • [4] YANG Xiang, Research on online audit method of two-way change cards vehicles based on multi-source big data of expressway [J], China Municipal Engineering, 3, pp. 59-62, (2022)
  • [5] Yang YANG, LI Shilei, TANG Bowen, Analysis on inspection methods and development trend of highway fake plate vehicles [J], Auto & Safety, 3, pp. 78-82, (2022)
  • [6] ZHAO Yan, WU Shuling, LIN Zhiheng, Study on the prediction model of toll fraud behavior for highway pass card [J], China Sciencepaper, 10, 19, pp. 2245-2251, (2015)
  • [7] LI Songjiang, ZHOU Zhou, LI Yanfang, Prediction of highway escape cost based on IGA-IBP algorithm [J], Computer Engineering and Design, 39, 12, pp. 3840-3845, (2018)
  • [8] (2018)
  • [9] XIANG Hongyan, YANG Pengtao, YI Jiajia, State prediction model of expressway escaping vehicle based on RF-LR [J], Journal of Chongqing Normal University (Natural Science), 37, 1, pp. 75-80, (2020)
  • [10] (2020)