Vehicle Classification Algorithm Based on Improved Vision Transformer

被引:1
|
作者
Dong, Xinlong [1 ]
Shi, Peicheng [1 ]
Tang, Yueyue [1 ]
Yang, Li [1 ]
Yang, Aixi [2 ]
Liang, Taonian [3 ]
机构
[1] Anhui Polytech Univ, Sch Mech & Automot Engn, Wuhu 241000, Peoples R China
[2] Zhejiang Univ, Polytech Inst, Hangzhou 310015, Peoples R China
[3] Chery New Energy Automobile Co Ltd, Wuhu 241000, Peoples R China
来源
WORLD ELECTRIC VEHICLE JOURNAL | 2024年 / 15卷 / 08期
关键词
vehicle classification; vision transformer; local detail features; sparse attention module; contrast loss;
D O I
10.3390/wevj15080344
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Vehicle classification technology is one of the foundations in the field of automatic driving. With the development of deep learning technology, visual transformer structures based on attention mechanisms can represent global information quickly and effectively. However, due to direct image segmentation, local feature details and information will be lost. To solve this problem, we propose an improved vision transformer vehicle classification network (IND-ViT). Specifically, we first design a CNN-In D branch module to extract local features before image segmentation to make up for the loss of detail information in the vision transformer. Then, in order to solve the problem of misdetection caused by the large similarity of some vehicles, we propose a sparse attention module, which can screen out the discernible regions in the image and further improve the detailed feature representation ability of the model. Finally, this paper uses the contrast loss function to further increase the intra-class consistency and inter-class difference of classification features and improve the accuracy of vehicle classification recognition. Experimental results show that the accuracy of the proposed model on the datasets of vehicle classification BIT-Vehicles, CIFAR-10, Oxford Flower-102, and Caltech-101 is higher than that of the original vision transformer model. Respectively, it increased by 1.3%, 1.21%, 7.54%, and 3.60%; at the same time, it also met a certain real-time requirement to achieve a balance of accuracy and real time.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] DeepFake detection algorithm based on improved vision transformer
    Heo, Young-Jin
    Yeo, Woon-Ha
    Kim, Byung-Gyu
    APPLIED INTELLIGENCE, 2023, 53 (07) : 7512 - 7527
  • [2] DeepFake detection algorithm based on improved vision transformer
    Young-Jin Heo
    Woon-Ha Yeo
    Byung-Gyu Kim
    Applied Intelligence, 2023, 53 : 7512 - 7527
  • [3] Algorithm for vision-based vehicle detection and classification
    Hu, Youpan
    He, Qing
    Zhuang, Xiaobin
    Wang, Haibin
    Li, Baopu
    Wen, Zhenfu
    Leng, Bin
    Guan, Guan
    Chen, Dongjie
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2013, : 568 - 572
  • [4] Small Target Accurate Vehicle Detection Algorithm Based on Improved Transformer
    Xie Guangda
    Li Yang
    Qu Hongquan
    Sun Zaiming
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [5] Vision-based vehicle classification
    Gupte, S
    Masoud, O
    Papanikolopoulos, NP
    2000 IEEE INTELLIGENT TRANSPORTATION SYSTEMS PROCEEDINGS, 2000, : 46 - 51
  • [6] GeneViT: Gene Vision Transformer with Improved DeepInsight for cancer classification
    Gokhale, Madhuri
    Mohanty, Sraban Kumar
    Ojha, Aparajita
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 155
  • [7] Malware Family Classification Based on Vision Transformer
    Li, Jing
    Luo, Xueping
    Journal of Computers (Taiwan), 2023, 34 (01) : 87 - 99
  • [8] Improved Vision-Based Vehicle Detection and Classification by Optimized YOLOv4
    Zhao, Jingyi
    Hao, Shengnan
    Dai, Chenxu
    Zhang, Haiyang
    Zhao, Li
    Ji, Zhanlin
    Ganchev, Ivan
    IEEE Access, 2022, 10 : 8590 - 8603
  • [9] Improved Vision-Based Vehicle Detection and Classification by Optimized YOLOv4
    Zhao, Jingyi
    Hao, Shengnan
    Dai, Chenxu
    Zhang, Haiyang
    Zhao, Li
    Ji, Zhanlin
    Ganchev, Ivan
    IEEE ACCESS, 2022, 10 : 8590 - 8603
  • [10] Bayesian Network Based Computer Vision Algorithm for Vehicle Classification from Incomplete Data
    Zheng, Chen-zhao
    2017 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELING, SIMULATION AND APPLIED MATHEMATICS (CMSAM), 2017, : 439 - 444