Vehicle Classification Algorithm Based on Improved Vision Transformer

被引:1
|
作者
Dong, Xinlong [1 ]
Shi, Peicheng [1 ]
Tang, Yueyue [1 ]
Yang, Li [1 ]
Yang, Aixi [2 ]
Liang, Taonian [3 ]
机构
[1] Anhui Polytech Univ, Sch Mech & Automot Engn, Wuhu 241000, Peoples R China
[2] Zhejiang Univ, Polytech Inst, Hangzhou 310015, Peoples R China
[3] Chery New Energy Automobile Co Ltd, Wuhu 241000, Peoples R China
来源
WORLD ELECTRIC VEHICLE JOURNAL | 2024年 / 15卷 / 08期
关键词
vehicle classification; vision transformer; local detail features; sparse attention module; contrast loss;
D O I
10.3390/wevj15080344
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Vehicle classification technology is one of the foundations in the field of automatic driving. With the development of deep learning technology, visual transformer structures based on attention mechanisms can represent global information quickly and effectively. However, due to direct image segmentation, local feature details and information will be lost. To solve this problem, we propose an improved vision transformer vehicle classification network (IND-ViT). Specifically, we first design a CNN-In D branch module to extract local features before image segmentation to make up for the loss of detail information in the vision transformer. Then, in order to solve the problem of misdetection caused by the large similarity of some vehicles, we propose a sparse attention module, which can screen out the discernible regions in the image and further improve the detailed feature representation ability of the model. Finally, this paper uses the contrast loss function to further increase the intra-class consistency and inter-class difference of classification features and improve the accuracy of vehicle classification recognition. Experimental results show that the accuracy of the proposed model on the datasets of vehicle classification BIT-Vehicles, CIFAR-10, Oxford Flower-102, and Caltech-101 is higher than that of the original vision transformer model. Respectively, it increased by 1.3%, 1.21%, 7.54%, and 3.60%; at the same time, it also met a certain real-time requirement to achieve a balance of accuracy and real time.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Galaxy morphology classification based on Convolutional vision Transformer (CvT)
    Cao, Jie
    Xu, Tingting
    Deng, Yuhe
    Deng, Linhua
    Yang, Mingcun
    Liu, Zhijing
    Zhou, Weihong
    ASTRONOMY & ASTROPHYSICS, 2024, 683
  • [22] An Arrhythmia Classification Model Based on Vision Transformer with Deformable Attention
    Dong, Yanfang
    Zhang, Miao
    Qiu, Lishen
    Wang, Lirong
    Yu, Yong
    MICROMACHINES, 2023, 14 (06)
  • [23] Enhancing Auditory Brainstem Response Classification Based On Vision Transformer
    Ahmed, Hunar Abubakir
    Majidpour, Jafar
    Ahmed, Mohammed Hussein
    Jameel, Samer Kais
    Majidpour, Amir
    COMPUTER JOURNAL, 2023, 67 (05): : 1872 - 1878
  • [24] Olive Disease Classification Based on Vision Transformer and CNN Models
    Alshammari, Hamoud
    Gasmi, Karim
    Ben Ltaifa, Ibtihel
    Krichen, Moez
    Ben Ammar, Lassaad
    Mahmood, Mahmood A.
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [25] Malicious Code Family Classification Method Based on Vision Transformer
    Chen, Shi
    Liu, Ying
    Hu, Wei
    Liu, Jianyi
    Gao, Yating
    Lin, Bingjie
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 704 - 709
  • [26] Intelligent diagnosis of turnout faults based on improved Vision Transformer
    Wang, Yingqi
    Li, Gang
    Hu, Qizheng
    Yang, Yong
    Journal of Railway Science and Engineering, 2024, 21 (10) : 4321 - 4333
  • [27] BAMBOO DEFECT CLASSIFICATION BASED ON IMPROVED TRANSFORMER NETWORK
    Hu, Junfeng
    Yu, Xi
    Zhao, Yafeng
    WOOD RESEARCH, 2022, 67 (03) : 501 - 510
  • [28] An improved calibration algorithm for ITS based on vision
    Liu, Fuqiang
    Wang, Jing
    Guo, Lian
    Wang, Xinhong
    2007 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 589 - 592
  • [29] Vision based vehicle detection transfer learning algorithm
    Cai, Yingfeng
    Wang, Hai
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2015, 45 (02): : 275 - 280
  • [30] A FEASIBLE ARRHYTHMIA CLASSIFICATION ALGORITHM BASED ON TRANSFORMER MODEL
    Shi, Cui
    Meng, Qinghua
    Nie, Mingshuo
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2022, 23 (09) : 2035 - 2047