ADA-VIT: ATTENTION-GUIDED DATA AUGMENTATION FOR VISION TRANSFORMERS

Cited by: 0
Authors
Baili, Nada [1]
Frigui, Hichem [1]
Affiliation
[1] Univ Louisville, Comp Sci & Engn Dept, 220 Eastern Pkwy, Louisville, KY 40292 USA
Keywords
Vision Transformer; Data Augmentation
DOI
10.1109/ICIP49359.2023.10222908
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The limitations of a machine learning model can often be traced back to the existence of under-represented regions in the feature space of the training data. Data augmentation is a common technique that has been used to inflate training datasets with new samples to improve the model performance. However, these techniques usually focus on expanding the data in size and do not necessarily aim to cover the under-represented regions of the feature space. In this paper, we propose an Attention-guided Data Augmentation technique for Vision Transformers (ADA-ViT). Our framework exploits the attention mechanism in vision transformers to extract visual concepts related to misclassified samples. The retrieved concepts describe under-represented regions in the training dataset that contributed to the misclassifications. We leverage this information to guide our data augmentation process by identifying new samples and using them to augment the training data. We hypothesize that this focused data augmentation populates under-represented regions and improves the model's accuracy. We evaluate our framework on the CUB dataset and CUB-Families. Our experiments show that ADA-ViT outperforms state-of-the-art data augmentation strategies, and can improve the accuracy of a transformer by an average margin of 2.5% on the CUB dataset and 3.3% on CUB-Families.
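The selection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how visual concepts are extracted or how new samples are identified, so everything below is an assumption made for illustration. The sketch assumes that a "concept" is the mean embedding of the most-attended patches of a misclassified image, that candidates come from an unlabeled pool of embeddings, and that cosine similarity is the matching rule. All function names are hypothetical.

```python
import math


def top_attended_patches(cls_attention, k):
    """Indices of the k patches with the highest CLS-to-patch attention."""
    order = sorted(range(len(cls_attention)),
                   key=lambda i: cls_attention[i], reverse=True)
    return order[:k]


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[d] for v in vectors) / n for d in range(len(vectors[0]))]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def concept_from_sample(patch_embeddings, cls_attention, k=2):
    """ASSUMPTION: a concept is the mean embedding of the top-k attended patches."""
    idx = top_attended_patches(cls_attention, k)
    return mean_vector([patch_embeddings[i] for i in idx])


def select_augmentation_candidates(concepts, pool, n_select):
    """ASSUMPTION: rank pool items by their best cosine match to any concept."""
    scored = [(max(cosine(emb, c) for c in concepts), name)
              for name, emb in pool.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scored[:n_select]]


# Toy data: one misclassified image with four 2-D patch embeddings.
patch_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
cls_attention = [0.05, 0.05, 0.5, 0.4]
concept = concept_from_sample(patch_embeddings, cls_attention, k=2)

# Hypothetical candidate pool keyed by sample name.
pool = {"cand_a": [1.0, 0.0], "cand_b": [0.0, 1.0], "cand_c": [0.5, 0.5]}
chosen = select_augmentation_candidates([concept], pool, n_select=2)
print(chosen)
```

In this toy setup the concept is dominated by the two highly attended patches, so the candidates closest to that direction are ranked first. In practice the attention weights and embeddings would come from a trained vision transformer rather than hand-written lists.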
Pages: 385-389
Number of pages: 5