ADA-VIT: ATTENTION-GUIDED DATA AUGMENTATION FOR VISION TRANSFORMERS

被引：0

作者：

Baili, Nada ^{[1
]}

Frigui, Hichem ^{[1
]}

机构：

[1] Univ Louisville, Comp Sci & Engn Dept, 220 Eastern Pkwy, Louisville, KY 40292 USA

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年

关键词：

Vision Transformer; Data Augmentation;

D O I：

10.1109/ICIP49359.2023.10222908

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The limitations of a machine learning model can often be traced back to the existence of under-represented regions in the feature space of the training data. Data augmentation is a common technique that has been used to inflate training datasets with new samples to improve the model performance. However, these techniques usually focus on expanding the data in size and do not necessarily aim to cover the under-represented regions of the feature space. In this paper, we propose an Attention-guided Data Augmentation technique for Vision Transformers (ADA-ViT). Our framework exploits the attention mechanism in vision transformers to extract visual concepts related to misclassified samples. The retrieved concepts describe under-represented regions in the training dataset that contributed to the misclassifications. We leverage this information to guide our data augmentation process by identifying new samples and using them to augment the training data. We hypothesize that this focused data augmentation populates under-represented regions and improves the model's accuracy. We evaluate our framework on the CUB dataset and CUB-Families. Our experiments show that ADA-ViT outperforms state-of-the-art data augmentation strategies, and can improve the accuracy of a transformer by an average margin of 2.5% on the CUB dataset and 3.3% on CUB-Families.

引用

页码：385 / 389

页数：5

共 46 条

[41] Removing multiple types of noise of distributed acoustic sensing seismic data using attention-guided denoising convolutional neural network
Wang, Cong
Huang, Xingguo
Li, Yue
Jensen, Kristian
FRONTIERS IN EARTH SCIENCE, 2023, 10
[42] Self-Attention-Guided Multiindicator Retrieval for Ocean Surface Wind Field With Multimodal Data Augmentation and Fusion
Li, Menglong
Hou, Yonghong
Song, Xiaowei
Hou, Chunping
Xiong, Zixiang
Ma, Dan
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[43] Enhancing IIoT Vision Data Transmission and Processing via Spatial-Difference-Attention-Guided Saliency Detection
Jia, Ning
Liu, Xianhui
Sun, Yougang
Liu, Zhuang
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (16): : 26668 - 26679
[44] Computer-Assisted Diagnosis of Hepatic Portal Hypertension: A Novel, Attention-Guided Deep Learning Framework Based On CT Imaging and Laboratory Data Integration
Wang, Y.
Li, X.
Konanur, M.
Konkel, B.
Seyferth, E.
Brajer, N.
Bashir, M.
Lafata, K.
MEDICAL PHYSICS, 2021, 48 (06)
[45] Attention-Guided Convolution Neural Network Assisted With Handcrafted Features for Ship Classification in Low-Resolution Sentinel-1 SAR Image Data
Bhattacharjee, Shovakar
Shanmugam, Palanisamy
Das, Sukhendu
IEEE ACCESS, 2024, 12 (48668-48685) : 48668 - 48685
[46] COViT-GAN: Vision Transformer for COVID-19 Detection in CT Scan Images with Self-Attention GAN for Data Augmentation
Ambita, Ara Abigail E.
Boquio, Eujene Nikka, V
Naval, Prospero C., Jr.
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 587 - 598

← 1 2 3 4 5 →