AViT: Adapting Vision Transformers for Small Skin Lesion Segmentation Datasets

Cited by: 1

Authors:

Du, Siyi [1]

Bayasi, Nourhan

Hamarneh, Ghassan [1,2]

Garbi, Rafeef [1]

Affiliations:

[1] Univ British Columbia, Vancouver, BC, Canada

[2] Simon Fraser Univ, Burnaby, BC, Canada

Source:

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023 WORKSHOPS | 2023 / Vol. 14393

Keywords:

Vision Transformer; Data-efficiency; Efficiency; Medical Image Segmentation; Dermatology;

DOI:

10.1007/978-3-031-47401-9_3

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Skin lesion segmentation (SLS) plays an important role in skin lesion analysis. Vision transformers (ViTs) are considered a promising solution for SLS, but they require more training data than convolutional neural networks (CNNs) because of their inherently parameter-heavy structure and lack of some inductive biases. To alleviate this issue, current approaches fine-tune pre-trained ViT backbones on SLS datasets, aiming to leverage the knowledge learned from a larger set of natural images to lower the amount of skin training data needed. However, fully fine-tuning all parameters of large backbones is computationally expensive and memory intensive. In this paper, we propose AViT, a novel efficient strategy to mitigate ViTs' data-hunger by transferring any pre-trained ViT to the SLS task. Specifically, we integrate lightweight modules (adapters) within the transformer layers, which modulate the feature representation of a ViT without updating its pre-trained weights. In addition, we employ a shallow CNN as a prompt generator to create a prompt embedding from the input image, which captures fine-grained information and the CNN's inductive biases to guide the segmentation task on small datasets. Our quantitative experiments on 4 skin lesion datasets demonstrate that AViT achieves competitive, and at times superior, performance to state-of-the-art (SOTA) methods, but with significantly fewer trainable parameters. Our code is available at https://github.com/siyi-wind/AViT.
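
Illustrative sketch (not the authors' implementation): the abstract says that adapters modulate a ViT's features while the pre-trained weights stay fixed. A common way to build such a module is a small bottleneck with a residual connection, and that is what the plain-Python sketch below assumes. The class name `BottleneckAdapter`, the GELU nonlinearity, the zero-initialised up-projection, and the toy sizes (`dim=8`, `bottleneck=2`) are all assumptions made for illustration; the actual design is in the repository linked above.

```python
import math
import random


def linear(x, weight, bias):
    """Apply y = W x + b to one token vector x (plain Python lists)."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weight, bias)]


def gelu(v):
    """Exact GELU computed with the error function."""
    return 0.5 * v * (1.0 + math.erf(v / math.sqrt(2.0)))


class BottleneckAdapter:
    """Hypothetical adapter: down-project, nonlinearity, up-project, residual add."""

    def __init__(self, dim, bottleneck, seed=0):
        rng = random.Random(seed)
        # Small random down-projection (assumed initialisation).
        self.w_down = [[rng.gauss(0.0, 0.02) for _ in range(dim)] for _ in range(bottleneck)]
        self.b_down = [0.0] * bottleneck
        # Zero-initialised up-projection (assumption): the adapter starts as an
        # identity map, so the frozen backbone's features pass through unchanged.
        self.w_up = [[0.0] * bottleneck for _ in range(dim)]
        self.b_up = [0.0] * dim

    def __call__(self, x):
        hidden = [gelu(v) for v in linear(x, self.w_down, self.b_down)]
        delta = linear(hidden, self.w_up, self.b_up)
        return [xi + di for xi, di in zip(x, delta)]

    def num_trainable(self):
        """Two weight matrices plus two bias vectors."""
        dim, r = len(self.w_up), len(self.w_down)
        return 2 * dim * r + dim + r


adapter = BottleneckAdapter(dim=8, bottleneck=2)
token = [0.1 * i for i in range(8)]  # stand-in for one frozen ViT token feature
out = adapter(token)
print(out == token)             # True: identity at initialisation
print(adapter.num_trainable())  # 42
```

Only the adapter's parameters would be updated during training in this sketch, and their count grows with `2 * dim * bottleneck` rather than with the size of the backbone. This is one way to read the abstract's claim of "significantly fewer trainable parameters".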


Pages: 25-36

Page count: 12

50 records in total

[1] Exploring Advances in Transformers and CNN for Skin Lesion Diagnosis on Small Datasets
de Lima, Leandro M.
Krohling, Renato A.
[J]. INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 282 - 296
[2] Skin Lesion Segmentation Based on Vision Transformers and Convolutional Neural Networks-A Comparative Study
Gulzar, Yonis
Khan, Sumeer Ahmad
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (12)
[3] Boundary-Aware Transformers for Skin Lesion Segmentation
Wang, Jiacheng
Wei, Lan
Wang, Liansheng
Zhou, Qichao
Zhu, Lei
Qin, Jing
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 206 - 216
[4] Accumulated Trivial Attention Matters in Vision Transformers on Small Datasets
Chen, Xiangyu
Hu, Qinghao
Li, Kaidong
Zhong, Cuncong
Wang, Guanghui
[J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023 : 3973 - 3981
[5] Vision Transformers for Small Histological Datasets Learned Through Knowledge Distillation
Kanwal, Neel
Eftestol, Trygve
Khoraminia, Farbod
Zuiverloon, Tahlita C. M.
Engan, Kjersti
[J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III, 2023, 13937 : 167 - 179
[6] Transformers Meet Small Datasets
Shao, Ran
Bi, Xiao-Jun
[J]. IEEE ACCESS, 2022, 10 : 118454 - 118464
[7] Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets
Lu, Zhiying
Xie, Hongtao
Liu, Chuanbin
Zhang, Yongdong
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022
[8] FAT-Net: Feature adaptive transformers for automated skin lesion segmentation
Wu, Huisi
Chen, Shihuai
Chen, Guilian
Wang, Wei
Lei, Baiying
Wen, Zhenkun
[J]. MEDICAL IMAGE ANALYSIS, 2022, 76
[9] Fine-Grained Fish Classification From Small to Large Datasets With Vision Transformers
Veiga, Ricardo J. M.
Rodrigues, Joao M. F.
[J]. IEEE ACCESS, 2024, 12 : 113642 - 113660
[10] Enhancing performance of vision transformers on small datasets through local inductive bias incorporation
Akkaya, I. B.
Kathiresan, S. S.
Arani, E.
Zonooz, B.
[J]. PATTERN RECOGNITION, 2024, 153
