AViT: Adapting Vision Transformers for Small Skin Lesion Segmentation Datasets

被引:1
|
作者
Du, Siyi [1 ]
Bayasi, Nourhan
Hamarneh, Ghassan [1 ,2 ]
Garbi, Rafeef [1 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] Simon Fraser Univ, Burnaby, BC, Canada
关键词
Vision Transformer; Data-efficiency; Efficiency; Medical Image Segmentation; Dermatology;
D O I
10.1007/978-3-031-47401-9_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skin lesion segmentation (SLS) plays an important role in skin lesion analysis. Vision transformers (ViTs) are considered an auspicious solution for SLS, but they require more training data compared to convolutional neural networks (CNNs) due to their inherent parameterheavy structure and lack of some inductive biases. To alleviate this issue, current approaches fine-tune pre-trained ViT backbones on SLS datasets, aiming to leverage the knowledge learned from a larger set of natural images to lower the amount of skin training data needed. However, fully fine-tuning all parameters of large backbones is computationally expensive and memory intensive. In this paper, we propose AViT, a novel efficient strategy to mitigate ViTs' data-hunger by transferring any pretrained ViTs to the SLS task. Specifically, we integrate lightweight modules (adapters) within the transformer layers, which modulate the feature representation of a ViT without updating its pre-trained weights. In addition, we employ a shallow CNN as a prompt generator to create a prompt embedding from the input image, which grasps fine-grained information and CNN's inductive biases to guide the segmentation task on small datasets. Our quantitative experiments on 4 skin lesion datasets demonstrate that AViT achieves competitive, and at times superior, performance to SOTA but with significantly fewer trainable parameters. Our code is available at https://github.com/siyi-wind/AViT.
引用
收藏
页码:25 / 36
页数:12
相关论文
共 50 条
  • [1] Exploring Advances in Transformers and CNN for Skin Lesion Diagnosis on Small Datasets
    de Lima, Leandro M.
    Krohling, Renato A.
    [J]. INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 282 - 296
  • [2] Skin Lesion Segmentation Based on Vision Transformers and Convolutional Neural Networks-A Comparative Study
    Gulzar, Yonis
    Khan, Sumeer Ahmad
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (12):
  • [3] Boundary-Aware Transformers for Skin Lesion Segmentation
    Wang, Jiacheng
    Wei, Lan
    Wang, Liansheng
    Zhou, Qichao
    Zhu, Lei
    Qin, Jing
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 206 - 216
  • [4] Accumulated Trivial Attention Matters in Vision Transformers on Small Datasets
    Chen, Xiangyu
    Hu, Qinghao
    Li, Kaidong
    Zhong, Cuncong
    Wang, Guanghui
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3973 - 3981
  • [5] Vision Transformers for Small Histological Datasets Learned Through Knowledge Distillation
    Kanwal, Neel
    Eftestol, Trygve
    Khoraminia, Farbod
    Zuiverloon, Tahlita C. M.
    Engan, Kjersti
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III, 2023, 13937 : 167 - 179
  • [6] Transformers Meet Small Datasets
    Shao, Ran
    Bi, Xiao-Jun
    [J]. IEEE ACCESS, 2022, 10 : 118454 - 118464
  • [7] Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets
    Lu, Zhiying
    Xie, Hongtao
    Liu, Chuanbin
    Zhang, Yongdong
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [8] FAT-Net: Feature adaptive transformers for automated skin lesion segmentation
    Wu, Huisi
    Chen, Shihuai
    Chen, Guilian
    Wang, Wei
    Lei, Baiying
    Wen, Zhenkun
    [J]. MEDICAL IMAGE ANALYSIS, 2022, 76
  • [9] Fine-Grained Fish Classification From Small to Large Datasets With Vision Transformers
    Veiga, Ricardo J. M.
    Rodrigues, Joao M. F.
    [J]. IEEE ACCESS, 2024, 12 : 113642 - 113660
  • [10] Enhancing performance of vision transformers on small datasets through local inductive bias incorporation
    Akkaya I.B.
    Kathiresan S.S.
    Arani E.
    Zonooz B.
    [J]. Pattern Recognition, 2024, 153