SkinDistilViT: Lightweight Vision Transformer for Skin Lesion Classification

被引:2
|
作者
Lungu-Stan, Vlad-Constantin [1 ]
Cercel, Dumitru-Clementin [1 ]
Pop, Florin [1 ,2 ]
机构
[1] Univ Politehn Bucuresti, Fac Automat Control & Comp, Bucharest, Romania
[2] Natl Inst Res & Dev Informat ICI Bucharest, Bucharest, Romania
关键词
Skin Lesion Diagnosis; Vision Transformer; Knowledge Distillation;
D O I
10.1007/978-3-031-44207-0_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skin cancer is a treatable disease if discovered early. We provide a production-specific solution to the skin cancer classification problem that matches human performance in melanoma identification by training a vision transformer on melanoma medical images annotated by experts. Since inference cost, both time and memory wise is important in practice, we employ knowledge distillation to obtain a model that retains 98.33% of the teacher's balanced multi-class accuracy, at a fraction of the cost. Memory-wise, our model is 49.60% smaller than the teacher. Time-wise, our solution is 69.25% faster on GPU and 97.96% faster on CPU. By adding classification heads at each level of the transformer and employing a cascading distillation process, we improve the balanced multi-class accuracy of the base model by 2.1%, while creating a range of models of various sizes but comparable performance. We provide the code at https://github.com/Longman- Stan/SkinDistilVit.
引用
收藏
页码:268 / 280
页数:13
相关论文
共 50 条
  • [21] Bottleneck Transformer model with Channel Self-Attention for skin lesion classification
    Tada, Masato
    Han, Xian-Hua
    2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
  • [22] Dual scale light weight cross attention transformer for skin lesion classification
    Yadav, Dhirendra Prasad
    Sharma, Bhisham
    Chauhan, Shivank
    Webber, Julian L.
    Mehbodniya, Abolfazl
    PLOS ONE, 2024, 19 (12):
  • [23] Multiclass skin lesion classification in dermoscopic images using swin transformer model
    Selen Ayas
    Neural Computing and Applications, 2023, 35 : 6713 - 6722
  • [24] Lesion classification and diabetic retinopathy grading by integrating softmax and pooling operators into vision transformer
    Liu, Chong
    Wang, Weiguang
    Lian, Jian
    Jiao, Wanzhen
    FRONTIERS IN PUBLIC HEALTH, 2025, 12
  • [25] EPVT: Environment-Aware Prompt Vision Transformer for Domain Generalization in Skin Lesion Recognition
    Yan, Siyuan
    Liu, Chi
    Yu, Zhen
    Ju, Lie
    Mahapatra, Dwarikanath
    Mar, Victoria
    Janda, Monika
    Soyer, Peter
    Ge, Zongyuan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VII, 2023, 14226 : 249 - 259
  • [26] White patchy skin lesion classification using feature enhancement and interaction transformer module
    Li, Zhiming
    Jiang, Shuying
    Xiang, Fan
    Li, Chunying
    Li, Shuli
    Gao, Tianwen
    He, Kaiqiao
    Chen, Jianru
    Zhang, Junpeng
    Zhang, Junran
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 107
  • [27] LTUNet: A Lightweight Transformer-Based UNet with Multi-scale Mechanism for Skin Lesion Segmentation
    Guo, Huike
    Zhang, Han
    Li, Minghe
    Quan, Xiongwen
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 147 - 158
  • [28] An efficient multi-class classification of skin cancer using optimized vision transformer
    R. P. Desale
    P. S. Patil
    Medical & Biological Engineering & Computing, 2024, 62 : 773 - 789
  • [29] An efficient multi-class classification of skin cancer using optimized vision transformer
    Desale, R. P.
    Patil, P. S.
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (03) : 773 - 789
  • [30] Multiclass Skin Lesion Classification Using a Novel Lightweight Deep Learning Framework for Smart Healthcare
    Hoang, Long
    Lee, Suk-Hwan
    Lee, Eung-Joo
    Kwon, Ki-Ryong
    APPLIED SCIENCES-BASEL, 2022, 12 (05):