SkinDistilViT: Lightweight Vision Transformer for Skin Lesion Classification

被引:2
|
作者
Lungu-Stan, Vlad-Constantin [1 ]
Cercel, Dumitru-Clementin [1 ]
Pop, Florin [1 ,2 ]
机构
[1] Univ Politehn Bucuresti, Fac Automat Control & Comp, Bucharest, Romania
[2] Natl Inst Res & Dev Informat ICI Bucharest, Bucharest, Romania
关键词
Skin Lesion Diagnosis; Vision Transformer; Knowledge Distillation;
D O I
10.1007/978-3-031-44207-0_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skin cancer is a treatable disease if discovered early. We provide a production-specific solution to the skin cancer classification problem that matches human performance in melanoma identification by training a vision transformer on melanoma medical images annotated by experts. Since inference cost, both time and memory wise is important in practice, we employ knowledge distillation to obtain a model that retains 98.33% of the teacher's balanced multi-class accuracy, at a fraction of the cost. Memory-wise, our model is 49.60% smaller than the teacher. Time-wise, our solution is 69.25% faster on GPU and 97.96% faster on CPU. By adding classification heads at each level of the transformer and employing a cascading distillation process, we improve the balanced multi-class accuracy of the base model by 2.1%, while creating a range of models of various sizes but comparable performance. We provide the code at https://github.com/Longman- Stan/SkinDistilVit.
引用
收藏
页码:268 / 280
页数:13
相关论文
共 50 条
  • [1] Assist-Dermo: A Lightweight Separable Vision Transformer Model for Multiclass Skin Lesion Classification
    Abbas, Qaisar
    Daadaa, Yassine
    Rashid, Umer
    Ibrahim, Mostafa E. A.
    DIAGNOSTICS, 2023, 13 (15)
  • [2] Lightweight vision image transformer (LViT) model for skin cancer disease classification
    Dwivedi, Tanay
    Chaurasia, Brijesh Kumar
    Shukla, Man Mohan
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (10) : 5030 - 5055
  • [3] Vision transformer and CNN-based skin lesion analysis: classification of monkeypox
    Yolcu Oztel G.
    Multimedia Tools and Applications, 2024, 83 (28) : 71909 - 71923
  • [4] Enhancing Skin Lesion Classification: A Self-Attention Fusion Approach with Vision Transformer
    Heroza, Rahmat Izwan
    Gan, John Q.
    Raza, Haider
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, PT II, MIUA 2024, 2024, 14860 : 309 - 322
  • [5] SkinSwinViT: A Lightweight Transformer-Based Method for Multiclass Skin Lesion Classification with Enhanced Generalization Capabilities
    Tang, Kun
    Su, Jing
    Chen, Ruihan
    Huang, Rui
    Dai, Ming
    Li, Yongjiang
    APPLIED SCIENCES-BASEL, 2024, 14 (10):
  • [6] DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification
    Vachmanus, Sirawich
    Noraset, Thanapon
    Piyanonpong, Waritsara
    Rattananukrom, Teerapong
    Tuarob, Suppawong
    IEEE ACCESS, 2023, 11 : 145467 - 145484
  • [7] A Novel Vision Transformer Model for Skin Cancer Classification
    Guang Yang
    Suhuai Luo
    Peter Greer
    Neural Processing Letters, 2023, 55 : 9335 - 9351
  • [8] A Novel Vision Transformer Model for Skin Cancer Classification
    Yang, Guang
    Luo, Suhuai
    Greer, Peter
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 9335 - 9351
  • [9] Lightweight Vision Transformer for damaged wheat detection and classification using spectrograms
    Lin, Hao
    Guo, Min
    Ma, Miao
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (05)
  • [10] Enhanced deep bottleneck transformer model for skin lesion classification*
    Nakai, Katsuhiro
    Chen, Yen-Wei
    Han, Xian-Hua
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78