SkinSwinViT: A Lightweight Transformer-Based Method for Multiclass Skin Lesion Classification with Enhanced Generalization Capabilities

Cited by: 0
Authors
Tang, Kun [1]
Su, Jing [1]
Chen, Ruihan [1,2]
Huang, Rui [1]
Dai, Ming [1]
Li, Yongjiang [1]
Affiliations
[1] Guangdong Ocean Univ, Sch Math & Comp, Zhanjiang 524008, Peoples R China
[2] Int Macau Inst Acad Res, Artificial Intelligence Res Inst, Taipa 999078, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / No. 10
Keywords
skin lesions; ISIC2018; transformer; data enhancement; multiclassification; MACHINE; NETWORK;
DOI
10.3390/app14104005
Chinese Library Classification (CLC) number
O6 [Chemistry];
Subject classification code
0703;
Abstract
In recent decades, skin cancer has emerged as a significant global health concern, demanding timely detection and effective therapeutic interventions. Automated image classification via computational algorithms holds substantial promise for improving the efficacy of clinical diagnoses. This study aims to improve diagnostic accuracy in the classification of multiclass skin lesions. The task is difficult because different lesions resemble one another and because conventional convolutional neural network methods are limited in extracting precise global and local image features across diverse dimensional spaces. Consequently, this study introduces SkinSwinViT, a skin lesion classification model built on the Swin Transformer framework and featuring a global attention mechanism. Leveraging the cross-window attention mechanism inherent in the Swin Transformer architecture, the model captures local features and interdependencies within skin lesion images, while an additional global self-attention mechanism captures overarching features and contextual information. The model's performance was evaluated on the ISIC2018 challenge dataset. Furthermore, data augmentation was used to enlarge the training dataset and improve model performance. Experimental results highlight the superiority of the SkinSwinViT method, which achieves accuracy, recall, precision, specificity, and F1 score of 97.88%, 97.55%, 97.83%, 99.36%, and 97.79%, respectively.
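The five metrics named in the abstract can all be derived from a multiclass confusion matrix. The following is a minimal sketch of one common way to do so, using one-vs-rest counts per class and macro averaging. The abstract does not state which averaging convention the authors used, so macro averaging is an assumption here, and the function name `macro_metrics` and the toy matrix `toy_cm` are hypothetical, not taken from the paper.

```python
def macro_metrics(cm):
    """Compute macro-averaged metrics from a square confusion matrix.

    cm[i][j] is the number of samples whose true class is i and whose
    predicted class is j. Macro averaging is an assumption; the abstract
    does not say how the reported figures were averaged.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    recalls, precisions, specificities, f1s = [], [], [], []
    for k in range(n):
        # One-vs-rest counts for class k.
        tp = cm[k][k]
        fn = sum(cm[k]) - tp
        fp = sum(cm[i][k] for i in range(n)) - tp
        tn = total - tp - fn - fp
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        specificity = tn / (tn + fp) if (tn + fp) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        recalls.append(recall)
        precisions.append(precision)
        specificities.append(specificity)
        f1s.append(f1)
    return {
        # Overall accuracy: correctly classified samples over all samples.
        "accuracy": sum(cm[k][k] for k in range(n)) / total,
        "recall": sum(recalls) / n,
        "precision": sum(precisions) / n,
        "specificity": sum(specificities) / n,
        "f1": sum(f1s) / n,
    }


# Toy 3-class matrix for illustration only; these are not the paper's data.
toy_cm = [
    [8, 1, 1],
    [2, 6, 2],
    [0, 1, 9],
]
metrics = macro_metrics(toy_cm)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Specificity is typically much higher than the other metrics in multiclass settings because each class's true negatives include every correctly handled sample from all other classes, which is consistent with the 99.36% figure standing above the rest.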
Pages: 20