Multi-Class Skin Cancer Classification Using Vision Transformer Networks and Convolutional Neural Network-Based Pre-Trained Models

被引:13
|
作者
Arshed, Muhammad Asad [1 ,2 ]
Mumtaz, Shahzad [3 ]
Ibrahim, Muhammad [2 ]
Ahmed, Saeed [1 ]
Tahir, Muhammad [4 ,5 ]
Shafi, Muhammad [6 ]
机构
[1] Univ Management & Technol, Sch Syst & Technol, Lahore 54770, Pakistan
[2] Islamia Univ Bahawalpur, Dept Comp Sci, Bahawalpur 63100, Pakistan
[3] Islamia Univ Bahawalpur, Dept Data Sci, Bahawalpur 63100, Pakistan
[4] Univ Manitoba, Dept Elect & Comp Engn, Winnipeg, MB R3T 5V6, Canada
[5] Abdul Wali Khan Univ, Dept Comp Sci, Mardan 23200, Pakistan
[6] Sohar Univ, Fac Comp & Informat Technol, Sohar 311, Oman
关键词
skin cancer diagnosis; multi-class; vision transformer; pretrained models; fine tuning; transfer learning; data augmentation;
D O I
10.3390/info14070415
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Skin cancer, particularly melanoma, has been recognized as one of the most lethal forms of cancer. Detecting and diagnosing skin lesions accurately can be challenging due to the striking similarities between the various types of skin lesions, such as melanoma and nevi, especially when examining the color images of the skin. However, early diagnosis plays a crucial role in saving lives and reducing the burden on medical resources. Consequently, the development of a robust autonomous system for skin cancer classification becomes imperative. Convolutional neural networks (CNNs) have been widely employed over the past decade to automate cancer diagnosis. Nonetheless, the emergence of the Vision Transformer (ViT) has recently gained a considerable level of popularity in the field and has emerged as a competitive alternative to CNNs. In light of this, the present study proposed an alternative method based on the off-the-shelf ViT for identifying various skin cancer diseases. To evaluate its performance, the proposed method was compared with 11 CNN-based transfer learning methods that have been known to outperform other deep learning techniques that are currently in use. Furthermore, this study addresses the issue of class imbalance within the dataset, a common challenge in skin cancer classification. In addressing this concern, the proposed study leverages the vision transformer and the CNN-based transfer learning models to classify seven distinct types of skin cancers. Through our investigation, we have found that the employment of pre-trained vision transformers achieved an impressive accuracy of 92.14%, surpassing CNN-based transfer learning models across several evaluation metrics for skin cancer diagnosis.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Dynamic Convolutional Neural Networks as Efficient Pre-Trained Audio Models
    Schmid, Florian
    Koutini, Khaled
    Widmer, Gerhard
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2227 - 2241
  • [22] Graph-Based Audio Classification Using Pre-Trained Models and Graph Neural Networks
    Castro-Ospina, Andres Eduardo
    Solarte-Sanchez, Miguel Angel
    Vega-Escobar, Laura Stella
    Isaza, Claudia
    Martinez-Vargas, Juan David
    [J]. SENSORS, 2024, 24 (07)
  • [23] Multi-stage glaucoma classification using pre-trained convolutional neural networks and voting-based classifier fusion
    Velpula, Vijaya Kumar
    Sharma, Lakhan Dev
    [J]. FRONTIERS IN PHYSIOLOGY, 2023, 14
  • [24] Convolutional Neural Networks for Histopathology Image Classification: Training vs. Using Pre-Trained Networks
    Kieffer, Brady
    Babaie, Morteza
    Kalra, Shivam
    Tizhoosh, H. R.
    [J]. PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017), 2017,
  • [25] Pre-Trained Convolutional Neural Networks for Breast Cancer Detection Using Ultrasound Images
    Masud, Mehedi
    Hossain, M. Shamim
    Alhumyani, Hesham
    Alshamrani, Sultan S.
    Cheikhrouhou, Omar
    Ibrahim, Saleh
    Muhammad, Ghulam
    Rashed, Amr E. Eldin
    Gupta, B. B.
    [J]. ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2021, 21 (04)
  • [26] CLASSIFICATION OF NOISE BETWEEN FLOORS IN A BUILDING USING PRE-TRAINED DEEP CONVOLUTIONAL NEURAL NETWORKS
    Choi, Hwiyong
    Lee, Seungjun
    Yang, Haesang
    Seong, Woojae
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 535 - 539
  • [27] Efficient pollen grain classification using pre-trained Convolutional Neural Networks: a comprehensive study
    Rostami, Masoud A.
    Balmaki, Behnaz
    Dyer, Lee A.
    Allen, Julie M.
    Sallam, Mohamed F.
    Frontalini, Fabrizio
    [J]. JOURNAL OF BIG DATA, 2023, 10 (01)
  • [28] Scanned ECG Arrhythmia Classification Using a Pre-trained Convolutional Neural Network as a Feature Extractor
    Aldosari, Hanadi
    Coenen, Frans
    Lip, Gregory Y. H.
    Zheng, Yalin
    [J]. ARTIFICIAL INTELLIGENCE XXXIX, AI 2022, 2022, 13652 : 64 - 80
  • [29] Medical Image Classification using Pre-trained Convolutional Neural Networks and Support Vector Machine
    Ahmed, Ali
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (06): : 1 - 6
  • [30] Multi-Class Skin Diseases Classification Using Deep Convolutional Neural Network and Support Vector Machine
    Hameed, Nazia
    Shabut, Antesar M.
    Hossain, M. A.
    [J]. 2018 12TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT & APPLICATIONS (SKIMA), 2018, : 23 - +