A multi-stage multi-modal learning algorithm with adaptive multimodal fusion for improving multi-label skin lesion classification

被引:0
|
作者
Zuo, Lihan [1 ]
Wang, Zizhou [2 ]
Wang, Yan [2 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 610000, Peoples R China
[2] ASTAR, Inst High Performance Comp, Singapore 138632, Singapore
关键词
Skin lesion classification; Multi-modal learning; Multi-label classification; Multi-modal information fusion; 7-POINT CHECKLIST; DERMOSCOPIC IMAGES; NEURAL-NETWORK;
D O I
10.1016/j.artmed.2025.103091
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skin cancer is frequently occurring and has become a major contributor to both cancer incidence and mortality. Accurate and timely diagnosis of skin cancer holds the potential to save lives. Deep learning- based methods have demonstrated significant advancements in the screening of skin cancers. However, most current approaches rely on a single modality input for diagnosis, thereby missing out on valuable complementary information that could enhance accuracy. Although some multimodal-based methods exist, they often lack adaptability and fail to fully leverage multimodal information. In this paper, we introduce a novel uncertainty-based hybrid fusion strategy fora multi-modal learning algorithm aimed at skin cancer diagnosis. Our approach specifically combines three different modalities: clinical images, dermoscopy images, and metadata, to make the final classification. For the fusion of two image modalities, we employ an intermediate fusion strategy that considers the similarity between clinical and dermoscopy images to extract features containing both complementary and correlated information. To capture the correlated information, we utilize cosine similarity, and we employ concatenation as the means for integrating complementary information. In the fusion of image and metadata modalities, we leverage uncertainty to obtain confident late fusion results, allowing our method to adaptively combine the information from different modalities. We conducted comprehensive experiments using a popular publicly available skin disease diagnosis dataset, and the results of these experiments demonstrate the effectiveness of our proposed method. Our proposed fusion algorithm could enhance the clinical applicability of automated skin lesion classification, offering amore robust and adaptive way to make automatic diagnoses with the help of uncertainty mechanism. Code is available at https://github.com/Zuo-Lihan/CosCatNet-Adaptive_Fusion_Algorithm.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Multi-Label Classification Algorithm for Adaptive Heterogeneous Classifier Group
    Han, Meng
    Yang, Shurong
    Wu, Hongxin
    Ding, Jian
    MATHEMATICS, 2025, 13 (01)
  • [42] Label Enhancement Manifold Learning Algorithm for Multi-label Image Classification
    Tan, Chao
    Ji, Genlin
    2020 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2020), 2020, : 96 - 102
  • [43] M3TR: Multi-modal Multi-label Recognition with Transformer
    Zhao, Jiawei
    Zhao, Yifan
    Li, Jia
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 469 - 477
  • [44] Multi-modal Multi-label Semantic Indexing of Images using Unlabeled Data
    Li, Wei
    Sun, Maosong
    ALPIT 2008: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 204 - 209
  • [45] Metric Learning for Multi-label Classification
    Brighi, Marco
    Franco, Annalisa
    Maio, Dario
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2020, 2021, 12644 : 24 - 33
  • [46] Hyperspherical Learning in Multi-Label Classification
    Ke, Bo
    Zhu, Yunquan
    Li, Mengtian
    Shu, Xiujun
    Qiao, Ruizhi
    Ren, Bo
    COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 : 38 - 55
  • [47] College Student Scholarships and Subsidies Granting: A Multi-Modal Multi-Label Approach
    Ye, Han-Jia
    Zhan, De-Chuan
    Li, Xiaolin
    Huang, Zhen-Chuan
    Jiang, Yuan
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 559 - 568
  • [48] Compact learning for multi-label classification
    Lv, Jiaqi
    Wu, Tianran
    Peng, Chenglun
    Liu, Yunpeng
    Xu, Ning
    Geng, Xin
    PATTERN RECOGNITION, 2021, 113
  • [49] On active learning in multi-label classification
    Brinker, K
    FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 206 - 213
  • [50] Actor-agnostic Multi-label Action Recognition with Multi-modal Query
    Mondal, Anindya
    Nag, Sauradip
    Prada, Joaquin M.
    Zhu, Xiatian
    Dutta, Anjan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 784 - 794