Hybrid unsupervised representation learning and pseudo-label supervised self-distillation for rare disease imaging phenotype classification with dispersion-aware imbalance correction

Cited by: 0
Authors
Sun, Jinghan [1, 2]
Wei, Dong [2 ]
Wang, Liansheng [1 ]
Zheng, Yefeng [2 ]
Affiliations
[1] Xiamen Univ, Xiamen 361005, Peoples R China
[2] Jarvis Res Ctr, Tencent YouTu Lab, Shenzhen 518000, Peoples R China
Keywords
Rare disease classification; Unsupervised representation learning; Pseudo-label supervised self-distillation; Dispersion-aware imbalance correction
DOI
10.1016/j.media.2024.103102
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Rare diseases are characterized by low prevalence and are often chronically debilitating or life-threatening. Imaging phenotype classification of rare diseases is challenging due to the severe shortage of training examples. Few-shot learning (FSL) methods tackle this challenge by extracting generalizable prior knowledge from a large base dataset of common diseases and normal controls and transferring that knowledge to rare diseases. Yet most existing methods require the base dataset to be labeled and do not make full use of the precious examples of rare diseases. In addition, the extremely small number of training samples may result in inter-class performance imbalance because the true distributions are insufficiently sampled. To this end, we propose a novel hybrid approach to rare disease imaging phenotype classification, featuring three key novelties that target the above drawbacks. First, we adopt unsupervised representation learning (URL) based on a self-supervised contrastive loss, thereby eliminating the overhead of labeling the base dataset. Second, we integrate the URL with pseudo-label supervised classification for effective self-distillation of knowledge about the rare diseases, composing a hybrid approach that takes advantage of both unsupervised and (pseudo-)supervised learning on the base dataset. Third, we use feature dispersion to assess the intra-class diversity of training samples and alleviate the inter-class performance imbalance via dispersion-aware correction. Experimental results on imaging phenotype classification of both simulated rare diseases (skin lesions and cervical smears) and real clinical rare diseases (retinal diseases) show that our hybrid approach substantially outperforms existing FSL methods (including those using a fully supervised base dataset) through effective integration of the URL, pseudo-label driven self-distillation, and dispersion-aware imbalance correction, thus establishing a new state of the art.
Pages: 14
Related papers
1 in total
  • [1] Unsupervised Representation Learning Meets Pseudo-Label Supervised Self-Distillation: A New Approach to Rare Disease Classification
    Sun, Jinghan
    Wei, Dong
    Ma, Kai
    Wang, Liansheng
    Zheng, Yefeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT V, 2021, 12905: 519-529