A Transformer-Based Knowledge Distillation Network for Cortical Cataract Grading

Cited by: 1
Authors
Wang, Jinhong [1, 2]
Xu, Zhe [3]
Zheng, Wenhao [1, 2]
Ying, Haochao [4]
Chen, Tingting [1, 2]
Liu, Zuozhu [5]
Chen, Danny Z. [6]
Yao, Ke [3]
Wu, Jian [7, 8]
Affiliations
[1] Zhejiang Univ, Affiliated Hosp 2, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Affiliated Hosp 2, Eye Ctr, Hangzhou 310027, Peoples R China
[3] Zhejiang Univ, Affiliated Hosp 2, Eye Ctr, Sch Med, Hangzhou 310009, Zhejiang, Peoples R China
[4] Zhejiang Univ, Sch Publ Hlth, Hangzhou 310058, Peoples R China
[5] Zhejiang Univ, ZJU UIUC Inst, Res & Dev Ctr Intelligent Healthcare, ZJU Angelalign Inc, Haining 310058, Peoples R China
[6] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[7] Zhejiang Univ, Affiliated Hosp 2, Sch Med, Sch Publ Hlth, Hangzhou 310058, Peoples R China
[8] Zhejiang Univ, Inst Wenzhou, Hangzhou 310058, Peoples R China
Keywords
Cataracts; Transformers; Annotations; Feature extraction; Image edge detection; Fuses; Knowledge engineering; Cataract grading; knowledge distillation; transformer; medical imaging classification; CLASSIFICATION; IMAGES;
DOI
10.1109/TMI.2023.3327274
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Cortical cataract, a common type of cataract, is particularly difficult to diagnose automatically because of the complex features of its lesions. Many methods based on edge detection or deep learning have recently been proposed for automatic cataract grading. However, these methods suffer a large performance drop in cortical cataract grading because cortical opacities are more complex and the data are uncertain. In this paper, we propose a novel Transformer-based Knowledge Distillation Network, called TKD-Net, for cortical cataract grading. To tackle the complex opacity problem, we first devise a zone decomposition strategy to extract more refined features, and we introduce special sub-scores that capture the critical factors of clinical cortical opacity assessment (location, area, density) for comprehensive quantification. Next, we develop a multi-modal mix-attention Transformer to efficiently fuse the sub-score and image modalities for complex feature learning. However, obtaining the sub-score modality is challenging in the clinic, which can in turn cause a missing-modality problem. To simultaneously alleviate the issues of missing modality and uncertain data, we further design a Transformer-based knowledge distillation method, in which a teacher model with perfect data guides a student model with modality-missing and uncertain data. We conduct extensive experiments on a dataset of commonly used slit-lamp images annotated with the LOCS III grading system, demonstrating that TKD-Net outperforms state-of-the-art methods and that its key components are effective.
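The teacher-student idea mentioned in the abstract can be illustrated with a generic soft-label distillation loss. The sketch below is not TKD-Net's actual objective, which this record does not specify; it assumes the common formulation in which the student is trained on a weighted sum of cross-entropy against the ground-truth grade and a temperature-softened KL divergence from the teacher's predictions. The names `softmax`, `distillation_loss`, `temperature`, and `alpha` are all illustrative assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits, softened by `temperature`."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """Generic distillation loss (an assumption, not the paper's formulation).

    alpha weights the hard-label cross-entropy term; (1 - alpha) weights
    KL(teacher || student), computed on temperature-softened distributions.
    """
    # Hard-label term: cross-entropy of the student against the true grade.
    p_student = softmax(student_logits)
    cross_entropy = -math.log(p_student[label])

    # Soft-label term: how far the student's softened distribution is
    # from the teacher's softened distribution.
    p_teacher_soft = softmax(teacher_logits, temperature)
    p_student_soft = softmax(student_logits, temperature)
    kl = sum(t * math.log(t / s)
             for t, s in zip(p_teacher_soft, p_student_soft) if t > 0)

    # The temperature**2 factor keeps the soft term's gradient scale comparable.
    return alpha * cross_entropy + (1 - alpha) * temperature ** 2 * kl


# Hypothetical logits for a three-grade problem; the true grade is index 2.
loss = distillation_loss([0.2, 0.5, 1.1], [0.1, 0.4, 2.0], label=2)
```

In this sketch the teacher's logits would come from a model that sees both modalities, while the student's logits come from a model that sees only the image; that split is the setting the abstract describes, but the loss itself is the standard textbook form.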
Pages: 1089-1101
Page count: 13
Related papers
50 items in total
  • [41] TRANSFORMER-BASED HIERARCHICAL CLUSTERING FOR BRAIN NETWORK ANALYSIS
    Dai, Wei
    Cui, Hejie
    Kan, Xuan
    Guo, Ying
    Van Rooij, Sanne
    Yang, Carl
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [42] Transformer-Based Unified Neural Network for Quality Estimation and Transformer-Based Re-decoding Model for Machine Translation
    Chen, Cong
    Zong, Qinqin
    Luo, Qi
    Qiu, Bailian
    Li, Maoxi
    MACHINE TRANSLATION, CCMT 2020, 2020, 1328 : 66 - 75
  • [43] A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations
    Ma, Hui
    Wang, Jian
    Lin, Hongfei
    Zhang, Bo
    Zhang, Yijia
    Xu, Bo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 776 - 788
  • [44] Hydrophobicity-Based Grading of Industrial Composite Insulators Images Using Cross Attention Vision Transformer With Knowledge Distillation
    Das, Samiran
    Chatterjee, Sujoy
    Basu, Mainak
    IEEE TRANSACTIONS ON DIELECTRICS AND ELECTRICAL INSULATION, 2024, 31 (01) : 523 - 532
  • [45] TASKED: Transformer-based Adversarial learning for human activity recognition using wearable sensors via Self-KnowledgE Distillation
    Suh, Sungho
    Rey, Vitor Fortes
    Lukowicz, Paul
    KNOWLEDGE-BASED SYSTEMS, 2023, 260
  • [46] Vision Transformer-Based Self-supervised Learning for Ulcerative Colitis Grading in Colonoscopy
    Pyatha, Ajay
    Xu, Ziang
    Ali, Sharib
    DATA ENGINEERING IN MEDICAL IMAGING, DEMI 2023, 2023, 14314 : 102 - 110
  • [47] Medical image classification: Knowledge transfer via residual U-Net and vision transformer-based teacher-student model with knowledge distillation
    Song, Yucheng
    Wang, Jincan
    Ge, Yifan
    Li, Lifeng
    Guo, Jia
    Dong, Quanxing
    Liao, Zhifang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 102
  • [48] TopNet: Transformer-based Object Placement Network for Image Compositing
    Zhu, Sijie
    Lin, Zhe
    Cohen, Scott
    Kuen, Jason
    Zhang, Zhifei
    Chen, Chen
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1838 - 1847
  • [49] Transformer-Based Network for Accurate Classification of Lung Auscultation Sounds
    Sonali C.S.
    Kiran J.
    Suma K.V.
    Chinmayi B.S.
    Easa M.
    Critical Reviews in Biomedical Engineering, 2023, 51 (06) : 1 - 16
  • [50] TIPFNet: a transformer-based infrared polarization image fusion network
    Li, Kunyuan
    Qi, Meibin
    Zhuang, Shuo
    Yang, Yanfang
    Gao, Jun
    OPTICS LETTERS, 2022, 47 (16) : 4255 - 4258