A Transformer-Based Knowledge Distillation Network for Cortical Cataract Grading

Cited by: 1

Authors:

Wang, Jinhong [1,2]

Xu, Zhe [3]

Zheng, Wenhao [1,2]

Ying, Haochao [4]

Chen, Tingting [1,2]

Liu, Zuozhu [5]

Chen, Danny Z. [6]

Yao, Ke [3]

Wu, Jian [7,8]

Affiliations:

[1] Zhejiang Univ, Affiliated Hosp 2, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China

[2] Zhejiang Univ, Affiliated Hosp 2, Eye Ctr, Hangzhou 310027, Peoples R China

[3] Zhejiang Univ, Affiliated Hosp 2, Eye Ctr, Sch Med, Hangzhou 310009, Zhejiang, Peoples R China

[4] Zhejiang Univ, Sch Publ Hlth, Hangzhou 310058, Peoples R China

[5] Zhejiang Univ, ZJU UIUC Inst, Res & Dev Ctr Intelligent Healthcare, ZJU Angelalign Inc, Haining 310058, Peoples R China

[6] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA

[7] Zhejiang Univ, Affiliated Hosp 2, Sch Med, Sch Publ Hlth, Hangzhou 310058, Peoples R China

[8] Zhejiang Univ, Inst Wenzhou, Hangzhou 310058, Peoples R China

Source:

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2024 / Vol. 43 / No. 03

Keywords:

Cataracts; Transformers; Annotations; Feature extraction; Image edge detection; Fuses; Knowledge engineering; Cataract grading; knowledge distillation; transformer; medical imaging classification; CLASSIFICATION; IMAGES;

DOI:

10.1109/TMI.2023.3327274

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Subject classification codes:

081203 ; 0835 ;

Abstract:

Cortical cataract, a common type of cataract, is particularly difficult to diagnose automatically because of the complex features of its lesions. Recently, many methods based on edge detection or deep learning have been proposed for automatic cataract grading. However, these methods suffer a large performance drop in cortical cataract grading due to the more complex cortical opacities and uncertain data. In this paper, we propose a novel Transformer-based Knowledge Distillation Network, called TKD-Net, for cortical cataract grading. To tackle the complex opacity problem, we first devise a zone decomposition strategy to extract more refined features and introduce special sub-scores that capture the critical factors of clinical cortical opacity assessment (location, area, density) for comprehensive quantification. Next, we develop a multi-modal mix-attention Transformer to efficiently fuse the sub-score and image modalities for complex feature learning. However, obtaining the sub-score modality is challenging in the clinic, which can in turn cause a missing-modality problem. To simultaneously alleviate the issues of missing modalities and uncertain data, we further design a Transformer-based knowledge distillation method, which uses a teacher model with perfect data to guide a student model with modality-missing and uncertain data. We conduct extensive experiments on a dataset of commonly used slit-lamp images annotated with the LOCS III grading system, demonstrating that TKD-Net outperforms state-of-the-art methods and that its key components are effective.
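
The abstract does not spell out the distillation objective, so the following is only a minimal sketch of generic soft-label knowledge distillation (a teacher's temperature-softened outputs guiding a student), not the TKD-Net loss itself. The function names `softmax` and `distillation_loss`, the parameters `temperature` and `alpha`, and the use of plain logit lists are all assumptions made for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """Generic distillation objective (illustrative, not the paper's).

    Combines hard-label cross-entropy with KL(teacher || student)
    computed on temperature-softened distributions.
    """
    # Hard-label term: cross-entropy of the student against the true grade.
    p_student = softmax(student_logits)
    ce = -math.log(p_student[label])

    # Soft-label term: KL divergence from teacher to student at temperature T.
    q_teacher = softmax(teacher_logits, temperature)
    q_student = softmax(student_logits, temperature)
    kl = sum(t * math.log(t / s)
             for t, s in zip(q_teacher, q_student) if t > 0)

    # The T^2 factor keeps the soft-term scale comparable across temperatures.
    return (1 - alpha) * ce + alpha * temperature ** 2 * kl
```

In a setting like the one described, the teacher logits would come from a model that sees both the image and the sub-score modality, while the student sees the image only; `alpha` trades off fitting the annotated grade against matching the teacher.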

Pages: 1089 - 1101

Number of pages: 13

50 items in total

[41] TRANSFORMER-BASED HIERARCHICAL CLUSTERING FOR BRAIN NETWORK ANALYSIS
Dai, Wei
Cui, Hejie
Kan, Xuan
Guo, Ying
Van Rooij, Sanne
Yang, Carl
2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023
[42] Transformer-Based Unified Neural Network for Quality Estimation and Transformer-Based Re-decoding Model for Machine Translation
Chen, Cong
Zong, Qinqin
Luo, Qi
Qiu, Bailian
Li, Maoxi
MACHINE TRANSLATION, CCMT 2020, 2020, 1328 : 66 - 75
[43] A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations
Ma, Hui
Wang, Jian
Lin, Hongfei
Zhang, Bo
Zhang, Yijia
Xu, Bo
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 776 - 788
[44] Hydrophobicity-Based Grading of Industrial Composite Insulators Images Using Cross Attention Vision Transformer With Knowledge Distillation
Das, Samiran
Chatterjee, Sujoy
Basu, Mainak
IEEE TRANSACTIONS ON DIELECTRICS AND ELECTRICAL INSULATION, 2024, 31 (01) : 523 - 532
[45] TASKED: Transformer-based Adversarial learning for human activity recognition using wearable sensors via Self-KnowledgE Distillation
Suh, Sungho
Rey, Vitor Fortes
Lukowicz, Paul
KNOWLEDGE-BASED SYSTEMS, 2023, 260
[46] Vision Transformer-Based Self-supervised Learning for Ulcerative Colitis Grading in Colonoscopy
Pyatha, Ajay
Xu, Ziang
Ali, Sharib
DATA ENGINEERING IN MEDICAL IMAGING, DEMI 2023, 2023, 14314 : 102 - 110
[47] Medical image classification: Knowledge transfer via residual U-Net and vision transformer-based teacher-student model with knowledge distillation
Song, Yucheng
Wang, Jincan
Ge, Yifan
Li, Lifeng
Guo, Jia
Dong, Quanxing
Liao, Zhifang
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 102
[48] TopNet: Transformer-based Object Placement Network for Image Compositing
Zhu, Sijie
Lin, Zhe
Cohen, Scott
Kuen, Jason
Zhang, Zhifei
Chen, Chen
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023 : 1838 - 1847
[49] Transformer-Based Network for Accurate Classification of Lung Auscultation Sounds
Sonali C.S.
Kiran J.
Suma K.V.
Chinmayi B.S.
Easa M.
Critical Reviews in Biomedical Engineering, 2023, 51 (06) : 1 - 16
[50] TIPFNet: a transformer-based infrared polarization image fusion network
Li, Kunyuan
Qi, Meibin
Zhuang, Shuo
Yang, Yanfang
Gao, Jun
OPTICS LETTERS, 2022, 47 (16) : 4255 - 4258
