Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification

Cited by: 0

Authors:

Sharma, Yash [1]

Shrivastava, Aman [1]

Ehsan, Lubaina [1]

Moskaluk, Christopher A. [1]

Syed, Sana [1]

Brown, Donald E. [1]

Affiliations:

[1] Univ Virginia, Charlottesville, VA 22903 USA

Source:

MEDICAL IMAGING WITH DEEP LEARNING, VOL 143 | 2021 / Vol. 143

Funding:

National Institutes of Health (USA);

Keywords:

Deep Learning; Multi-Instance Learning; Weak Supervision; Histopathology;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, the availability of digitized Whole Slide Images (WSIs) has enabled the use of deep learning-based computer vision techniques for automated disease diagnosis. However, WSIs present unique computational and algorithmic challenges. WSIs are gigapixel-sized (~100K pixels), making it infeasible to use them directly for training deep neural networks. Also, often only slide-level labels are available for training, as detailed annotations are tedious and time-consuming for experts. Approaches using multiple-instance learning (MIL) frameworks have been shown to overcome these challenges. Current state-of-the-art approaches divide the learning framework into two decoupled parts: a convolutional neural network (CNN) for encoding the patches, followed by an independent aggregation approach for slide-level prediction. In this approach, the aggregation step has no bearing on the representations learned by the CNN encoder. We propose Cluster-to-Conquer (C2C), an end-to-end framework that clusters the patches from a WSI into k groups, samples k' patches from each group for training, and uses an adaptive attention mechanism for slide-level prediction. We demonstrate that dividing a WSI into clusters can improve model training by exposing the model to diverse discriminative features extracted from the patches. We regularize the clustering mechanism by introducing a KL-divergence loss between the attention weights of patches in a cluster and the uniform distribution. The framework is optimized end-to-end on slide-level cross-entropy, patch-level cross-entropy, and KL-divergence loss (Implementation: https://github.com/YashSharma/C2C).
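The sampling and regularization steps named in the abstract can be illustrated with a small, dependency-free sketch. This is not the authors' implementation (see the repository linked above for that). The function names, the KL direction KL(p || uniform), the renormalization of attention within each cluster, and the averaging over clusters are all assumptions made here for illustration, since the abstract does not specify them.

```python
import math
import random
from collections import defaultdict


def sample_per_cluster(cluster_ids, k_prime, rng):
    """Pick up to k_prime patch indices from each cluster (hypothetical helper)."""
    groups = defaultdict(list)
    for idx, c in enumerate(cluster_ids):
        groups[c].append(idx)
    picked = []
    for c in sorted(groups):
        members = groups[c]
        # A cluster with fewer than k_prime patches contributes all of them.
        picked.extend(rng.sample(members, min(k_prime, len(members))))
    return picked


def softmax(scores):
    """Numerically stable softmax over a list of raw attention scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def kl_to_uniform(weights):
    """KL(p || u) for a normalized weight vector p and the uniform u on the same support."""
    n = len(weights)
    return sum(p * math.log(p * n) for p in weights if p > 0)


def cluster_kl_loss(attention, cluster_ids):
    """Assumed form of the regularizer: renormalize attention inside each
    cluster, compute KL to uniform, and average over clusters."""
    groups = defaultdict(list)
    for a, c in zip(attention, cluster_ids):
        groups[c].append(a)
    losses = []
    for vals in groups.values():
        total = sum(vals)
        losses.append(kl_to_uniform([v / total for v in vals]))
    return sum(losses) / len(losses)


if __name__ == "__main__":
    rng = random.Random(0)
    cluster_ids = [0, 0, 0, 1, 1, 2]           # toy cluster assignment for 6 patches
    chosen = sample_per_cluster(cluster_ids, k_prime=2, rng=rng)
    scores = [rng.gauss(0.0, 1.0) for _ in chosen]  # stand-in attention scores
    attention = softmax(scores)
    loss = cluster_kl_loss(attention, [cluster_ids[i] for i in chosen])
    print(chosen, round(loss, 4))
```

Under these assumptions the penalty is zero when attention is spread evenly within every cluster and grows as attention concentrates on a few patches of a cluster. In the full framework this term would be combined with the slide-level and patch-level cross-entropy losses.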


Pages: 682-698

Page count: 17
