Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification

被引：0

作者：

Sharma, Yash ^{[1
]}

Shrivastava, Aman ^{[1
]}

Ehsan, Lubaina ^{[1
]}

Moskaluk, Christopher A. ^{[1
]}

Syed, Sana ^{[1
]}

Brown, Donald E. ^{[1
]}

机构：

[1] Univ Virginia, Charlottesville, VA 22903 USA

来源：

MEDICAL IMAGING WITH DEEP LEARNING, VOL 143 | 2021年 / 143卷

基金：

美国国家卫生研究院;

关键词：

Deep Learning; Multi-Instance Learning; Weak Supervision; Histopathology;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, the availability of digitized Whole Slide Images (WSIs) has enabled the use of deep learning-based computer vision techniques for automated disease diagnosis. However, WSIs present unique computational and algorithmic challenges. WSIs are gigapixel-sized (similar to 100K pixels), making them infeasible to be used directly for training deep neural networks. Also, often only slide-level labels are available for training as detailed annotations are tedious and can be time-consuming for experts. Approaches using multiple-instance learning (MIL) frameworks have been shown to overcome these challenges. Current state-of-the-art approaches divide the learning framework into two decoupled parts: a convolutional neural network (CNN) for encoding the patches followed by an independent aggregation approach for slide-level prediction. In this approach, the aggregation step has no bearing on the representations learned by the CNN encoder. We have proposed an end-to-end framework that clusters the patches from a WSI into k-groups, samples k' patches from each group for training, and uses an adaptive attention mechanism for slide level prediction; Cluster-to-Conquer (C2C). We have demonstrated that dividing a WSI into clusters can improve the model training by exposing it to diverse discriminative features extracted from the patches. We regularized the clustering mechanism by introducing a KL-divergence loss between the attention weights of patches in a cluster and the uniform distribution. The framework is optimized end-to-end on slide-level cross-entropy, patch-level cross-entropy, and KL-divergence loss (Implementation: https://github.com/YashSharma/C2C).

引用

页码：682 / 698

页数：17

共 50 条

[31] ReMix: A General and Efficient Framework for Multiple Instance Learning Based Whole Slide Image Classification
Yang, Jiawei
Chen, Hanbo
Zhao, Yu
Yang, Fan
Zhang, Yao
He, Lei
Yao, Jianhua
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 35 - 45
[32] Learning Multi-Instance Deep Discriminative Patterns for Image Classification
Tang, Peng
Wang, Xinggang
Feng, Bin
Liu, Wenyu
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (07) : 3385 - 3396
[33] End-to-end learning of representations for instance-level document image retrieval
Liu, Li
Lu, Yue
Suen, Ching Y.
[J]. APPLIED SOFT COMPUTING, 2023, 136
[34] END-TO-END LEARNING OF POLYGONS FOR REMOTE SENSING IMAGE CLASSIFICATION
Girard, Nicolas
Tarabalka, Yuliya
[J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2083 - 2086
[35] Rethinking Overfitting of Multiple Instance Learning for Whole Slide Image Classification
Song, Hongjian
Tang, Jie
Xiao, Hongzhao
Hu, Juncheng
[J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 546 - 551
[36] Multiple Instance Learning with random sampling for Whole Slide Image Classification
Keshvarikhojasteh, H.
Pluim, J. P. W.
Veta, M.
[J]. DIGITAL AND COMPUTATIONAL PATHOLOGY, MEDICAL IMAGING 2024, 2024, 12933
[37] CaMIL: Causal Multiple Instance Learning for Whole Slide Image Classification
Chen, Kaitao
Sun, Shiliang
Zhao, Jing
[J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1120 - 1128
[38] DEEP HIERARCHICAL MULTIPLE INSTANCE LEARNING FOR WHOLE SLIDE IMAGE CLASSIFICATION
Zhou, Yuanpin
Lu, Yao
[J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
[39] ACTNET: End-to-End Learning of Feature Activations and Multi-stream Aggregation for Effective Instance Image Retrieval
Husain, Syed Sameed
Ong, Eng-Jon
Bober, Miroslaw
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (05) : 1432 - 1450
[40] ACTNET: End-to-End Learning of Feature Activations and Multi-stream Aggregation for Effective Instance Image Retrieval
Syed Sameed Husain
Eng-Jon Ong
Miroslaw Bober
[J]. International Journal of Computer Vision, 2021, 129 : 1432 - 1450

← 1 2 3 4 5 →