Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification

Cited by: 0

Authors:

Sharma, Yash [1]

Shrivastava, Aman [1]

Ehsan, Lubaina [1]

Moskaluk, Christopher A. [1]

Syed, Sana [1]

Brown, Donald E. [1]

Affiliations:

[1] Univ Virginia, Charlottesville, VA 22903 USA

Source:

MEDICAL IMAGING WITH DEEP LEARNING, VOL 143 | 2021 / Volume 143

Funding:

National Institutes of Health (USA);

Keywords:

Deep Learning; Multi-Instance Learning; Weak Supervision; Histopathology;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, the availability of digitized Whole Slide Images (WSIs) has enabled the use of deep learning-based computer vision techniques for automated disease diagnosis. However, WSIs present unique computational and algorithmic challenges. WSIs are gigapixel-sized (~100K pixels), making it infeasible to use them directly for training deep neural networks. Also, often only slide-level labels are available for training, as detailed annotations are tedious and time-consuming for experts. Approaches using multiple-instance learning (MIL) frameworks have been shown to overcome these challenges. Current state-of-the-art approaches divide the learning framework into two decoupled parts: a convolutional neural network (CNN) for encoding the patches, followed by an independent aggregation approach for slide-level prediction. In this approach, the aggregation step has no bearing on the representations learned by the CNN encoder. We propose Cluster-to-Conquer (C2C), an end-to-end framework that clusters the patches from a WSI into k groups, samples k' patches from each group for training, and uses an adaptive attention mechanism for slide-level prediction. We demonstrate that dividing a WSI into clusters can improve model training by exposing it to diverse discriminative features extracted from the patches. We regularize the clustering mechanism by introducing a KL-divergence loss between the attention weights of patches in a cluster and the uniform distribution. The framework is optimized end-to-end on slide-level cross-entropy, patch-level cross-entropy, and KL-divergence loss (Implementation: https://github.com/YashSharma/C2C).
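
The two mechanisms named in the abstract, sampling k' patches from each cluster and penalizing attention weights that drift from uniform within a cluster, can be illustrated with a minimal sketch. It is not the authors' implementation (see the linked repository for that). The function names, the toy cluster labels, the raw attention scores, and the KL direction (attention relative to uniform) are all assumptions made for illustration, and cluster assignments are taken as already computed.

```python
import math
import random
from collections import defaultdict


def sample_per_cluster(cluster_ids, k_prime, rng):
    """Group patch indices by cluster label, then draw up to k_prime per cluster."""
    groups = defaultdict(list)
    for patch_idx, cluster in enumerate(cluster_ids):
        groups[cluster].append(patch_idx)
    return {
        cluster: rng.sample(members, min(k_prime, len(members)))
        for cluster, members in groups.items()
    }


def softmax(scores):
    """Numerically stable softmax over a list of raw attention scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def kl_to_uniform(weights):
    """KL(weights || uniform) for one cluster (assumed direction); zero iff uniform."""
    n = len(weights)
    return sum(w * math.log(w * n) for w in weights if w > 0.0)


# Hypothetical toy slide: 8 patches already assigned to k = 3 clusters.
cluster_ids = [0, 0, 1, 2, 1, 0, 2, 2]
sampled = sample_per_cluster(cluster_ids, k_prime=2, rng=random.Random(0))

# Hypothetical raw attention scores for the patches of a single cluster.
flat = kl_to_uniform(softmax([0.5, 0.5, 0.5]))
peaked = kl_to_uniform(softmax([4.0, 0.0, 0.0]))
```

In this sketch, `flat` is zero because equal scores give uniform attention, while `peaked` is positive. Under the stated assumptions, that positive value is the kind of penalty that would be added to the slide-level and patch-level cross-entropy terms.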


Pages: 682-698

Page count: 17

