Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification

Authors
Sharma, Yash [1]
Shrivastava, Aman [1]
Ehsan, Lubaina [1]
Moskaluk, Christopher A. [1]
Syed, Sana [1]
Brown, Donald E. [1]
Affiliations
[1] Univ Virginia, Charlottesville, VA 22903 USA
Funding
National Institutes of Health (USA);
Keywords
Deep Learning; Multi-Instance Learning; Weak Supervision; Histopathology;
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, the availability of digitized Whole Slide Images (WSIs) has enabled the use of deep learning-based computer vision techniques for automated disease diagnosis. However, WSIs present unique computational and algorithmic challenges. WSIs are gigapixel-sized (~100K pixels), making them infeasible to use directly for training deep neural networks. Also, often only slide-level labels are available for training, as detailed annotations are tedious and time-consuming for experts to produce. Approaches using multiple-instance learning (MIL) frameworks have been shown to overcome these challenges. Current state-of-the-art approaches divide the learning framework into two decoupled parts: a convolutional neural network (CNN) for encoding the patches, followed by an independent aggregation approach for slide-level prediction. In this approach, the aggregation step has no bearing on the representations learned by the CNN encoder. We propose Cluster-to-Conquer (C2C), an end-to-end framework that clusters the patches from a WSI into k groups, samples k' patches from each group for training, and uses an adaptive attention mechanism for slide-level prediction. We demonstrate that dividing a WSI into clusters can improve model training by exposing it to diverse discriminative features extracted from the patches. We regularize the clustering mechanism by introducing a KL-divergence loss between the attention weights of patches in a cluster and the uniform distribution. The framework is optimized end-to-end on slide-level cross-entropy, patch-level cross-entropy, and KL-divergence loss (Implementation: https://github.com/YashSharma/C2C).
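The two steps named in the abstract, sampling k' patches per cluster and penalizing in-cluster attention that drifts from uniform, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation (see the linked repository for that). The function names `sample_per_cluster` and `kl_to_uniform`, the renormalization of attention within a cluster, and the KL direction KL(p || uniform) are all assumptions made for illustration; the abstract does not specify them.

```python
import math
import random


def sample_per_cluster(cluster_ids, k_prime, rng):
    """Pick up to k_prime patch indices from every cluster.

    cluster_ids[i] is the (hypothetical) cluster label of patch i.
    Returns a dict mapping cluster label -> list of sampled patch indices.
    """
    groups = {}
    for idx, cid in enumerate(cluster_ids):
        groups.setdefault(cid, []).append(idx)
    sampled = {}
    for cid, members in groups.items():
        # Small clusters contribute all of their patches.
        sampled[cid] = rng.sample(members, min(k_prime, len(members)))
    return sampled


def kl_to_uniform(attention):
    """KL(p || u) for one cluster, where p is the attention renormalized
    within the cluster and u is uniform over its patches (assumed direction).
    """
    total = sum(attention)
    n = len(attention)
    kl = 0.0
    for a in attention:
        p = a / total
        if p > 0:  # terms with p == 0 contribute nothing
            kl += p * math.log(p * n)  # log(p / (1/n))
    return kl


# Illustrative usage with made-up cluster labels and attention weights.
labels = [0, 0, 1, 1, 1, 2]
picked = sample_per_cluster(labels, k_prime=2, rng=random.Random(0))
penalty = sum(kl_to_uniform([0.7, 0.3]) for _ in picked)
```

Under these assumptions, uniform attention inside a cluster gives a zero penalty, and the penalty grows as attention concentrates on a single patch.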
Pages: 682-698
Page count: 17