Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification

被引:0
|
作者
Sharma, Yash [1 ]
Shrivastava, Aman [1 ]
Ehsan, Lubaina [1 ]
Moskaluk, Christopher A. [1 ]
Syed, Sana [1 ]
Brown, Donald E. [1 ]
机构
[1] Univ Virginia, Charlottesville, VA 22903 USA
基金
美国国家卫生研究院;
关键词
Deep Learning; Multi-Instance Learning; Weak Supervision; Histopathology;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the availability of digitized Whole Slide Images (WSIs) has enabled the use of deep learning-based computer vision techniques for automated disease diagnosis. However, WSIs present unique computational and algorithmic challenges. WSIs are gigapixel-sized (similar to 100K pixels), making them infeasible to be used directly for training deep neural networks. Also, often only slide-level labels are available for training as detailed annotations are tedious and can be time-consuming for experts. Approaches using multiple-instance learning (MIL) frameworks have been shown to overcome these challenges. Current state-of-the-art approaches divide the learning framework into two decoupled parts: a convolutional neural network (CNN) for encoding the patches followed by an independent aggregation approach for slide-level prediction. In this approach, the aggregation step has no bearing on the representations learned by the CNN encoder. We have proposed an end-to-end framework that clusters the patches from a WSI into k-groups, samples k' patches from each group for training, and uses an adaptive attention mechanism for slide level prediction; Cluster-to-Conquer (C2C). We have demonstrated that dividing a WSI into clusters can improve the model training by exposing it to diverse discriminative features extracted from the patches. We regularized the clustering mechanism by introducing a KL-divergence loss between the attention weights of patches in a cluster and the uniform distribution. The framework is optimized end-to-end on slide-level cross-entropy, patch-level cross-entropy, and KL-divergence loss (Implementation: https://github.com/YashSharma/C2C).
引用
收藏
页码:682 / 698
页数:17
相关论文
共 50 条
  • [1] Multi-Instance Aware Localization for End-to-End Imitation Learning
    Venkatesh, Sagar Gubbi
    Upadrashta, Raviteja
    Kolathaya, Shishir
    Amrutur, Bharadwaj
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5225 - 5230
  • [2] MuRCL: Multi-Instance Reinforcement Contrastive Learning for Whole Slide Image Classification
    Zhu, Zhonghang
    Yu, Lequan
    Wu, Wei
    Yu, Rongshan
    Zhang, Defu
    Wang, Liansheng
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (05) : 1337 - 1348
  • [3] Multi-scale multi-instance contrastive learning for whole slide image classification
    Zhang, Jianan
    Hao, Fang
    Liu, Xueyu
    Yao, Shupei
    Wu, Yongfei
    Li, Ming
    Zheng, Wen
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [4] CoSSD - An end-to-end framework for multi-instance source separation and detection
    Baligar, Shrishail
    Newsam, Shawn
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 150 - 154
  • [5] Clustering-Based Multi-instance Learning Network for Whole Slide Image Classification
    Wu, Wei
    Zhu, Zhonghang
    Magnier, Baptiste
    Wang, Liansheng
    [J]. COMPUTATIONAL MATHEMATICS MODELING IN CANCER ANALYSIS, CMMCA 2022, 2022, 13574 : 100 - 109
  • [6] Second-order multi-instance learning model for whole slide image classification
    Wang, Qian
    Zou, Ying
    Zhang, Jianxin
    Liu, Bin
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2021, 66 (14):
  • [7] RMDL: Recalibrated multi-instance deep learning for whole slide gastric image classification
    Wang, Shujun
    Zhu, Yaxi
    Yu, Lequan
    Chen, Hao
    Lin, Huangjing
    Wan, Xiangbo
    Fan, Xinjuan
    Heng, Pheng-Ann
    [J]. MEDICAL IMAGE ANALYSIS, 2019, 58
  • [8] Deep multi-instance learning for end-to-end person re-identification
    Yuan, Caihong
    Xu, Chunyan
    Wang, Tianjiang
    Liu, Fang
    Zhao, Zhiqiang
    Feng, Ping
    Guo, Jingjuan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (10) : 12437 - 12467
  • [9] Deep multi-instance learning for end-to-end person re-identification
    Caihong Yuan
    Chunyan Xu
    Tianjiang Wang
    Fang Liu
    Zhiqiang Zhao
    Ping Feng
    Jingjuan Guo
    [J]. Multimedia Tools and Applications, 2018, 77 : 12437 - 12467
  • [10] End-to-end Multiple Instance Learning for Whole-Slide Cytopathology of Urothelial Carcinoma
    Butke, Joshua
    Frick, Tatjana
    Roghmann, Florian
    El-Mashtoly, Samir F.
    Gerwert, Klaus
    Mosig, Axel
    [J]. MICCAI WORKSHOP ON COMPUTATIONAL PATHOLOGY, VOL 156, 2021, 156 : 57 - 68