Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification

被引:0
|
作者
Sharma, Yash [1 ]
Shrivastava, Aman [1 ]
Ehsan, Lubaina [1 ]
Moskaluk, Christopher A. [1 ]
Syed, Sana [1 ]
Brown, Donald E. [1 ]
机构
[1] Univ Virginia, Charlottesville, VA 22903 USA
基金
美国国家卫生研究院;
关键词
Deep Learning; Multi-Instance Learning; Weak Supervision; Histopathology;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the availability of digitized Whole Slide Images (WSIs) has enabled the use of deep learning-based computer vision techniques for automated disease diagnosis. However, WSIs present unique computational and algorithmic challenges. WSIs are gigapixel-sized (similar to 100K pixels), making them infeasible to be used directly for training deep neural networks. Also, often only slide-level labels are available for training as detailed annotations are tedious and can be time-consuming for experts. Approaches using multiple-instance learning (MIL) frameworks have been shown to overcome these challenges. Current state-of-the-art approaches divide the learning framework into two decoupled parts: a convolutional neural network (CNN) for encoding the patches followed by an independent aggregation approach for slide-level prediction. In this approach, the aggregation step has no bearing on the representations learned by the CNN encoder. We have proposed an end-to-end framework that clusters the patches from a WSI into k-groups, samples k' patches from each group for training, and uses an adaptive attention mechanism for slide level prediction; Cluster-to-Conquer (C2C). We have demonstrated that dividing a WSI into clusters can improve the model training by exposing it to diverse discriminative features extracted from the patches. We regularized the clustering mechanism by introducing a KL-divergence loss between the attention weights of patches in a cluster and the uniform distribution. The framework is optimized end-to-end on slide-level cross-entropy, patch-level cross-entropy, and KL-divergence loss (Implementation: https://github.com/YashSharma/C2C).
引用
收藏
页码:682 / 698
页数:17
相关论文
共 50 条
  • [41] End-to-End Multilevel Hybrid Attention Framework for Hyperspectral Image Classification
    Xiang, Jianhong
    Wei, Chen
    Wang, Minhui
    Teng, Long
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [42] Multi-Instance Multi-Label Learning for Image Classification with Large Vocabularies
    Yakhnenko, Oksana
    Honavar, Vasant
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [43] End-to-End Learning-Based Image Compression With a Decoupled Framework
    Zhang, Zhaobin
    Esenlik, Semih
    Wu, Yaojun
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3067 - 3081
  • [44] RoFormer for Position Aware Multiple Instance Learning in Whole Slide Image Classification
    Pochet, Etienne
    Maroun, Rami
    Trullo, Roger
    [J]. MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT II, 2024, 14349 : 437 - 446
  • [45] Iterative multiple instance learning for weakly annotated whole slide image classification
    Zhou, Yuanpin
    Che, Shuanlong
    Lu, Fang
    Liu, Si
    Yan, Ziye
    Wei, Jun
    Li, Yinghua
    Ding, Xiangdong
    Lu, Yao
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (15):
  • [46] DGMIL: Distribution Guided Multiple Instance Learning for Whole Slide Image Classification
    Qu, Linhao
    Luo, Xiaoyuan
    Liu, Shaolei
    Wang, Manning
    Song, Zhijian
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 24 - 34
  • [47] Neighborhood attention transformer multiple instance learning for whole slide image classification
    Aftab, Rukhma
    Yan, Qiang
    Zhao, Juanjuan
    Yong, Gao
    Huajie, Yue
    Urrehman, Zia
    Khalid, Faizi Mohammad
    [J]. FRONTIERS IN ONCOLOGY, 2024, 14
  • [48] A multi-task-based classification framework for multi-instance distance metric learning
    Hao, Zhifeng
    Ruan, Yibang
    Xiao, Yanshan
    Liu, Bo
    [J]. NEUROCOMPUTING, 2018, 275 : 418 - 429
  • [49] A New multi-instance multi-label learning approach for image and text classification
    Kaobi Yan
    Zhixin Li
    Canlong Zhang
    [J]. Multimedia Tools and Applications, 2016, 75 : 7875 - 7890
  • [50] A New multi-instance multi-label learning approach for image and text classification
    Yan, Kaobi
    Li, Zhixin
    Zhang, Canlong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (13) : 7875 - 7890