Probabilistic Attention Based on Gaussian Processes for Deep Multiple Instance Learning

被引:7
|
作者
Schmidt, Arne [1 ]
Morales-Alvarez, Pablo [2 ]
Molina, Rafael [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada 18071, Spain
[2] Univ Granada, Dept Stat & Operat Res, Granada 18071, Spain
基金
欧盟地平线“2020”;
关键词
Attention mechanism; digital pathology; Gaus-sian processes (GPs); multiple instance learning (MIL); whole slide images (WSIs); CROWDS;
D O I
10.1109/TNNLS.2023.3245329
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiple instance learning (MIL) is a weakly supervised learning paradigm that is becoming increasingly popular because it requires less labeling effort than fully supervised methods. This is especially interesting for areas where the creation of large annotated datasets remains challenging, as in medicine. Although recent deep learning MIL approaches have obtained state-of-the-art results, they are fully deterministic and do not provide uncertainty estimations for the predictions. In this work, we introduce the attention Gaussian process (AGP) model, a novel probabilistic attention mechanism based on Gaussian processes (GPs) for deep MIL. AGP provides accurate bag-level predictions as well as instance-level explainability and can be trained end-to-end. Moreover, its probabilistic nature guarantees robustness to overfit on small datasets and uncertainty estimations for the predictions. The latter is especially important in medical applications, where decisions have a direct impact on the patient's health. The proposed model is validated experimentally as follows. First, its behavior is illustrated in two synthetic MIL experiments based on the well-known MNIST and CIFAR-10 datasets, respectively. Then, it is evaluated in three different real-world cancer detection experiments. AGP outperforms state-of-the-art MIL approaches, including deterministic deep learning ones. It shows a strong performance even on a small dataset with less than 100 labels and generalizes better than competing methods on an external test set. Moreover, we experimentally show that predictive uncertainty correlates with the risk of wrong predictions, and therefore it is a good indicator of reliability in practice. Our code is publicly available.
引用
收藏
页码:10909 / 10922
页数:14
相关论文
共 50 条
  • [41] Evaluation of emphysema on thoracic low-dose CTs through attention-based multiple instance deep learning
    Fuhrman, Jordan
    Yip, Rowena
    Zhu, Yeqing
    Jirapatnakul, Artit C.
    Li, Feng
    Henschke, Claudia I.
    Yankelevitz, David F.
    Giger, Maryellen L.
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [42] Multi-scale representation attention based deep multiple instance learning for gigapixel whole slide image analysis
    Xiang, Hangchen
    Shen, Junyi
    Yan, Qingguo
    Xu, Meilian
    Shi, Xiaoshuang
    Zhu, Xiaofeng
    [J]. MEDICAL IMAGE ANALYSIS, 2023, 89
  • [43] Evaluation of emphysema on thoracic low-dose CTs through attention-based multiple instance deep learning
    Jordan Fuhrman
    Rowena Yip
    Yeqing Zhu
    Artit C. Jirapatnakul
    Feng Li
    Claudia I. Henschke
    David F. Yankelevitz
    Maryellen L. Giger
    [J]. Scientific Reports, 13
  • [44] A Machine Consciousness Architecture Based on Deep Learning and Gaussian Processes
    Garrido Merchan, Eduardo C.
    Molina, Martin
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2020, 2020, 12344 : 350 - 361
  • [45] Whole slide images based cancer survival prediction using attention guided deep multiple instance learning networks
    Yao, Jiawen
    Zhu, Xinliang
    Jonnagaddala, Jitendra
    Hawkins, Nicholas
    Huang, Junzhou
    [J]. MEDICAL IMAGE ANALYSIS, 2020, 65
  • [46] Accounting for Dependencies in Deep Learning Based Multiple Instance Learning for Whole Slide Imaging
    Myronenko, Andriy
    Xu, Ziyue
    Yang, Dong
    Roth, Holger R.
    Xu, Daguang
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 329 - 338
  • [47] Deep multiple instance learning versus conventional deep single instance learning for interpretable oral cancer detection
    Koriakina, Nadezhda
    Sladoje, Natasa
    Basic, Vladimir
    Lindblad, Joakim
    [J]. PLOS ONE, 2024, 19 (04):
  • [48] Accurate Screening of COVID-19 Using Attention-Based Deep 3D Multiple Instance Learning
    Han, Zhongyi
    Wei, Benzheng
    Hong, Yanfei
    Li, Tianyang
    Cong, Jinyu
    Zhu, Xue
    Wei, Haifeng
    Zhang, Wei
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (08) : 2584 - 2594
  • [49] Tongue Coating Classification Based on Multiple-Instance Learning and Deep Features
    Li, Xiaoqiang
    Tang, Yonghui
    Sun, Yue
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 504 - 511
  • [50] Positional Multi-Cross-Attention for Bone Age Estimation Using Deep Multiple Instance Learning
    Do, Thanh-Cong
    Yang, Hyung-Jeong
    Kang, Sae-Ryung
    Kim, Soo-Hyung
    Lee, Guee-Sang
    Min, Jung-Joon
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4285 - 4291