Probabilistic Attention Based on Gaussian Processes for Deep Multiple Instance Learning

被引:7
|
作者
Schmidt, Arne [1 ]
Morales-Alvarez, Pablo [2 ]
Molina, Rafael [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada 18071, Spain
[2] Univ Granada, Dept Stat & Operat Res, Granada 18071, Spain
基金
欧盟地平线“2020”;
关键词
Attention mechanism; digital pathology; Gaus-sian processes (GPs); multiple instance learning (MIL); whole slide images (WSIs); CROWDS;
D O I
10.1109/TNNLS.2023.3245329
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiple instance learning (MIL) is a weakly supervised learning paradigm that is becoming increasingly popular because it requires less labeling effort than fully supervised methods. This is especially interesting for areas where the creation of large annotated datasets remains challenging, as in medicine. Although recent deep learning MIL approaches have obtained state-of-the-art results, they are fully deterministic and do not provide uncertainty estimations for the predictions. In this work, we introduce the attention Gaussian process (AGP) model, a novel probabilistic attention mechanism based on Gaussian processes (GPs) for deep MIL. AGP provides accurate bag-level predictions as well as instance-level explainability and can be trained end-to-end. Moreover, its probabilistic nature guarantees robustness to overfit on small datasets and uncertainty estimations for the predictions. The latter is especially important in medical applications, where decisions have a direct impact on the patient's health. The proposed model is validated experimentally as follows. First, its behavior is illustrated in two synthetic MIL experiments based on the well-known MNIST and CIFAR-10 datasets, respectively. Then, it is evaluated in three different real-world cancer detection experiments. AGP outperforms state-of-the-art MIL approaches, including deterministic deep learning ones. It shows a strong performance even on a small dataset with less than 100 labels and generalizes better than competing methods on an external test set. Moreover, we experimentally show that predictive uncertainty correlates with the risk of wrong predictions, and therefore it is a good indicator of reliability in practice. Our code is publicly available.
引用
收藏
页码:10909 / 10922
页数:14
相关论文
共 50 条
  • [1] Attention-based Deep Multiple Instance Learning
    Ilse, Maximilian
    Tomczak, Jakub M.
    Welling, Max
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [2] Combining Attention-Based Multiple Instance Learning and Gaussian Processes for CT Hemorrhage Detection
    Wu, Yunan
    Schmidt, Arne
    Hernandez-Sanchez, Enrique
    Molina, Rafael
    Katsaggelos, Aggelos K.
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 582 - 591
  • [3] ATTENTION-BASED DEEP MULTIPLE INSTANCE LEARNING WITH ADAPTIVE INSTANCE SAMPLING
    Tarkhan, Aliasghar
    Trung Kien Nguyen
    Simon, Noah
    Bengtsson, Thomas
    Ocampo, Paolo
    Dai, Jian
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [4] Multiple instance learning via Gaussian processes
    Minyoung Kim
    Fernando De la Torre
    [J]. Data Mining and Knowledge Discovery, 2014, 28 : 1078 - 1106
  • [5] Multiple instance learning via Gaussian processes
    Kim, Minyoung
    De la Torre, Fernando
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (04) : 1078 - 1106
  • [6] Loss-Based Attention for Deep Multiple Instance Learning
    Shi, Xiaoshuang
    Xing, Fuyong
    Xie, Yuanpu
    Zhang, Zizhao
    Cui, Lei
    Yang, Lin
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5742 - 5749
  • [7] Variational Bayesian Multiple Instance Learning with Gaussian Processes
    Haussmann, Manuel
    Hamprecht, Fred A.
    Kandemir, Melih
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 810 - 819
  • [8] Deep Gaussian processes for multiple instance learning: Application to CT intracranial hemorrhage detection *
    Lopez-Perez, Miguel
    Schmidt, Arne
    Wu, Yunan
    Molina, Rafael
    Katsaggelos, Aggelos K.
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 219
  • [9] Deep Gaussian mixture model based instance relevance estimation for multiple instance learning applications
    Waqas, Muhammad
    Tahir, Muhammad Atif
    Qureshi, Rizwan
    [J]. APPLIED INTELLIGENCE, 2023, 53 (09) : 10310 - 10325
  • [10] Deep Gaussian mixture model based instance relevance estimation for multiple instance learning applications
    Muhammad Waqas
    Muhammad Atif Tahir
    Rizwan Qureshi
    [J]. Applied Intelligence, 2023, 53 : 10310 - 10325