Retrieval-Augmented Multiple Instance Learning

被引:0
|
作者
Cui, Yufei [1 ,2 ]
Liu, Ziquan [3 ]
Chen, Yixin [4 ]
Lu, Yuchen [1 ,5 ]
Yu, Xinyue [1 ,5 ]
Liu, Xue [1 ,2 ]
Kuo, Tei-Wei [6 ,7 ]
Rodrigues, Miguel R. D. [3 ]
Xue, Chun Jason [4 ]
Chan, Antoni B. [4 ]
机构
[1] Mila, Milan, Italy
[2] McGill Univ, Montreal, PQ, Canada
[3] UCL, London, England
[4] City Univ Hong Kong, Hong Kong, Peoples R China
[5] Univ Montreal, Montreal, PQ, Canada
[6] Natl Taiwan Univ, Taipei, Taiwan
[7] MBZUAI, Abu Dhabi, U Arab Emirates
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiple Instance Learning (MIL) is a crucial weakly supervised learning method applied across various domains, e.g., medical diagnosis based on whole slide images (WSIs). Recent advancements in MIL algorithms have yielded exceptional performance when the training and test data originate from the same domain, such as WSIs obtained from the same hospital. However, this paper reveals a performance deterioration of MIL models when tested on an out-of-domain test set, exemplified by WSIs sourced from a novel hospital. To address this challenge, this paper introduces the Retrieval-AugMented MIL (RAM-MIL) framework, which integrates Optimal Transport (OT) as the distance metric for nearest neighbor retrieval. The development of RAM-MIL is driven by two key insights. First, a theoretical discovery indicates that reducing the input's intrinsic dimension can minimize the approximation error in attention-based MIL. Second, previous studies highlight a link between input intrinsic dimension and the feature merging process with the retrieved data. Empirical evaluations conducted on WSI classification demonstrate that the proposed RAM-MIL framework achieves state-of-the-art performance in both in-domain scenarios, where the training and retrieval data are in the same domain, and more crucially, in out-of-domain scenarios, where the (unlabeled) retrieval data originates from a different domain. Furthermore, the use of the transportation matrix derived from OT renders the retrieval results interpretable at the instance level, in contrast to the vanilla l(2) distance, and allows for visualization for human experts.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
    Lewis, Patrick
    Perez, Ethan
    Piktus, Aleksandra
    Petroni, Fabio
    Karpukhin, Vladimir
    Goyal, Naman
    Kuttler, Heinrich
    Lewis, Mike
    Yih, Wen-tau
    Rocktaschel, Tim
    Riedel, Sebastian
    Kiela, Douwe
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [32] REALM: Retrieval-Augmented Language Model Pre-Training
    Guu, Kelvin
    Lee, Kenton
    Tung, Zora
    Pasupat, Panupong
    Chang, Ming-Wei
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [33] Retrieval-Augmented Mining of Temporal Logic Specifications from Data
    Saveri, Gaia
    Bortolussi, Luca
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 315 - 331
  • [34] Interactive AI With Retrieval-Augmented Generation for Next Generation Networking
    Zhang, Ruichen
    Du, Hongyang
    Liu, Yinqiu
    Niyato, Dusit
    Kang, Jiawen
    Sun, Sumei
    Shen, Xuemin
    Poor, H. Vincent
    [J]. IEEE Network, 2024, 38 (06): : 414 - 424
  • [35] Retrieval-Augmented Knowledge Graph Reasoning for Commonsense Question Answering
    Sha, Yuchen
    Feng, Yujian
    He, Miao
    Liu, Shangdong
    Ji, Yimu
    [J]. MATHEMATICS, 2023, 11 (15)
  • [36] GRAPH-BASED MULTIPLE-INSTANCE LEARNING WITH INSTANCE WEIGHTING FOR IMAGE RETRIEVAL
    Li, Fei
    Liu, Rujie
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [37] Classroom Video Assessment and Retrieval via Multiple Instance Learning
    Qiao, Qifeng
    Beling, Peter A.
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION, 2011, 6738 : 272 - 279
  • [38] An online multiple instance learning system for semantic image retrieval
    Zhang, Chengcui
    Chen, Xin
    Chen, Wei-Bang
    [J]. ISM WORKSHOPS 2007: NINTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA - WORKSHOPS, PROCEEDINGS, 2007, : 83 - 84
  • [39] IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
    Yang, Diji
    Rao, Jinmeng
    Chen, Kezhen
    Guo, Xiaoyuan
    Zhang, Yawen
    Yang, Jie
    Zhang, Yi
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 730 - 740
  • [40] Motion retrieval based on multiple instance learning by isomap and RBF
    Xiang, Jian
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL SYMPOSIUM ON DATA, PRIVACY, AND E-COMMERCE, 2007, : 113 - +