Retrieval-Augmented Multiple Instance Learning

被引:0
|
作者
Cui, Yufei [1 ,2 ]
Liu, Ziquan [3 ]
Chen, Yixin [4 ]
Lu, Yuchen [1 ,5 ]
Yu, Xinyue [1 ,5 ]
Liu, Xue [1 ,2 ]
Kuo, Tei-Wei [6 ,7 ]
Rodrigues, Miguel R. D. [3 ]
Xue, Chun Jason [4 ]
Chan, Antoni B. [4 ]
机构
[1] Mila, Milan, Italy
[2] McGill Univ, Montreal, PQ, Canada
[3] UCL, London, England
[4] City Univ Hong Kong, Hong Kong, Peoples R China
[5] Univ Montreal, Montreal, PQ, Canada
[6] Natl Taiwan Univ, Taipei, Taiwan
[7] MBZUAI, Abu Dhabi, U Arab Emirates
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiple Instance Learning (MIL) is a crucial weakly supervised learning method applied across various domains, e.g., medical diagnosis based on whole slide images (WSIs). Recent advancements in MIL algorithms have yielded exceptional performance when the training and test data originate from the same domain, such as WSIs obtained from the same hospital. However, this paper reveals a performance deterioration of MIL models when tested on an out-of-domain test set, exemplified by WSIs sourced from a novel hospital. To address this challenge, this paper introduces the Retrieval-AugMented MIL (RAM-MIL) framework, which integrates Optimal Transport (OT) as the distance metric for nearest neighbor retrieval. The development of RAM-MIL is driven by two key insights. First, a theoretical discovery indicates that reducing the input's intrinsic dimension can minimize the approximation error in attention-based MIL. Second, previous studies highlight a link between input intrinsic dimension and the feature merging process with the retrieved data. Empirical evaluations conducted on WSI classification demonstrate that the proposed RAM-MIL framework achieves state-of-the-art performance in both in-domain scenarios, where the training and retrieval data are in the same domain, and more crucially, in out-of-domain scenarios, where the (unlabeled) retrieval data originates from a different domain. Furthermore, the use of the transportation matrix derived from OT renders the retrieval results interpretable at the instance level, in contrast to the vanilla l(2) distance, and allows for visualization for human experts.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Multimodal Named Entity Recognition and Relation Extraction with Retrieval-Augmented Strategy
    Hu, Xuming
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3488 - 3488
  • [42] Development and Evaluation of a Retrieval-Augmented Large Language Model Framework for Ophthalmology
    Luo, Ming-Jie
    Pang, Jianyu
    Bi, Shaowei
    Lai, Yunxi
    Zhao, Jiaman
    Shang, Yuanrui
    Cui, Tingxin
    Yang, Yahan
    Lin, Zhenzhe
    Zhao, Lanqin
    Wu, Xiaohang
    Lin, Duoru
    Chen, Jingjing
    Lin, Haotian
    [J]. JAMA OPHTHALMOLOGY, 2024, 142 (09) : 798 - 805
  • [43] Retrieval-augmented large language models for clinical trial screening.
    Tan, Ryan
    Ho, Si Xian
    Oo, Shiyun Vivianna Fequira
    Chua, Shi Ling
    Zaw, Ma Wai Wai
    Tan, Daniel Shao-Weng
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2024, 42 (16)
  • [44] Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering
    Xu, Zhentao
    Cruz, Mark Jerome
    Guevara, Matthew
    Wang, Tie
    Deshpande, Manasi
    Wang, Xiaofeng
    Li, Zheng
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2905 - 2909
  • [45] GastroBot: a Chinese gastrointestinal disease chatbot based on the retrieval-augmented generation
    Zhou, Qingqing
    Liu, Can
    Duan, Yuchen
    Sun, Kaijie
    Li, Yu
    Kan, Hongxing
    Gu, Zongyun
    Shu, Jianhua
    Hu, Jili
    [J]. FRONTIERS IN MEDICINE, 2024, 11
  • [46] FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation
    Hofstatter, Sebastian
    Chen, Jiecao
    Raman, Karthik
    Zamani, Hamed
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1437 - 1447
  • [47] Retrieval-Augmented Response Generation for Knowledge-Grounded Conversation in the Wild
    Ahn, Yeonchan
    Lee, Sang-Goo
    Shim, Junho
    Park, Jaehui
    [J]. IEEE ACCESS, 2022, 10 : 131374 - 131385
  • [48] FABULA: Intelligence Report Generation Using Retrieval-Augmented Narrative Construction
    Ranade, Priyanka
    Joshi, Anupam
    [J]. PROCEEDINGS OF THE 2023 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2023, 2023, : 604 - 611
  • [49] LOCALIZED CONTENT BASED IMAGE RETRIEVAL BY MULTIPLE INSTANCE ACTIVE LEARNING
    Zhang, Dan
    Wang, Fei
    Shi, Zhenwei
    Zhang, Changshui
    [J]. 2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 921 - 924
  • [50] Motion retrieval with ensemble multiple instance learning based on mocap database
    Zhu, Hongli
    Xiang, Jian
    Yu, Fei
    [J]. IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 1445 - +