Aligning Medical Images with General Knowledge from Large Language Models

被引:0
|
作者
Fang, Xiao [1 ]
Lin, Yi [1 ]
Zhang, Dong [2 ]
Cheng, Kwang-Ting [2 ]
Chen, Hao [1 ,3 ,4 ]
机构
[1] HKUST, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] HKUST, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[3] HKUST, Dept Chem & Biol Engn, Hong Kong, Peoples R China
[4] HKUST Shenzhen Hong Kong Collaborat Innovat Res I, Shenzhen, Peoples R China
关键词
Prompt Learning; Vision-Language Models; Large Language Model; Medical Image Analysis;
D O I
10.1007/978-3-031-72117-5_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pre-trained large vision-language models (VLMs) like CLIP have revolutionized visual representation learning using natural language as supervisions, and demonstrated promising generalization ability. In this work, we propose ViP, a novel visual symptom-guided prompt learning framework for medical image analysis, which facilitates general knowledge transfer from CLIP. ViP consists of two key components: a visual symptom generator (VSG) and a dual-prompt network. Specifically, VSG aims to extract explicable visual symptoms from pre-trained large language models, while the dual-prompt network utilizes these visual symptoms to guide the training on two learnable prompt modules, i.e., context prompt and merge prompt, which effectively adapts our framework to medical image analysis via large VLMs. Extensive experimental results demonstrate that ViP can outperform state-of-the-art methods on two challenging datasets. The code is available at https://github.com/xiaofang007/ViP.
引用
收藏
页码:57 / 67
页数:11
相关论文
共 50 条
  • [41] Leveraging Medical Knowledge Graphs Into Large Language Models for Diagnosis Prediction: Design and Application Study
    Gao, Yanjun
    Li, Ruizhe
    Croxford, Emma
    Caskey, John
    Patterson, Brian W.
    Churpek, Matthew
    Miller, Timothy
    Dligach, Dmitriy
    Afshar, Majid
    JMIR AI, 2025, 4
  • [42] Large Language Models can Share Images, Too!
    Lee, Young-Jun
    Lee, Dokyong
    Sung, Joo Won
    Hyeon, Jonghwan
    Choi, Ho-Jin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 692 - 713
  • [43] From language models to large-scale food and biomedical knowledge graphs
    Gjorgjina Cenikj
    Lidija Strojnik
    Risto Angelski
    Nives Ogrinc
    Barbara Koroušić Seljak
    Tome Eftimov
    Scientific Reports, 13
  • [44] The Promise and Challenge of Large Language Models for Knowledge Engineering: Insights from a Hackathon
    Walker, Johanna
    Koutsiana, Elisavet
    Nwachukwu, Michelle
    Merono-Penuela, Albert
    Simperl, Elena
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [45] Enhancing Sequential Recommenders with Augmented Knowledge from Aligned Large Language Models
    Ren, Yankun
    Chen, Zhongde
    Yang, Xinxing
    Li, Longfei
    Jiang, Cong
    Cheng, Lei
    Zhang, Bo
    Mo, Linjian
    Zhou, Jun
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 345 - 354
  • [46] From language models to large-scale food and biomedical knowledge graphs
    Cenikj, Gjorgjina
    Strojnik, Lidija
    Angelski, Risto
    Ogrinc, Nives
    Seljak, Barbara Korousic
    Eftimov, Tome
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [47] Queryfy: from knowledge graphs to questions using open Large Language Models
    Brei, Felix
    Meyer, Lars-Peter
    Martin, Michael
    IT-INFORMATION TECHNOLOGY, 2025,
  • [48] Had Enough of Experts? Quantitative Knowledge Retrieval From Large Language Models
    Selby, David
    Iwashita, Yuichiro
    Spriestersbach, Kai
    Saad, Mohammad
    Bappert, Dennis
    Warrier, Archana
    Mukherjee, Sumantrak
    Kise, Koichi
    Vollmer, Sebastian
    STAT, 2025, 14 (02):
  • [49] Assessing the Utilization of Large Language Models in Medical Education: Insights From Undergraduate Medical Students
    Biri, Sairavi Kiran
    Kumar, Subir
    Panigrahi, Muralidhar
    Mondal, Shaikat
    Behera, Joshil Kumar
    Himel, Mondal
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (10)
  • [50] Evaluating the Effectiveness of advanced large language models in medical Knowledge: A Comparative study using Japanese national medical examination
    Liu, Mingxin
    Okuhara, Tsuyoshi
    Dai, Zhehao
    Huang, Wenbo
    Gu, Lin
    Okada, Hiroko
    Furukawa, Emi
    Kiuchi, Takahiro
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2025, 193