Aligning Medical Images with General Knowledge from Large Language Models

Cited by: 0

Authors:

Fang, Xiao ^{[1]}

Lin, Yi ^{[1]}

Zhang, Dong ^{[2]}

Cheng, Kwang-Ting ^{[2]}

Chen, Hao ^{[1,3,4]}

Affiliations:

[1] HKUST, Dept Comp Sci & Engn, Hong Kong, Peoples R China

[2] HKUST, Dept Elect & Comp Engn, Hong Kong, Peoples R China

[3] HKUST, Dept Chem & Biol Engn, Hong Kong, Peoples R China

[4] HKUST Shenzhen Hong Kong Collaborat Innovat Res I, Shenzhen, Peoples R China

Source:

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT X | 2024 / Vol. 15010

Keywords:

Prompt Learning; Vision-Language Models; Large Language Model; Medical Image Analysis;

DOI:

10.1007/978-3-031-72117-5_6

Chinese Library Classification number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Pre-trained large vision-language models (VLMs) like CLIP have revolutionized visual representation learning using natural language as supervision and have demonstrated promising generalization ability. In this work, we propose ViP, a novel visual symptom-guided prompt learning framework for medical image analysis, which facilitates general knowledge transfer from CLIP. ViP consists of two key components: a visual symptom generator (VSG) and a dual-prompt network. Specifically, VSG aims to extract explicable visual symptoms from pre-trained large language models, while the dual-prompt network uses these visual symptoms to guide the training of two learnable prompt modules, i.e., context prompt and merge prompt, effectively adapting our framework to medical image analysis via large VLMs. Extensive experimental results demonstrate that ViP can outperform state-of-the-art methods on two challenging datasets. The code is available at https://github.com/xiaofang007/ViP.


Pages: 57-67

Number of pages: 11

