OphGLM: An ophthalmology large language-and-vision assistant

Cited by: 1
Authors
Deng, Zhuo [1]
Gao, Weihao [1]
Chen, Chucheng [1]
Niu, Zhiyuan [1]
Gong, Zheng [1]
Zhang, Ruiheng [2]
Cao, Zhenjie [1]
Li, Fang [1]
Ma, Zhaoyi [3, 4]
Wei, Wenbin [2]
Ma, Lan [1]
Affiliations
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] Beijing Tongren Hosp, Beijing Tongren Eye Ctr, Beijing, Peoples R China
[3] Natl Hlth Commiss Capacity Bldg, Beijing, Peoples R China
[4] Continuing Educ Ctr, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Ophthalmology; Visual dialogue interaction; Large language models; Artificial intelligence
DOI
10.1016/j.artmed.2024.103001
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Vision-based computer-aided diagnostic methods have been used in early ophthalmic disease screening and diagnosis. However, the limited output formats of these methods lead to poor human-computer interaction and limited clinical applicability. Thus, ophthalmic visual question answering is worth studying. Unfortunately, no practical solutions existed before large language models (LLMs). In this paper, we investigate the ophthalmic visual diagnostic interaction problem. We construct an ophthalmology large language-and-vision assistant, OphGLM, consisting of an image encoder, a text encoder, a fusion module, and an LLM module. We establish a new Chinese ophthalmic fine-tuning dataset, FundusTuning-CN, comprising fundus instruction and conversation sets. Based on FundusTuning-CN, we establish a novel LLM-tuning strategy to introduce visual model understanding and ophthalmic knowledge into LLMs at low cost and with high efficiency. Leveraging the pre-training of the image encoder, OphGLM demonstrates strong visual understanding and surpasses open-source visual language models in common fundus disease classification tasks. FundusTuning-CN enables OphGLM to surpass open-source medical LLMs in both ophthalmic knowledge and interactive capabilities. Our proposed OphGLM has the potential to revolutionize clinical applications in ophthalmology. The dataset, code, and models will be publicly available at https://github.com/ML-AILab/OphGLM.
Pages: 8