A comprehensive survey of large language models and multimodal large models in medicine

被引:1
|
作者
Xiao, Hanguang [1 ]
Zhou, Feizhong [1 ]
Liu, Xingyue [1 ]
Liu, Tianqi [1 ]
Li, Zhipeng [1 ]
Liu, Xin [1 ]
Huang, Xiaoxuan [1 ]
机构
[1] Chongqing Univ Technol, Sch Artificial Intelligence, Chongqing 401120, Peoples R China
关键词
Large language model; Multimodal large language model; Medicine; Healthcare; Clinical application; ARTIFICIAL-INTELLIGENCE; LIMITS;
D O I
10.1016/j.inffus.2024.102888
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since the release of ChatGPT and GPT-4, large language models (LLMs) and multimodal large language models (MLLMs) have attracted widespread attention for their exceptional capabilities in understanding, reasoning, and generation, introducing transformative paradigms for integrating artificial intelligence into medicine. This survey provides a comprehensive overview of the development, principles, application scenarios, challenges, and future directions of LLMs and MLLMs in medicine. Specifically, it begins by examining the paradigm shift, tracing the transition from traditional models to LLMs and MLLMs, and highlighting the unique advantages of these LLMs and MLLMs in medical applications. Next, the survey reviews existing medical LLMs and MLLMs, providing detailed guidance on their construction and evaluation in a clear and systematic manner. Subsequently, to underscore the substantial value of LLMs and MLLMs in healthcare, the survey explores five promising applications in the field. Finally, the survey addresses the challenges confronting medical LLMs and MLLMs and proposes practical strategies and future directions for their integration into medicine. In summary, this survey offers a comprehensive analysis of the technical methodologies and practical clinical applications of medical LLMs and MLLMs, with the goal of bridging the gap between these advanced technologies and clinical practice, thereby fostering the evolution of the next generation of intelligent healthcare systems.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] The future landscape of large language models in medicine
    Clusmann, Jan
    Kolbinger, Fiona R.
    Muti, Hannah Sophie
    Carrero, Zunamys I.
    Eckardt, Jan-Niklas
    Laleh, Narmin Ghaffari
    Loeffler, Chiara Maria Lavinia
    Schwarzkopf, Sophie-Caroline
    Unger, Michaela
    Veldhuizen, Gregory P.
    Wagner, Sophia J.
    Kather, Jakob Nikolas
    COMMUNICATIONS MEDICINE, 2023, 3 (01):
  • [32] Chatbots and Large Language Models in Family Medicine
    Saguil, Aaron
    AMERICAN FAMILY PHYSICIAN, 2024, 109 (06) : 501 - 502
  • [33] Environmental impact of large language models in medicine
    Kleinig, Oliver
    Sinhal, Shreyans
    Khurram, Rushan
    Gao, Christina
    Spajic, Luke
    Zannettino, Andrew
    Schnitzler, Margaret
    Guo, Christina
    Zaman, Sarah
    Smallbone, Harry
    Ittimani, Mana
    Chan, Weng Onn
    Stretton, Brandon
    Godber, Harry
    Chan, Justin
    Turner, Richard C.
    Warren, Leigh R.
    Clarke, Jonathan
    Sivagangabalan, Gopal
    Marshall-Webb, Matthew
    Moseley, Genevieve
    Driscoll, Simon
    Kovoor, Pramesh
    Chow, Clara K.
    Luo, Yuchen
    Thiagalingam, Aravinda
    Zaka, Ammar
    Gould, Paul
    Ramponi, Fabio
    Gupta, Aashray
    Kovoor, Joshua G.
    Bacchi, Stephen
    INTERNAL MEDICINE JOURNAL, 2024, 54 (12) : 2083 - 2086
  • [34] The future landscape of large language models in medicine
    Jan Clusmann
    Fiona R. Kolbinger
    Hannah Sophie Muti
    Zunamys I. Carrero
    Jan-Niklas Eckardt
    Narmin Ghaffari Laleh
    Chiara Maria Lavinia Löffler
    Sophie-Caroline Schwarzkopf
    Michaela Unger
    Gregory P. Veldhuizen
    Sophia J. Wagner
    Jakob Nikolas Kather
    Communications Medicine, 3
  • [35] A Comprehensive Survey of Attack Techniques, Implementation, and Mitigation Strategies in Large Language Models
    Esmradi, Aysan
    Yip, Daniel Wankit
    Chan, Chun Fai
    UBIQUITOUS SECURITY, UBISEC 2023, 2024, 2034 : 76 - 95
  • [36] Can We Edit Multimodal Large Language Models?
    Cheng, Siyuan
    Tian, Bozhong
    Liu, Qingbin
    Chen, Xi
    Wang, Yongheng
    Chen, Huajun
    Zhang, Ningyu
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13877 - 13888
  • [37] Contextual Object Detection with Multimodal Large Language Models
    Zang, Yuhang
    Li, Wei
    Han, Jun
    Zhou, Kaiyang
    Loy, Chen Change
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (02) : 825 - 843
  • [38] Investigating the Catastrophic Forgetting in Multimodal Large Language Models
    Zhai, Yuexiang
    Tong, Shengbang
    Li, Xiao
    Cai, Mu
    Qu, Qing
    Lee, Yong Jae
    Ma, Yi
    CONFERENCE ON PARSIMONY AND LEARNING, VOL 234, 2024, 234 : 202 - 227
  • [39] Woodpecker: hallucination correction for multimodal large language models
    Yin, Shukang
    Fu, Chaoyou
    Zhao, Sirui
    Xu, Tong
    Wang, Hao
    Sui, Dianbo
    Shen, Yunhang
    Li, Ke
    Sun, Xing
    Chen, Enhong
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (12)
  • [40] Multimodal Food Image Classification with Large Language Models
    Kim, Jun-Hwa
    Kim, Nam-Ho
    Jo, Donghyeok
    Won, Chee Sun
    ELECTRONICS, 2024, 13 (22)