Cross sectional pilot study on clinical review generation using large language models

被引:0
|
作者
Luo, Zining [1 ,2 ,3 ,4 ,5 ,6 ]
Qiao, Yang [7 ]
Xu, Xinyu [6 ]
Li, Xiangyu [6 ]
Xiao, Mengyan [6 ]
Kang, Aijia [8 ]
Wang, Dunrui [2 ,3 ]
Pang, Yueshan [9 ]
Xie, Xing [1 ,10 ,11 ]
Xie, Sijun [1 ,10 ,11 ]
Luo, Dachen [12 ]
Ding, Xuefeng [13 ]
Liu, Zhenglong [4 ,5 ]
Liu, Ying [6 ]
Hu, Aimin [14 ]
Ren, Yixing [1 ,10 ,11 ]
Xie, Jiebin [1 ,10 ,11 ]
机构
[1] North Sichuan Med Coll, Dept Gastrointestinal Surg, Affiliated Hosp, Nanchong, Sichuan, Peoples R China
[2] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow, Lanark, Scotland
[3] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu, Sichuan, Peoples R China
[4] North Sichuan Med Coll, Sch Basic Med, Nanchong, Sichuan, Peoples R China
[5] North Sichuan Med Coll, Sch Forens Med, Nanchong, Sichuan, Peoples R China
[6] North Sichuan Med Coll, Dept Stomatol, Nanchong, Sichuan, Peoples R China
[7] North Sichuan Med Coll, Dept Biomed Engn, Nanchong, Sichuan, Peoples R China
[8] North Sichuan Med Coll, Dept Aesthesia, Nanchong, Sichuan, Peoples R China
[9] Nanchong Cent Hosp, Clin Med Coll 2, Dept Geriatr, North Sichuan Med Coll, Nanchong, Sichuan, Peoples R China
[10] North Sichuan Med Coll, Dept Gen Surg, Affiliated Hosp, Nanchong, Sichuan, Peoples R China
[11] North Sichuan Med Coll, Inst Hepatobiliary Pancreas & Intestinal Dis, Affiliated Hosp, Nanchong, Sichuan, Peoples R China
[12] North Sichuan Med Coll, Affiliated Hosp, Dept Resp & Crit Care Med, Nanchong, Sichuan, Peoples R China
[13] North Sichuan Med Coll, Dept Crit Care Med, Affiliated Hosp, Nanchong, Sichuan, Peoples R China
[14] North Sichuan Med Coll, Dept Foreign Languages & Culture, Nanchong, Sichuan, Peoples R China
来源
NPJ DIGITAL MEDICINE | 2025年 / 8卷 / 01期
关键词
D O I
10.1038/s41746-025-01535-z
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
As the volume of medical literature accelerates, necessitating efficient tools to synthesize evidence for clinical practice and research, the interest in leveraging large language models (LLMs) for generating clinical reviews has surged. However, there are significant concerns regarding the reliability associated with integrating LLMs into the clinical review process. This study presents a systematic comparison between LLM-generated and human-authored clinical reviews, revealing that while AI can quickly produce reviews, it often has fewer references, less comprehensive insights, and lower logical consistency while exhibiting lower authenticity and accuracy in their citations. Additionally, a higher proportion of its references are from lower-tier journals. Moreover, the study uncovers a concerning inefficiency in current detection systems for identifying AI-generated content, suggesting a need for more advanced checking systems and a stronger ethical framework to ensure academic transparency. Addressing these challenges is vital for the responsible integration of LLMs into clinical research.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Clinical Accuracy of Large Language Models and Google Search Responses to Postpartum Depression Questions: Cross-Sectional Study
    Sezgin, Emre
    Chekeni, Faraaz
    Lee, Jennifer
    Keim, Sarah
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [2] Practical Aspects of Using Large Language Models toScreen Abstracts for Cardiovascular Drug Development:Cross-Sectional Study
    Ronquillo, Jay G.
    Ye, Jamie
    Gorman, Donal
    Lemeshow, Adina R.
    Watt, Stephen J.
    JMIR MEDICAL INFORMATICS, 2024, 12
  • [3] Clinical Accuracy, Relevance, Clarity, and Emotional Sensitivityof Large Language Models to Surgical Patient Questions:Cross-Sectional Study
    Dagli, Mert Marcel
    Oettl, Felix Conrad
    Ujral, Jaskeerat
    Malhotra, Kashish
    Ghenbot, Yohannes
    Yoon, Jang W.
    Ozturk, Ali K.
    Welch, William C.
    JMIR FORMATIVE RESEARCH, 2024, 8
  • [4] Assessing the Efficacy of Large Language Models in Health Literacy: A Comprehensive Cross-Sectional Study
    Amin, Kanhai S.
    Mayes, Linda C.
    Khosla, Pavan
    Doshi, Rushabh H.
    YALE JOURNAL OF BIOLOGY AND MEDICINE, 2024, 97 (01): : 17 - 27
  • [5] CONCEPTUAL DESIGN GENERATION USING LARGE LANGUAGE MODELS
    Ma, Kevin
    Grandi, Daniele
    McComb, Christopher
    Goucher-Lambert, Kosa
    PROCEEDINGS OF ASME 2023 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2023, VOL 6, 2023,
  • [6] Evaluating Large Language Models for Automated Reporting and Data Systems Categorization: Cross-Sectional Study
    Wu, Qingxia
    Li, Huali
    Wang, Yan
    Bai, Yan
    Wu, Yaping
    Yu, Xuan
    Li, Xiaodong
    Dong, Pei
    Xue, Jon
    Shen, Dinggang
    Wang, Meiyun
    JMIR MEDICAL INFORMATICS, 2024, 12
  • [7] Comparison of Large Language Models in Answering Immuno-Oncology Questions: A Cross-Sectional Study
    Iannantuono, Giovanni Maria
    Bracken-Clarke, Dara
    Karzai, Fatima
    Choo-Wosoba, Hyoyoung
    Gulley, James L.
    Floudas, Charalampos S.
    ONCOLOGIST, 2024, : 407 - 414
  • [8] Automating Patch Set Generation from Code Review Comments Using Large Language Models
    Rahman, Tajmilur
    Singh, Rahul
    Sultan, Mir Yousuf
    PROCEEDINGS 2024 IEEE/ACM 3RD INTERNATIONAL CONFERENCE ON AI ENGINEERING-SOFTWARE ENGINEERING FOR AI, CAIN 2024, 2024, : 273 - 274
  • [9] Transforming Informed Consent Generation Using Large Language Models: Mixed Methods Study
    Shi, Qiming
    Luzuriaga, Katherine
    Allison, Jeroan J.
    Oztekin, Asil
    Faro, Jamie M.
    Lee, Joy L.
    Hafer, Nathaniel
    Mcmanus, Margaret
    Zai, Adrian H.
    JMIR MEDICAL INFORMATICS, 2025, 13
  • [10] Query Generation Using Large Language Models A Reproducibility Study of Unsupervised Passage Reranking
    Rau, David
    Kamps, Jaap
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT IV, 2024, 14611 : 226 - 239