Patient-Friendly Discharge Summaries in Korea Based on ChatGPT: Software Development and Validation

被引:0
|
作者
Kim, Hanjae [1 ]
Jin, Hee Min [2 ]
Bin Jung, Yoon [3 ]
You, Seng Chan [2 ]
机构
[1] Yonsei Univ, Coll Nursing, Seoul, South Korea
[2] Yonsei Univ, Dept Biomed Syst Informat, Coll Med, 50-1 Yonsei Ro, Seoul 03722, South Korea
[3] Yonsei Univ, Coll Med, Dept Surg, Seoul, South Korea
关键词
ChatGPT; Artificial Intelligence; Large Language Model; Patient Discharge Summaries; Patient-Centered Care; Documentation;
D O I
暂无
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Although discharge summaries in patient-friendly language can enhance patient comprehension and satisfaction, they can also increase medical staff workload. Using a large language model, we developed and validated software that generates a patient-friendly discharge summary. Methods: We developed and tested the software using 100 discharge summary documents, 50 for patients with myocardial infarction and 50 for patients treated in the Department of General Surgery. For each document, three new summaries were generated using three different prompting methods (Zero-shot, One-shot, and Few-shot) and graded using a 5-point Likert Scale regarding factuality, comprehensiveness, usability, ease, and fluency. We compared the effects of different prompting methods and assessed the relationship between input length and output quality. Results: The mean overall scores differed across prompting methods (4.19 +/- 0.36 in Few-shot, 4.11 +/- 0.36 in One-shot, and 3.73 +/- 0.44 in Zero-shot; P < 0.001). Post-hoc analysis indicated that the scores were higher with Few-shot and One-shot prompts than in zero-shot prompts, whereas there was no significant difference between Few-shot and One-shot prompts. The overall proportion of outputs that scored >= 4 was 77.0% (95% confidence interval: 68.8-85.3%), 70.0% (95% confidence interval [CI], 61.0-79.0%), and 32.0% (95% CI, 22.9-41.1%) with Few-shot, One-shot, and Zero-shot prompts, respectively. The mean factuality score was 4.19 +/- 0.60 with Few-shot, 4.20 +/- 0.55 with One-shot, and 3.82 +/- 0.57 with Zero-shot prompts. Input length and the overall score showed negative correlations in the Zero-shot (r = -0.437, P < 0.001) and One-shot (r = -0.327, P < 0.001) tests but not in the Few-shot (r = -0.050, P = 0.625) tests. Conclusion: Large-language models utilizing Few-shot prompts generally produce acceptable discharge summaries without significant misinformation. Our research highlights the potential of such models in creating patient-friendly discharge summaries for Korean patients to support patient-centered care.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Model-Based Design, Development and Validation for UAS Critical Software
    Daniel Santamaría
    Francisco Alarcón
    Antonio Jiménez
    Antidio Viguria
    Manuel Béjar
    Aníbal Ollero
    Journal of Intelligent & Robotic Systems, 2012, 65 : 103 - 114
  • [32] Multi-channel, convolutional attention based neural model for automated diagnostic coding of unstructured patient discharge summaries
    Mayya, Veena
    Kamath, Sowmya S.
    Krishnan, Gokul S.
    Gangavarapu, Tushaar
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 118 : 374 - 391
  • [33] Home smart-phone based measurement of fecal calprotectin by IBD patients: correlation with laboratory assay and applicability as patient-friendly monitoring tool
    Ungar, B.
    Lahat, A.
    Selinger, L.
    Levhar, N.
    Neuman, S.
    Kopylov, U.
    Yavzori, M.
    Fudim, E.
    Bubis, M.
    Picard, O.
    Eliakim, R.
    Ben-Horin, S.
    JOURNAL OF CROHNS & COLITIS, 2017, 11 : S167 - S168
  • [34] Development and Validation of a Useful Taxonomy of Patient Portals Based on Characteristics of Patient Engagement
    Gloeggler, Michael
    Ammenwerth, Elske
    METHODS OF INFORMATION IN MEDICINE, 2021, 60 : E44 - E55
  • [35] Development and validation of a patient based measure of outcome in ocular melanoma
    Foss, AJE
    Lamping, DL
    Schroter, S
    Hungerford, J
    BRITISH JOURNAL OF OPHTHALMOLOGY, 2000, 84 (04) : 347 - 351
  • [36] Development and validation of a patient-based measure of COPD stability
    Sciurba, Frank
    Rosenzweig, Jacqueline Carranza
    Bailey, William
    Hanania, Nicola
    Zibrak, Joseph
    Donohue, James
    Sharafkhaneh, Amir
    Ferguson, Garry
    Marcus, Phil
    Rosa, Kathleen
    Marcucci, Gretchen
    Piault, Elizabeth
    Martinez, Fernando
    CHEST, 2006, 130 (04) : 98S - 98S
  • [37] Development and Validation of a Patient Discharge Readiness Scale for Daytime Cataract Surgery (DRS-CAT)
    Chen, Chen
    Sun, Yiwen
    Chen, Caifen
    Zhang, Mengyue
    Lin, Shudan
    Dai, Tingting
    Li, Rong
    Huang, Jiali
    Zheng, Jingwei
    Chen, Yanyan
    JOURNAL OF PERIANESTHESIA NURSING, 2024, 39 (02) : 195 - 201.e3
  • [38] VENOPUNCTURE-FREE IVF: MEASUREMENT OF ESTROGEN IN CONTROLLED OVARIAN STIMULATION IVF CYCLES USING A "PATIENT-FRIENDLY" SALIVA-BASED ESTRADIOL ASSAY
    Zimon, A.
    Lannon, B.
    Sheller, S.
    Sakkas, D.
    Ulrich, M.
    Alper, M.
    FERTILITY AND STERILITY, 2013, 100 (03) : S110 - S110
  • [39] Signing message architecture development based on open source and validation on a software platform
    Kaaniche, Walid
    Masmoudi, Mohamed
    2008 INTERNATIONAL CONFERENCE ON DESIGN & TECHNOLOGY OF INTEGRATED SYSTEMS IN NANOSCALE, 2008, : 59 - 63
  • [40] An Integrated Automotive Software Development and Validation System Based on CASOS-OSEK
    Huang, Wuling
    Qiao, Xin
    Ai, Yunfeng
    Yao, Qingming
    Gao, Hui
    PROCEEDINGS OF 2008 IEEE/ASME INTERNATIONAL CONFERENCE ON MECHATRONIC AND EMBEDDED SYSTEMS AND APPLICATIONS, 2008, : 269 - +