Evaluating the accuracy and relevance of ChatGPT responses to frequently asked questions regarding total knee replacement

Cited by: 7
Authors
Zhang, Siyuan [1]
Liau, Zi Qiang Glen [1]
Tan, Kian Loong Melvin [1]
Chua, Wei Liang [1]
Affiliations
[1] National University Health System, Department of Orthopaedic Surgery, Level 11, NUHS Tower Block, 1E Kent Ridge Rd, Singapore 119228, Singapore
Keywords
ChatGPT; Artificial intelligence; Chatbot; Large language model; Total knee replacement; Total knee arthroplasty; ARTHROPLASTY;
DOI
10.1186/s43019-024-00218-5
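Chinese Library Classification (CLC) number
R826.8 [Plastic surgery]; R782.2 [Oral and maxillofacial plastic surgery]; R726.2 [Pediatric plastic surgery]; R62 [Plastic surgery (reconstructive surgery)];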
Abstract
Background: Chat Generative Pretrained Transformer (ChatGPT), a generative artificial intelligence chatbot, may have broad applications in healthcare delivery and patient education due to its ability to provide human-like responses to a wide range of patient queries. However, there is limited evidence regarding its ability to provide reliable and useful information on orthopaedic procedures. This study evaluates the accuracy and relevance of ChatGPT's responses to frequently asked questions (FAQs) regarding total knee replacement (TKR).
Methods: A list of 50 clinically relevant FAQs regarding TKR was collated. Each question was entered individually as a prompt to ChatGPT (version 3.5), and the first response generated was recorded. Two independent orthopaedic surgeons then reviewed the responses and graded each on a Likert scale for factual accuracy and relevance. Using preset thresholds on the Likert scale, the responses were classified as accurate or inaccurate and as relevant or irrelevant.
Results: Most responses were accurate, and all were relevant. Of the 50 responses, 44 (88%) were classified as accurate, with a mean Likert grade of 4.6/5 for factual accuracy. All 50 (100%) were classified as relevant, with a mean Likert grade of 4.9/5 for relevance.
Conclusion: ChatGPT performed well in providing accurate and relevant responses to FAQs regarding TKR, demonstrating great potential as a tool for patient education. However, it is not infallible and can occasionally provide inaccurate medical information. Patients and clinicians intending to use this technology should be mindful of its limitations and ensure that the information it provides is adequately supervised and verified.
Pages: 8
Related papers
50 in total
  • [31] ChatSLE: consulting ChatGPT-4 for 100 frequently asked lupus questions
    Haase, Isabell
    Xiong, Tingting
    Rissmann, Antonia
    Knitza, Johannes
    Greenfield, Julia
    Krusche, Martin
    LANCET RHEUMATOLOGY, 2024, 6 (04) : e196 - e199
  • [32] Assessing the Accuracy of Responses by the Language Model ChatGPT to Questions Regarding Bariatric Surgery
    Jamil S. Samaan
    Yee Hui Yeo
    Nithya Rajeev
    Lauren Hawley
    Stuart Abel
    Wee Han Ng
    Nitin Srinivasan
    Justin Park
    Miguel Burch
    Rabindra Watson
    Omer Liran
    Kamran Samakar
    Obesity Surgery, 2023, 33 : 1790 - 1796
  • [33] Assessing the Accuracy of Responses by the Language Model ChatGPT to Questions Regarding Bariatric Surgery
    Samaan, Jamil S.
    Yeo, Yee Hui
    Rajeev, Nithya
    Hawley, Lauren
    Abel, Stuart
    Ng, Wee Han
    Srinivasan, Nitin
    Park, Justin
    Burch, Miguel
    Watson, Rabindra
    Liran, Omer
    Samakar, Kamran
    OBESITY SURGERY, 2023, 33 (06) : 1790 - 1796
  • [34] Assessing the Responses of Large Language Models (ChatGPT-4, Gemini, and Microsoft Copilot) to Frequently Asked Questions in Breast Imaging: A Study on Readability and Accuracy
    Tepe, Murat
    Emekli, Emre
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (05)
  • [35] Evaluating Chatbot Efficacy for Answering Frequently Asked Questions in Plastic Surgery: A ChatGPT Case Study Focused on Breast Augmentation
    Seth, Ishith
    Cox, Aram
    Xie, Yi
    Bulloch, Gabriella
    Hunter-Smith, David J.
    Rozen, Warren M.
    Ross, Richard J.
    AESTHETIC SURGERY JOURNAL, 2023, 43 (10) : 1126 - 1135
  • [36] Do ChatGPT and Google differ in answers to commonly asked patient questions regarding total shoulder and total elbow arthroplasty?
    Tharakan, Shebin
    Klein, Brandon
    Bartlett, Lucas
    Atlas, Aaron
    Parada, Stephen A.
    Cohn, Randy M.
    JOURNAL OF SHOULDER AND ELBOW SURGERY, 2024, 33 (08) : e429 - e437
  • [37] Assessing ChatGPT responses to common patient questions regarding total ankle arthroplasty
    Artioli, Elena
    Veronesi, Francesca
    Mazzotti, Antonio
    Brogini, Silvia
    Zielli, Simone Ottavio
    Giavaresi, Gianluca
    Faldini, Cesare
    JOURNAL OF EXPERIMENTAL ORTHOPAEDICS, 2025, 12 (01)
  • [38] Evaluating the Accuracy of ChatGPT in Common Patient Questions Regarding HPV+ Oropharyngeal Carcinoma
    Bellamkonda, Nikhil
    Farlow, Janice L.
    Haring, Catherine T.
    Sim, Michael W.
    Seim, Nolan B.
    Cannon, Richard B.
    Monroe, Marcus M.
    Agrawal, Amit
    Rocco, James W.
    McCrary, Hilary C.
    ANNALS OF OTOLOGY RHINOLOGY AND LARYNGOLOGY, 2024, 133 (09) : 814 - 819
  • [39] Assessing ChatGPT Responses to Common Patient Questions Regarding Total Hip Arthroplasty
    Mika, Aleksander P.
    Martin, J. Ryan
    Engstrom, Stephen M.
    Polkowski, Gregory G.
    Wilson, Jacob M.
    JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 2023, 105 (19) : 1519 - 1526
  • [40] Response to comments on "ChatGPT and frequently asked patient questions for upper eyelid blepharoplasty surgery"
    Maeng, Michelle M.
    Tenzel, Phillip A.
    ORBIT-THE INTERNATIONAL JOURNAL ON ORBITAL DISORDERS-OCULOPLASTIC AND LACRIMAL SURGERY, 2025,