Evaluating the accuracy and relevance of ChatGPT responses to frequently asked questions regarding total knee replacement

被引:7
|
作者
Zhang, Siyuan [1 ]
Liau, Zi Qiang Glen [1 ]
Tan, Kian Loong Melvin [1 ]
Chua, Wei Liang [1 ]
机构
[1] Natl Univ Hlth Syst, Dept Orthopaed Surg, Level 11,NUHS Tower Block,1E Kent Ridge Rd, Singapore 119228, Singapore
关键词
ChatGPT; Artificial intelligence; Chatbot; Large language model; Total knee replacement; Total knee arthroplasty; ARTHROPLASTY;
D O I
10.1186/s43019-024-00218-5
中图分类号
R826.8 [整形外科学]; R782.2 [口腔颌面部整形外科学]; R726.2 [小儿整形外科学]; R62 [整形外科学(修复外科学)];
学科分类号
摘要
Background Chat Generative Pretrained Transformer (ChatGPT), a generative artificial intelligence chatbot, may have broad applications in healthcare delivery and patient education due to its ability to provide human-like responses to a wide range of patient queries. However, there is limited evidence regarding its ability to provide reliable and useful information on orthopaedic procedures. This study seeks to evaluate the accuracy and relevance of responses provided by ChatGPT to frequently asked questions (FAQs) regarding total knee replacement (TKR).Methods A list of 50 clinically-relevant FAQs regarding TKR was collated. Each question was individually entered as a prompt to ChatGPT (version 3.5), and the first response generated was recorded. Responses were then reviewed by two independent orthopaedic surgeons and graded on a Likert scale for their factual accuracy and relevance. These responses were then classified into accurate versus inaccurate and relevant versus irrelevant responses using preset thresholds on the Likert scale.Results Most responses were accurate, while all responses were relevant. Of the 50 FAQs, 44/50 (88%) of ChatGPT responses were classified as accurate, achieving a mean Likert grade of 4.6/5 for factual accuracy. On the other hand, 50/50 (100%) of responses were classified as relevant, achieving a mean Likert grade of 4.9/5 for relevance.Conclusion ChatGPT performed well in providing accurate and relevant responses to FAQs regarding TKR, demonstrating great potential as a tool for patient education. However, it is not infallible and can occasionally provide inaccurate medical information. Patients and clinicians intending to utilize this technology should be mindful of its limitations and ensure adequate supervision and verification of information provided.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Evaluating the accuracy and relevance of ChatGPT responses to frequently asked questions regarding total knee replacement
    Siyuan Zhang
    Zi Qiang Glen Liau
    Kian Loong Melvin Tan
    Wei Liang Chua
    Knee Surgery & Related Research, 36
  • [2] Evaluating ChatGPT responses to frequently asked patient questions regarding periprosthetic joint infection after total hip and knee arthroplasty
    Hu, Xiaojun
    Niemann, Marcel
    Kienzle, Arne
    Braun, Karl
    Back, David Alexander
    Gwinner, Clemens
    Renz, Nora
    Stoeckle, Ulrich
    Trampuz, Andrej
    Meller, Sebastian
    DIGITAL HEALTH, 2024, 10
  • [3] Frequently asked questions regarding total knee arthroplasty
    Scott, RD
    Erens, GA
    ORTHOPEDICS, 2004, 27 (09) : 977 - 979
  • [4] EVALUATING CHATGPT'S RESPONSES TO FREQUENTLY ASKED QUESTIONS REGARDING POLYCYSTIC OVARY SYNDROME.
    Pace, Lauren
    Kummer, Nicholas
    Bril, Fernando
    Hosseinzadeh, Pardis
    Azziz, Ricardo
    FERTILITY AND STERILITY, 2024, 122 (01) : E55 - E55
  • [5] Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery
    Villarreal-Espinosa, Juan Bernardo
    Berreta, Rodrigo Saad
    Allende, Felicitas
    Garcia, Jose Rafael
    Ayala, Salvador
    Familiari, Filippo
    Chahla, Jorge
    KNEE, 2024, 51 : 84 - 92
  • [6] ChatGPT is capable of providing satisfactory responses to frequently asked questions regarding total shoulder arthroplasty
    Yeramosu, Teja
    Johns, William L.
    Onor, Gabriel
    Menendez, Mariano E.
    Namdari, Surena
    Hammoud, Sommer
    SHOULDER & ELBOW, 2024, 16 (04) : 407 - 412
  • [7] ChatGPT Responses to Frequently Asked Questions Regarding Sexually Transmitted Diseases: Considerations
    Kleebayoon, Amnuay
    Wiwanitkit, Viroj
    SEXUALLY TRANSMITTED DISEASES, 2025, 52 (03) : 193 - 193
  • [8] Qualitatively Assessing ChatGPT Responses to Frequently Asked Questions Regarding Sexually Transmitted Diseases
    Moothedan, Elijah
    Jhumkhawala, Vama
    Burgoa, Sara
    Martinez, Lisa
    Sacca, Lea
    SEXUALLY TRANSMITTED DISEASES, 2025, 52 (03) : 188 - 192
  • [9] ChatGPT Provides Unsatisfactory Responses to Frequently Asked Questions Regarding Anterior Cruciate Ligament Reconstruction
    Johns, William L.
    Martinazzi, Brandon J.
    Miltenberg, Benjamin
    Nam, Hannah H.
    Hammoud, Sommer
    ARTHROSCOPY-THE JOURNAL OF ARTHROSCOPIC AND RELATED SURGERY, 2024, 40 (07): : 2067 - 2079.e1
  • [10] Response to "Qualitatively Assessing ChatGPT Responses to Frequently Asked Questions Regarding Sexually Transmitted Diseases: Considerations"
    Moothedan, Elijah
    Jhumkhawala, Vama
    Burgoa, Sara
    Martinez, Lisa
    Sacca, Lea
    SEXUALLY TRANSMITTED DISEASES, 2025, 52 (04) : e11 - e11