Evaluating the accuracy and relevance of ChatGPT responses to frequently asked questions regarding total knee replacement

被引:7
|
作者
Zhang, Siyuan [1 ]
Liau, Zi Qiang Glen [1 ]
Tan, Kian Loong Melvin [1 ]
Chua, Wei Liang [1 ]
机构
[1] Natl Univ Hlth Syst, Dept Orthopaed Surg, Level 11,NUHS Tower Block,1E Kent Ridge Rd, Singapore 119228, Singapore
关键词
ChatGPT; Artificial intelligence; Chatbot; Large language model; Total knee replacement; Total knee arthroplasty; ARTHROPLASTY;
D O I
10.1186/s43019-024-00218-5
中图分类号
R826.8 [整形外科学]; R782.2 [口腔颌面部整形外科学]; R726.2 [小儿整形外科学]; R62 [整形外科学(修复外科学)];
学科分类号
摘要
Background Chat Generative Pretrained Transformer (ChatGPT), a generative artificial intelligence chatbot, may have broad applications in healthcare delivery and patient education due to its ability to provide human-like responses to a wide range of patient queries. However, there is limited evidence regarding its ability to provide reliable and useful information on orthopaedic procedures. This study seeks to evaluate the accuracy and relevance of responses provided by ChatGPT to frequently asked questions (FAQs) regarding total knee replacement (TKR).Methods A list of 50 clinically-relevant FAQs regarding TKR was collated. Each question was individually entered as a prompt to ChatGPT (version 3.5), and the first response generated was recorded. Responses were then reviewed by two independent orthopaedic surgeons and graded on a Likert scale for their factual accuracy and relevance. These responses were then classified into accurate versus inaccurate and relevant versus irrelevant responses using preset thresholds on the Likert scale.Results Most responses were accurate, while all responses were relevant. Of the 50 FAQs, 44/50 (88%) of ChatGPT responses were classified as accurate, achieving a mean Likert grade of 4.6/5 for factual accuracy. On the other hand, 50/50 (100%) of responses were classified as relevant, achieving a mean Likert grade of 4.9/5 for relevance.Conclusion ChatGPT performed well in providing accurate and relevant responses to FAQs regarding TKR, demonstrating great potential as a tool for patient education. However, it is not infallible and can occasionally provide inaccurate medical information. Patients and clinicians intending to utilize this technology should be mindful of its limitations and ensure adequate supervision and verification of information provided.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Can ChatGPT Answer Patient Questions Regarding Total Knee Arthroplasty?
    Mika, Aleksander P.
    Mulvey, Hillary E.
    Engstrom, Stephen M.
    Polkowski, Gregory G.
    Martin, J. Ryan
    Wilson, Jacob M.
    JOURNAL OF KNEE SURGERY, 2024, 37 (09) : 664 - 673
  • [22] Evaluation of the quality and readability of ChatGPT responses to frequently asked questions about myopia in traditional Chinese language
    Chang, Li-Chun
    Sun, Chi-Chin
    Chen, Ting-Han
    Tsai, Der-Chong
    Lin, Hui-Ling
    Liao, Li-Ling
    DIGITAL HEALTH, 2024, 10
  • [23] Acceptability and readability of ChatGPT-4 based responses for frequently asked questions about strabismus and amblyopia
    Guven, S.
    Ayyildiz, B.
    JOURNAL FRANCAIS D OPHTALMOLOGIE, 2025, 48 (03):
  • [24] ChatGPT Responses to Frequently Asked Questions on Meniere's Disease: A Comparison to Clinical Practice Guideline Answers
    Ho, Rebecca A.
    Shaari, Ariana L.
    Cowan, Paul T.
    Yan, Kenneth
    OTO OPEN, 2024, 8 (03)
  • [25] Language-adaptive artificial intelligence: assessing CHATGPT'S answer to frequently asked questions on total hip arthroplasty questions
    Ibrahim, Muhammad Talal
    Khaskheli, Sarah Ashraf
    Shahzad, Hania
    Noordin, Shahryar
    JOURNAL OF THE PAKISTAN MEDICAL ASSOCIATION, 2024, 74 (04) : S161 - S164
  • [26] Evaluating ChatGPT as a patient resource for frequently asked questions about lung cancer surgery-a pilot study
    Ferrari-Light, Dana
    Merritt, Robert E.
    D'Souza, Desmond
    Ferguson, Mark K.
    Harrison, Sebron
    Madariaga, Maria Lucia
    Lee, Benjamin E.
    Moffatt-Bruce, Susan D.
    Kneuertz, Peter J.
    JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2025, 169 (04):
  • [27] Assessing ChatGPT Ability to Answer Frequently Asked Questions About Essential Tremor
    Sorrentino, Cristiano
    Canoro, Vincenzo
    Russo, Maria
    Giordano, Caterina
    Barone, Paolo
    Erro, Roberto
    TREMOR AND OTHER HYPERKINETIC MOVEMENTS, 2024, 14 : 1 - 10
  • [28] Readability, reliability and quality of responses generated by ChatGPT, gemini, and perplexity for the most frequently asked questions about pain
    Ozduran, Erkan
    Akkoc, Ibrahim
    Buyukcoban, Sibel
    Erkin, Yueksel
    Hanci, Volkan
    MEDICINE, 2025, 104 (11)
  • [29] How good does ChatGPT answer frequently asked questions about haemophilia?
    Vandewyngaert, Caroline
    Iarossi, Michael
    Hermans, Cedric
    HAEMOPHILIA, 2023, 29 (06) : 1646 - 1648
  • [30] Comment on "ChatGPT and frequently asked patient questions for upper eyelid blepharoplasty surgery"
    Daungsupawong, Hinpetch
    Wiwanitkit, Viroj
    ORBIT-THE INTERNATIONAL JOURNAL ON ORBITAL DISORDERS-OCULOPLASTIC AND LACRIMAL SURGERY, 2025,