Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery

Cited by: 3
Authors
Villarreal-Espinosa, Juan Bernardo [1]
Berreta, Rodrigo Saad [1]
Allende, Felicitas [1]
Garcia, Jose Rafael [1]
Ayala, Salvador [1]
Familiari, Filippo [2]
Chahla, Jorge [1]
Affiliations
[1] Rush Univ, Med Ctr, Dept Orthoped, 1620 W Harrison St, Chicago, IL 60612 USA
[2] Magna Graecia Univ Catanzaro, Catanzaro, Italy
Source
KNEE | 2024 / Volume 51
Keywords
ChatGPT; AI; ACL surgery; Frequently asked questions (FAQs); Patient education; MANAGEMENT;
DOI
10.1016/j.knee.2024.08.014
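Chinese Library Classification number
R826.8 [Plastic surgery]; R782.2 [Oral and maxillofacial plastic surgery]; R726.2 [Pediatric plastic surgery]; R62 [Plastic surgery (reconstructive surgery)];
Discipline classification number
Abstract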
Background: The emergence of artificial intelligence (AI) has given users access to large sources of information in a chat-like manner. We therefore sought to evaluate the accuracy of ChatGPT-4 responses to the 10 questions most frequently asked by patients (FAQs) regarding anterior cruciate ligament (ACL) surgery.
Methods: A list of the top 10 FAQs pertaining to ACL surgery was compiled after a search of all Sports Medicine Fellowship Institutions listed on the Arthroscopy Association of North America (AANA) and American Orthopaedic Society for Sports Medicine (AOSSM) websites. Two sports medicine fellowship-trained surgeons graded response accuracy on a Likert scale. Cohen's kappa was used to assess inter-rater agreement. Reproducibility of the responses over time was also assessed.
Results: Five of the 10 responses were graded 'completely accurate' by both fellowship-trained surgeons, and three additional responses were graded 'completely accurate' by at least one. Moreover, assessment of inter-rater reliability revealed moderate agreement between the fellowship-trained attending physicians (weighted kappa = 0.57, 95% confidence interval 0.15-0.99). Additionally, 80% of the responses were reproducible over time.
Conclusion: ChatGPT can be considered an accurate additional tool for answering general patient questions regarding ACL surgery. Nonetheless, patient-surgeon interaction should not be deferred and must continue to be the driving force for information retrieval. Thus, the general recommendation is to address any questions in the presence of a qualified specialist.
(c) 2024 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
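The abstract reports a weighted Cohen's kappa for two raters grading on a Likert scale but does not state the weighting scheme or the number of scale points. The following is a minimal sketch of how such a statistic can be computed, assuming linear or quadratic disagreement weights; the function name `weighted_kappa`, the 4-point scale, and the example ratings are all hypothetical and are not the study's data or method.

```python
def weighted_kappa(rater_a, rater_b, categories, weights="linear"):
    """Weighted Cohen's kappa for two raters on an ordinal scale.

    `categories` lists the scale points in order. `weights` is "linear"
    or "quadratic" (an assumption: the abstract does not say which was used).
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must grade the same non-empty set of items")
    if weights not in ("linear", "quadratic"):
        raise ValueError("weights must be 'linear' or 'quadratic'")
    k = len(categories)
    if k < 2:
        raise ValueError("at least two categories are required")

    index = {c: i for i, c in enumerate(categories)}
    n = len(rater_a)

    # Observed joint proportions: observed[i][j] is the share of items
    # graded category i by rater A and category j by rater B.
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1.0 / n

    # Marginal proportions for each rater.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    def disagreement(i, j):
        # 0 on the diagonal, 1 for the two most distant categories.
        d = abs(i - j) / (k - 1)
        return d if weights == "linear" else d * d

    observed_dis = sum(
        disagreement(i, j) * observed[i][j] for i in range(k) for j in range(k)
    )
    expected_dis = sum(
        disagreement(i, j) * row[i] * col[j] for i in range(k) for j in range(k)
    )
    if expected_dis == 0:
        raise ValueError("kappa is undefined when both raters use one identical category")
    return 1.0 - observed_dis / expected_dis


if __name__ == "__main__":
    # Hypothetical grades for 10 responses on an assumed 4-point scale
    # (1 = completely inaccurate ... 4 = completely accurate).
    surgeon_1 = [4, 4, 4, 4, 4, 4, 3, 3, 3, 2]
    surgeon_2 = [4, 4, 4, 4, 4, 3, 4, 4, 3, 3]
    print(weighted_kappa(surgeon_1, surgeon_2, categories=[1, 2, 3, 4]))
```

A kappa of 1 indicates perfect agreement and 0 indicates agreement no better than chance. With only 10 rated items the estimate is imprecise, which is consistent with the wide confidence interval reported in the abstract.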
Pages: 84-92
Number of pages: 9
Related papers
50 in total
  • [31] Dr. Google vs. Dr. ChatGPT: Exploring the Use of Artificial Intelligence in Ophthalmology by Comparing the Accuracy, Safety, and Readability of Responses to Frequently Asked Patient Questions Regarding Cataracts and Cataract Surgery
    Cohen, Samuel A.
    Brant, Arthur
    Fisher, Ann Caroline
    Pershing, Suzann
    Do, Diana
    Pan, Carolyn
    SEMINARS IN OPHTHALMOLOGY, 2024, 39 (06) : 472 - 479
  • [32] Evaluation of the quality and readability of ChatGPT responses to frequently asked questions about myopia in traditional Chinese language
    Chang, Li-Chun
    Sun, Chi-Chin
    Chen, Ting-Han
    Tsai, Der-Chong
    Lin, Hui-Ling
    Liao, Li-Ling
    DIGITAL HEALTH, 2024, 10
  • [33] Acceptability and readability of ChatGPT-4 based responses for frequently asked questions about strabismus and amblyopia
    Guven, S.
    Ayyildiz, B.
    JOURNAL FRANCAIS D OPHTALMOLOGIE, 2025, 48 (03):
  • [34] ChatGPT Responses to Frequently Asked Questions on Meniere's Disease: A Comparison to Clinical Practice Guideline Answers
    Ho, Rebecca A.
    Shaari, Ariana L.
    Cowan, Paul T.
    Yan, Kenneth
    OTO OPEN, 2024, 8 (03)
  • [35] Reply to "Assessing the Accuracy of Responses by the Language Model ChatGPT to Questions Regarding Bariatric Surgery: a Critical Appraisal"
    Samaan, Jamil S.
    Yeo, Yee Hui
    Rajeev, Nithya
    Ng, Wee Han
    Srinivasan, Nitin
    Samakar, Kamran
    OBESITY SURGERY, 2023, 33 (08) : 2590 - 2591
  • [36] Reply to “Assessing the Accuracy of Responses by the Language Model ChatGPT to Questions Regarding Bariatric Surgery: a Critical Appraisal”
    Jamil S. Samaan
    Yee Hui Yeo
    Nithya Rajeev
    Wee Han Ng
    Nitin Srinivasan
    Kamran Samakar
    Obesity Surgery, 2023, 33 : 2590 - 2591
  • [37] Assessment of the Responses of the Artificial Intelligence-based Chatbot ChatGPT-4 to Frequently Asked Questions About Amblyopia and Childhood Myopia
    Nikdel, Mojgan
    Ghadimi, Hadi
    Tavakoli, Mehdi
    Suh, Donny W.
    JOURNAL OF PEDIATRIC OPHTHALMOLOGY & STRABISMUS, 2024, 61 (02) : 86 - 89
  • [38] Frequently Asked Questions of Potential Bariatric Surgery Candidates
    Krzyzanowski, S.
    Kim, K.
    Buffington, C.
    OBESITY SURGERY, 2013, 23 (08) : 1038 - 1038
  • [39] Anterior cruciate ligament surgery
    Provencher, Matthew T.
    ORTHOPEDICS, 2008, 31 (06) : 561 - 564
  • [40] Assessing ChatGPT Ability to Answer Frequently Asked Questions About Essential Tremor
    Sorrentino, Cristiano
    Canoro, Vincenzo
    Russo, Maria
    Giordano, Caterina
    Barone, Paolo
    Erro, Roberto
    TREMOR AND OTHER HYPERKINETIC MOVEMENTS, 2024, 14 : 1 - 10