Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery

被引：3

作者：

Villarreal-Espinosa, Juan Bernardo ^{[1
]}

Berreta, Rodrigo Saad ^{[1
]}

Allende, Felicitas ^{[1
]}

Garcia, Jose Rafael ^{[1
]}

Ayala, Salvador ^{[1
]}

Familiari, Filippo ^{[2
]}

Chahla, Jorge ^{[1
]}

机构：

[1] Rush Univ, Med Ctr, Dept Orthoped, 1620 W Harrison St, Chicago, IL 60612 USA

[2] Magna Graecia Univ Catanzaro, Catanzaro, Italy

来源：

KNEE | 2024年 / 51卷

关键词：

ChatGPT; AI; ACL surgery; Frequently asked questions (FAQs); Patient education; MANAGEMENT;

D O I：

10.1016/j.knee.2024.08.014

中图分类号：

R826.8 [整形外科学]; R782.2 [口腔颌面部整形外科学]; R726.2 [小儿整形外科学]; R62 [整形外科学（修复外科学）];

学科分类号：

摘要：

Background: The emergence of artificial intelligence (AI) has allowed users to have access to large sources of information in a chat-like manner. Thereby, we sought to evaluate ChatGPT-4 response's accuracy to the 10 patient most frequently asked questions (FAQs) regarding anterior cruciate ligament (ACL) surgery. Methods: A list of the top 10 FAQs pertaining to ACL surgery was created after conducting a search through all Sports Medicine Fellowship Institutions listed on the Arthroscopy Association of North America (AANA) and American Orthopaedic Society of Sports Medicine (AOSSM) websites. A Likert scale was used to grade response accuracy by two sports medicine fellowship-trained surgeons. Cohen's kappa was used to assess interrater agreement. Reproducibility of the responses over time was also assessed. Results: Five of the 10 responses received a 'completely accurate' grade by two-fellowship trained surgeons with three additional replies receiving a 'completely accurate' status by at least one. Moreover, inter-rater reliability accuracy assessment revealed a moderate agreement between fellowship-trained attending physicians (weighted kappa = 0.57, 95% confidence interval 0.15-0.99). Additionally, 80% of the responses were reproducible over time. Conclusion: ChatGPT can be considered an accurate additional tool to answer general patient questions regarding ACL surgery. None the less, patient-surgeon interaction should not be deferred and must continue to be the driving force for information retrieval. Thus, the general recommendation is to address any questions in the presence of a qualified specialist. (c) 2024 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.

引用

页码：84 / 92

页数：9

共 50 条

[31] Dr. Google vs. Dr. ChatGPT: Exploring the Use of Artificial Intelligence in Ophthalmology by Comparing the Accuracy, Safety, and Readability of Responses to Frequently Asked Patient Questions Regarding Cataracts and Cataract Surgery
Cohen, Samuel A.
Brant, Arthur
Fisher, Ann Caroline
Pershing, Suzann
Do, Diana
Pan, Carolyn
SEMINARS IN OPHTHALMOLOGY, 2024, 39 (06) : 472 - 479
[32] Evaluation of the quality and readability of ChatGPT responses to frequently asked questions about myopia in traditional Chinese language
Chang, Li-Chun
Sun, Chi-Chin
Chen, Ting-Han
Tsai, Der-Chong
Lin, Hui-Ling
Liao, Li-Ling
DIGITAL HEALTH, 2024, 10
[33] Acceptability and readability of ChatGPT-4 based responses for frequently asked questions about strabismus and amblyopia
Guven, S.
Ayyildiz, B.
JOURNAL FRANCAIS D OPHTALMOLOGIE, 2025, 48 (03):
[34] ChatGPT Responses to Frequently Asked Questions on Meniere's Disease: A Comparison to Clinical Practice Guideline Answers
Ho, Rebecca A.
Shaari, Ariana L.
Cowan, Paul T.
Yan, Kenneth
OTO OPEN, 2024, 8 (03)
[35] Reply to "Assessing the Accuracy of Responses by the Language Model ChatGPT to Questions Regarding Bariatric Surgery: a Critical Appraisal"
Samaan, Jamil S.
Yeo, Yee Hui
Rajeev, Nithya
Ng, Wee Han
Srinivasan, Nitin
Samakar, Kamran
OBESITY SURGERY, 2023, 33 (08) : 2590 - 2591
[36] Reply to “Assessing the Accuracy of Responses by the Language Model ChatGPT to Questions Regarding Bariatric Surgery: a Critical Appraisal”
Jamil S. Samaan
Yee Hui Yeo
Nithya Rajeev
Wee Han Ng
Nitin Srinivasan
Kamran Samakar
Obesity Surgery, 2023, 33 : 2590 - 2591
[37] Assessment of the Responses of the Artificial Intelligence-based Chatbot ChatGPT-4 to Frequently Asked Questions About Amblyopia and Childhood Myopia
Nikdel, Mojgan
Ghadimi, Hadi
Tavakoli, Mehdi
Suh, Donny W.
JOURNAL OF PEDIATRIC OPHTHALMOLOGY & STRABISMUS, 2024, 61 (02) : 86 - 89
[38] Frequently Asked Questions of Potential Bariatric Surgery Candidates
Krzyzanowski, S.
Kim, K.
Buffington, C.
OBESITY SURGERY, 2013, 23 (08) : 1038 - 1038
[39] Anterior cruciate ligament surgery
Provencher, Matthew T.
ORTHOPEDICS, 2008, 31 (06) : 561 - 564
[40] Assessing ChatGPT Ability to Answer Frequently Asked Questions About Essential Tremor
Sorrentino, Cristiano
Canoro, Vincenzo
Russo, Maria
Giordano, Caterina
Barone, Paolo
Erro, Roberto
TREMOR AND OTHER HYPERKINETIC MOVEMENTS, 2024, 14 : 1 - 10

← 1 2 3 4 5 →