Artificial intelligence large language model scores highly on focused practice designation in metabolic and bariatric surgery board practice questions

被引：0

作者：

Sanders, A. ^{[1
,4
]}

Lim, R. ^{[2
]}

Jones, D. ^{[3
]}

Vosburg, R. W. ^{[4
]}

机构：

[1] Beth Israel Deaconess Med Ctr, Dept Surg, Boston, MA USA

[2] Atrium Hlth, Charlotte, NC USA

[3] Rutgers New Jersey Med Sch, Dept Surg, Newark, NJ USA

[4] Grand Strand Med Ctr, Dept Surg, Myrtle Beach, SC 29572 USA

来源：

SURGICAL ENDOSCOPY AND OTHER INTERVENTIONAL TECHNIQUES | 2024年 / 38卷 / 11期

关键词：

Artificial intelligence; AI; ChatGPT; Metabolic and bariatric surgery; Exam; PERFORMANCE; GPT-4;

D O I：

10.1007/s00464-024-11267-y

中图分类号：

R61 [外科手术学];

学科分类号：

摘要：

BackgroundArtificial intelligence models such as ChatGPT (Open AI) have performed well on the exams of various medical and surgical fields. It is not yet known how ChatGPT performs on similar metabolic and bariatric surgery (MBS) questions.ObjectiveAssess the performance of ChatGPT on Focused Practice Designation in Metabolic and Bariatric Surgery board-style questions.SettingUnited States.MethodsQuestions obtained from the largest commercially available bank of FPD-MBS practice questions were entered into ChatGPT-4, as is, without prior training. We assessed the overall percentage correct as well as the percentage correct within each of the five American Board of Surgery (ABS) question categories. One-way ANOVA was used to determine if the frequency of correct answers differed between categories.ResultsOut of 255 questions, ChatGPT-4 correctly answered 189 (74.1%). Between the five question categories there was no difference between the frequency of correct answers (p = 0.22). It did not matter if questions were entered individually or in groups of up to 10.ConclusionWithout prior training, ChatGPT-4 scored highly when evaluated on the largest practice question bank for the FPD-MBS exam.

引用

页码：6678 / 6681

页数：4

共 26 条

[1] Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination
Beam, Kristyn
Sharma, Puneet
Kumar, Bhawesh
Wang, Cindy
Brodsky, Dara
Martin, Camilia R.
Beam, Andrew
JAMA PEDIATRICS, 2023, 177 (09) : 977 - 979
[2] Performance of large language model artificial intelligence on dermatology board exam questions
Park, Lily
Ehlert, Brittany
Susla, Lyudmyla
Lum, Zachary C.
Lee, Patrick K.
CLINICAL AND EXPERIMENTAL DERMATOLOGY, 2023, 49 (07) : 733 - 734
[3] Large Language Model Performance on Practice Epilepsy Board Examinations
Habib, Sara
Butt, Haroon
Goldenholz, Shira R.
Chang, Chi Yuan
Goldenholz, Daniel M.
JAMA NEUROLOGY, 2024, 81 (06) : 660 - 661
[4] The Artificial intelligence large language models and neuropsychiatry practice and research ethic
Zhong, Yi
Chen, Yu-jun
Zhou, Yang
Lyu, Yan-Ao-Hai
Yin, Jia-Jun
Gao, Yu-jun
ASIAN JOURNAL OF PSYCHIATRY, 2023, 84
[5] Artificial Intelligence for Anesthesiology Board-Style Examination Questions: Role of Large Language Models
Khan, Adnan A.
Yunus, Rayaan
Sohail, Mahad
Rehman, Taha A.
Saeed, Shirin
Bu, Yifan
Jackson, Cullen D.
Sharkey, Aidan
Mahmood, Feroze
Matyal, Robina
JOURNAL OF CARDIOTHORACIC AND VASCULAR ANESTHESIA, 2024, 38 (05) : 1251 - 1259
[6] Performance of "Bard", Google's Artificial Intelligence Chatbot, on Ophthalmology Board Exam Practice Questions
Botross, Monica
Mohammadi, Seyed Omid
Montgomery, Kendall
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
[7] Performance of artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in the American Society for Metabolic and Bariatric Surgery textbook of bariatric surgery questions
Lee, Yung
Brar, Karanbir
Malone, Sarah
Jin, David
McKechnie, Tyler
Jung, James J.
Kroh, Matthew
Dang, Jerry T.
SURGERY FOR OBESITY AND RELATED DISEASES, 2024, 20 (07) : 609 - 613
[8] Clinical Science and Practice in the Age of Large Language Models and Generative Artificial Intelligence
Schueller, Stephen M.
Morris, Robert R.
JOURNAL OF CONSULTING AND CLINICAL PSYCHOLOGY, 2023, 91 (10) : 559 - 561
[9] Artificial intelligence and large language models in palliative medicine clinical practice and education
Taubert, Mark
Hackett, Robyn
Tavabie, Simon
BMJ SUPPORTIVE & PALLIATIVE CARE, 2024,
[10] Evaluation of responses to cardiac imaging questions by the artificial intelligence large language model ChatGPT
Monroe, Cynthia L.
Abdelhafez, Yasser G.
Atsina, Kwame
Aman, Edris
Nardo, Lorenzo
Madani, Mohammad H.
CLINICAL IMAGING, 2024, 112

← 1 2 3 →