Artificial intelligence large language model scores highly on focused practice designation in metabolic and bariatric surgery board practice questions

被引:0
|
作者
Sanders, A. [1 ,4 ]
Lim, R. [2 ]
Jones, D. [3 ]
Vosburg, R. W. [4 ]
机构
[1] Beth Israel Deaconess Med Ctr, Dept Surg, Boston, MA USA
[2] Atrium Hlth, Charlotte, NC USA
[3] Rutgers New Jersey Med Sch, Dept Surg, Newark, NJ USA
[4] Grand Strand Med Ctr, Dept Surg, Myrtle Beach, SC 29572 USA
关键词
Artificial intelligence; AI; ChatGPT; Metabolic and bariatric surgery; Exam; PERFORMANCE; GPT-4;
D O I
10.1007/s00464-024-11267-y
中图分类号
R61 [外科手术学];
学科分类号
摘要
BackgroundArtificial intelligence models such as ChatGPT (Open AI) have performed well on the exams of various medical and surgical fields. It is not yet known how ChatGPT performs on similar metabolic and bariatric surgery (MBS) questions.ObjectiveAssess the performance of ChatGPT on Focused Practice Designation in Metabolic and Bariatric Surgery board-style questions.SettingUnited States.MethodsQuestions obtained from the largest commercially available bank of FPD-MBS practice questions were entered into ChatGPT-4, as is, without prior training. We assessed the overall percentage correct as well as the percentage correct within each of the five American Board of Surgery (ABS) question categories. One-way ANOVA was used to determine if the frequency of correct answers differed between categories.ResultsOut of 255 questions, ChatGPT-4 correctly answered 189 (74.1%). Between the five question categories there was no difference between the frequency of correct answers (p = 0.22). It did not matter if questions were entered individually or in groups of up to 10.ConclusionWithout prior training, ChatGPT-4 scored highly when evaluated on the largest practice question bank for the FPD-MBS exam.
引用
收藏
页码:6678 / 6681
页数:4
相关论文
共 26 条
  • [1] Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination
    Beam, Kristyn
    Sharma, Puneet
    Kumar, Bhawesh
    Wang, Cindy
    Brodsky, Dara
    Martin, Camilia R.
    Beam, Andrew
    JAMA PEDIATRICS, 2023, 177 (09) : 977 - 979
  • [2] Performance of large language model artificial intelligence on dermatology board exam questions
    Park, Lily
    Ehlert, Brittany
    Susla, Lyudmyla
    Lum, Zachary C.
    Lee, Patrick K.
    CLINICAL AND EXPERIMENTAL DERMATOLOGY, 2023, 49 (07) : 733 - 734
  • [3] Large Language Model Performance on Practice Epilepsy Board Examinations
    Habib, Sara
    Butt, Haroon
    Goldenholz, Shira R.
    Chang, Chi Yuan
    Goldenholz, Daniel M.
    JAMA NEUROLOGY, 2024, 81 (06) : 660 - 661
  • [4] The Artificial intelligence large language models and neuropsychiatry practice and research ethic
    Zhong, Yi
    Chen, Yu-jun
    Zhou, Yang
    Lyu, Yan-Ao-Hai
    Yin, Jia-Jun
    Gao, Yu-jun
    ASIAN JOURNAL OF PSYCHIATRY, 2023, 84
  • [5] Artificial Intelligence for Anesthesiology Board-Style Examination Questions: Role of Large Language Models
    Khan, Adnan A.
    Yunus, Rayaan
    Sohail, Mahad
    Rehman, Taha A.
    Saeed, Shirin
    Bu, Yifan
    Jackson, Cullen D.
    Sharkey, Aidan
    Mahmood, Feroze
    Matyal, Robina
    JOURNAL OF CARDIOTHORACIC AND VASCULAR ANESTHESIA, 2024, 38 (05) : 1251 - 1259
  • [6] Performance of "Bard", Google's Artificial Intelligence Chatbot, on Ophthalmology Board Exam Practice Questions
    Botross, Monica
    Mohammadi, Seyed Omid
    Montgomery, Kendall
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [7] Performance of artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in the American Society for Metabolic and Bariatric Surgery textbook of bariatric surgery questions
    Lee, Yung
    Brar, Karanbir
    Malone, Sarah
    Jin, David
    McKechnie, Tyler
    Jung, James J.
    Kroh, Matthew
    Dang, Jerry T.
    SURGERY FOR OBESITY AND RELATED DISEASES, 2024, 20 (07) : 609 - 613
  • [8] Clinical Science and Practice in the Age of Large Language Models and Generative Artificial Intelligence
    Schueller, Stephen M.
    Morris, Robert R.
    JOURNAL OF CONSULTING AND CLINICAL PSYCHOLOGY, 2023, 91 (10) : 559 - 561
  • [9] Artificial intelligence and large language models in palliative medicine clinical practice and education
    Taubert, Mark
    Hackett, Robyn
    Tavabie, Simon
    BMJ SUPPORTIVE & PALLIATIVE CARE, 2024,
  • [10] Evaluation of responses to cardiac imaging questions by the artificial intelligence large language model ChatGPT
    Monroe, Cynthia L.
    Abdelhafez, Yasser G.
    Atsina, Kwame
    Aman, Edris
    Nardo, Lorenzo
    Madani, Mohammad H.
    CLINICAL IMAGING, 2024, 112