Evaluation of ChatGPT's Performance in the Turkish Board of Orthopaedic Surgery Examination

被引:0
|
作者
Yigitbay, Ahmet [1 ]
机构
[1] Siverek State Hosp, Clin Orthoped & Traumatol, Sanliurfa, Turkiye
来源
关键词
Artificial intelligence; humans; orthopedics; specialty boards; ARTIFICIAL-INTELLIGENCE;
D O I
10.4274/haseki.galenos.2024.10038
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aim: Technological advances lead to significant changes in education and evaluation processes in medicine. In particular, artificial intelligence and natural language processing developments offer new opportunities in the health sector. This article evaluates Chat Generative Pre-Trained Transformer's (ChatGPT) performance in the Turkish Orthopaedics and Traumatology Education Council (TOTEK) Qualifying Written Examination and its applicability. Methods: To evaluate ChatGPT's performance, TOTEK Qualifying Written Examination questions from the last five years were entered as data. The results of ChatGPT were assessed under four parameters and compared with the actual exam results. The results were analyzed statistically. Results: Of the 500 questions, 458 were used as data in this study. Chat Generative Pre-Trained Transformer scored 40.2%, 26.3%, 37.3%, 32.9%, and 35.8% in the 2019, 2020, 2021, 2022, and 2023 TOTEK Qualifying Written Examination, respectively. When the correct answer percentages of ChatGPT according to years and the simple linear regression model applied to these data were analyzed, it was determined that there was a slightly decreasing trend in the correct answer rates as the years progressed. ChatGPT's TOTEK Qualifying Written Examination performance showed a statistically significant difference from the actual exam results. It was observed that the correct answer percentage of ChatGPT was below the general average success scores of the exam for each year. Conclusions: This analysis of artificial intelligence's applicability in the field and its role in training processes is essential to assess ChatGPT's potential uses and limitations. Chat Generative Pre-Trained Transformer can be a training tool, especially for knowledgebased and logical questions on specific topics. Still, its current performance is not at a level that can replace human decision-making in specialized medical fields.
引用
收藏
页码:243 / 249
页数:7
相关论文
共 50 条
  • [41] Surgical Trends in Bankart Repair An Analysis of Data From the American Board of Orthopaedic Surgery Certification Examination
    Owens, Brett D.
    Harrast, John J.
    Hurwitz, Shepard R.
    Thompson, Terry L.
    Wolf, Jennifer Moriatis
    AMERICAN JOURNAL OF SPORTS MEDICINE, 2011, 39 (09): : 1865 - 1869
  • [42] Beyond human in neurosurgical exams: ChatGPT's success in the Turkish neurosurgical society proficiency board exams
    Sahin, Mustafa Caglar
    Sozer, Alperen
    Kuzucu, Pelin
    Turkmen, Tolga
    Sahin, Merve Buke
    Sozer, Ekin
    Tufek, Ozan Yavuz
    Nernekli, Kerem
    Emmez, Hakan
    Celtikci, Emrah
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
  • [43] Surgery Residency Curriculum Examination Scores Predict Future American Board of Surgery In-Training Examination Performance
    Webb, Travis P.
    Paul, Jasmeet
    Treat, Robert
    Codner, Panna
    Anderson, Rebecca
    Redlich, Philip
    JOURNAL OF SURGICAL EDUCATION, 2014, 71 (05) : 743 - 747
  • [44] Not the Last Word: ChatGPT Can't Perform Orthopaedic Surgery
    Bernstein, Joseph
    CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2023, 481 (04) : 651 - 655
  • [45] Factors affecting performance on the American Board of Surgery in-training examination
    Godellas, CV
    Huang, RW
    AMERICAN JOURNAL OF SURGERY, 2001, 181 (04): : 294 - 296
  • [46] Continuing medical education activity and American Board of Surgery Examination Performance
    Rhodes, RS
    Biesten, TW
    Ritchie, WP
    Malangoni, MA
    JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2003, 196 (04) : 604 - 609
  • [47] Scrutinizing ChatGPT's Performance in Assessing Surgical Knowledge: an Examination Study
    Reyaz, Anam
    Sohail, Shahab Saquib
    Ishaaq, Namria
    INDIAN JOURNAL OF SURGERY, 2024, 86 (04) : 843 - 844
  • [48] Evaluating the Performance of ChatGPT at Breast Tumor Board
    Xu, Y.
    Logie, N.
    Phan, T.
    Barbera, L.
    Nordal, R. A.
    Stosky, J. M.
    Lee, S. L.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2023, 117 (02): : E493 - E493
  • [49] EVALUATING THE PERFORMANCE OF CHATGPT AT BREAST TUMOUR BOARD
    Xu, Yang
    Logie, Natalie
    Phan, Tien
    Barbera, Lisa
    Nordal, Robert
    Stosky, Jordan
    Lee, Sangjune
    RADIOTHERAPY AND ONCOLOGY, 2023, 186 : S77 - S77
  • [50] ChatGPT Earns American Board Certification in Hand Surgery
    Ghanem, Diane
    Nassar, Joseph E.
    El Bachour, Joseph
    Hanna, Tammam
    HAND SURGERY & REHABILITATION, 2024, 43 (03):