Evaluation of ChatGPT's Performance in the Turkish Board of Orthopaedic Surgery Examination

被引:0
|
作者
Yigitbay, Ahmet [1 ]
机构
[1] Siverek State Hosp, Clin Orthoped & Traumatol, Sanliurfa, Turkiye
来源
关键词
Artificial intelligence; humans; orthopedics; specialty boards; ARTIFICIAL-INTELLIGENCE;
D O I
10.4274/haseki.galenos.2024.10038
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aim: Technological advances lead to significant changes in education and evaluation processes in medicine. In particular, artificial intelligence and natural language processing developments offer new opportunities in the health sector. This article evaluates Chat Generative Pre-Trained Transformer's (ChatGPT) performance in the Turkish Orthopaedics and Traumatology Education Council (TOTEK) Qualifying Written Examination and its applicability. Methods: To evaluate ChatGPT's performance, TOTEK Qualifying Written Examination questions from the last five years were entered as data. The results of ChatGPT were assessed under four parameters and compared with the actual exam results. The results were analyzed statistically. Results: Of the 500 questions, 458 were used as data in this study. Chat Generative Pre-Trained Transformer scored 40.2%, 26.3%, 37.3%, 32.9%, and 35.8% in the 2019, 2020, 2021, 2022, and 2023 TOTEK Qualifying Written Examination, respectively. When the correct answer percentages of ChatGPT according to years and the simple linear regression model applied to these data were analyzed, it was determined that there was a slightly decreasing trend in the correct answer rates as the years progressed. ChatGPT's TOTEK Qualifying Written Examination performance showed a statistically significant difference from the actual exam results. It was observed that the correct answer percentage of ChatGPT was below the general average success scores of the exam for each year. Conclusions: This analysis of artificial intelligence's applicability in the field and its role in training processes is essential to assess ChatGPT's potential uses and limitations. Chat Generative Pre-Trained Transformer can be a training tool, especially for knowledgebased and logical questions on specific topics. Still, its current performance is not at a level that can replace human decision-making in specialized medical fields.
引用
收藏
页码:243 / 249
页数:7
相关论文
共 50 条
  • [21] Commentary on: Performance of ChatGPT on the Plastic Surgery Inservice Training Examination
    Cevallos, Priscila C.
    Nazerali, Rahim S.
    AESTHETIC SURGERY JOURNAL, 2023, 43 (12) : NP1083 - NP1084
  • [22] American board of orthopaedic surgery practice of the orthopaedic surgeon: Part-II, certification examination case mix
    Garrett, WE
    Swiontkowski, MF
    Weinstein, JN
    Callaghan, J
    Rosier, RN
    Berry, DJ
    Harrast, J
    DeRosa, GP
    JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 2006, 88A (03): : 660 - 667
  • [23] Is ChatGPT able to pass the first part of the European Board of Hand Surgery diploma examination?
    Traore, Sidi Yaya
    Goetsch, Thibaut
    Muller, Benjamin
    Dabbagh, Armaghan
    Liverneaux, Philippe Andre
    HAND SURGERY & REHABILITATION, 2023, 42 (04): : 362 - 364
  • [24] Evaluation of ChatGPT's performance in Medical Education: A Comparative Analysis with Students in a Pulmonology Examination
    Cherif, Hela
    Moussa, Chirine
    Ben Rjab, Sarra
    Mokaddem, Salma
    Dhahri, Besma
    EUROPEAN RESPIRATORY JOURNAL, 2024, 64
  • [25] Examination of ChatGPT's Performance as a Data Analysis Tool
    Kocak, Duygu
    EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2025,
  • [26] Turkish Board of Neurological Surgery
    Bulduk, Erkut Baha
    Yilmaz, Cem
    TURKISH NEUROSURGERY, 2019, 29 (01) : 121 - 126
  • [27] ChatGPT Is Equivalent to First-Year Plastic Surgery Residents: Evaluation of ChatGPT on the Plastic Surgery In-service Examination
    Humar, Pooja
    Asaad, Malke
    Bengur, Fuat Baris
    Nguyen, Vu
    AESTHETIC SURGERY JOURNAL, 2023, 43 (12) : NP1085 - NP1089
  • [28] Performance of ChatGPT on a Radiology Board-style Examination: Insights into Current Strengths and Limitations
    Bhayana, Rajesh
    Krishna, Satheesh
    Bleakney, Robert R.
    RADIOLOGY, 2023, 307 (05)
  • [29] Relationship Between Performance on Part I of the American Board of Orthopaedic Surgery Certifying Examination and Scores on USMLE Steps 1 and 2
    Swanson, David B.
    Sawhill, Amy
    Holtzman, Kathleen Z.
    Bucak, S. Deniz
    Morrison, Carol
    Hurwitz, Shepard
    DeRosa, G. Paul
    ACADEMIC MEDICINE, 2009, 84 : S21 - S24
  • [30] Comparison of ChatGPT-3.5, ChatGPT-4, and Orthopaedic Resident Performance on Orthopaedic Assessment Examinations
    Massey, Patrick A.
    Montgomery, Carver
    Zhang, Andrew S.
    JOURNAL OF THE AMERICAN ACADEMY OF ORTHOPAEDIC SURGEONS, 2023, 31 (23) : 1173 - 1179