Evaluation of ChatGPT's Performance in the Turkish Board of Orthopaedic Surgery Examination

被引:0
|
作者
Yigitbay, Ahmet [1 ]
机构
[1] Siverek State Hosp, Clin Orthoped & Traumatol, Sanliurfa, Turkiye
来源
关键词
Artificial intelligence; humans; orthopedics; specialty boards; ARTIFICIAL-INTELLIGENCE;
D O I
10.4274/haseki.galenos.2024.10038
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aim: Technological advances lead to significant changes in education and evaluation processes in medicine. In particular, artificial intelligence and natural language processing developments offer new opportunities in the health sector. This article evaluates Chat Generative Pre-Trained Transformer's (ChatGPT) performance in the Turkish Orthopaedics and Traumatology Education Council (TOTEK) Qualifying Written Examination and its applicability. Methods: To evaluate ChatGPT's performance, TOTEK Qualifying Written Examination questions from the last five years were entered as data. The results of ChatGPT were assessed under four parameters and compared with the actual exam results. The results were analyzed statistically. Results: Of the 500 questions, 458 were used as data in this study. Chat Generative Pre-Trained Transformer scored 40.2%, 26.3%, 37.3%, 32.9%, and 35.8% in the 2019, 2020, 2021, 2022, and 2023 TOTEK Qualifying Written Examination, respectively. When the correct answer percentages of ChatGPT according to years and the simple linear regression model applied to these data were analyzed, it was determined that there was a slightly decreasing trend in the correct answer rates as the years progressed. ChatGPT's TOTEK Qualifying Written Examination performance showed a statistically significant difference from the actual exam results. It was observed that the correct answer percentage of ChatGPT was below the general average success scores of the exam for each year. Conclusions: This analysis of artificial intelligence's applicability in the field and its role in training processes is essential to assess ChatGPT's potential uses and limitations. Chat Generative Pre-Trained Transformer can be a training tool, especially for knowledgebased and logical questions on specific topics. Still, its current performance is not at a level that can replace human decision-making in specialized medical fields.
引用
收藏
页码:243 / 249
页数:7
相关论文
共 50 条
  • [1] Can Artificial Intelligence Pass the American Board of Orthopaedic Surgery Examination? Orthopaedic Residents Versus ChatGPT
    Lum, Zachary C.
    CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2023, 481 (08) : 1623 - 1630
  • [2] CORR Insights®: Can Artificial Intelligence Pass the American Board of Orthopaedic Surgery Examination? Orthopaedic Residents Versus ChatGPT
    Karnuta, Jaret McGraw
    CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2023, 481 (08) : 1631 - 1633
  • [3] Performance of ChatGPT on American Board of Surgery In-Training Examination Preparation Questions
    Tran, Catherine G.
    Chang, Jeremy
    Sherman, Scott K.
    De Andrade, James P.
    JOURNAL OF SURGICAL RESEARCH, 2024, 299 : 329 - 335
  • [4] Evaluating the performance of ChatGPT-3.5 and ChatGPT-4 on the Taiwan plastic surgery board examination
    Hsieh, Ching-Hua
    Hsieh, Hsiao-Yun
    Lin, Hui-Ping
    HELIYON, 2024, 10 (14)
  • [5] ChatGPT and the German board examination for ophthalmology: an evaluation
    Yaici, Remi
    Cieplucha, M.
    Bock, R.
    Moayed, F.
    Bechrakis, N. E.
    Berens, P.
    Feltgen, N.
    Friedburg, D.
    Graef, M.
    Guthoff, R.
    Hoffmann, E. M.
    Hoerauf, H.
    Hintschich, C.
    Kohnen, T.
    Messmer, E. M.
    Nentwich, M. M.
    Pleyer, U.
    Schaudig, U.
    Seitz, B.
    Geerling, G.
    Roth, M.
    OPHTHALMOLOGIE, 2024, 121 (07): : 554 - 564
  • [6] Evaluating ChatGPT Performance on the Orthopaedic In-Training Examination
    Kung, Justin E.
    Marshall, Christopher
    Gauthier, Chase
    Gonzalez, Tyler A.
    Jackson III, J. Benjamin
    JBJS OPEN ACCESS, 2023, 8 (03)
  • [7] Assessment of ChatGPT's performance on neurology written board examination questions
    Chen, Tse Chian
    Multala, Evan
    Kearns, Patrick
    Delashaw, Johnny
    Dumont, Aaron
    Maraganore, Demetrius
    Wang, Arthur
    BMJ NEUROLOGY OPEN, 2023, 5 (02)
  • [8] Predictors of Success on the American Board of Orthopaedic Surgery Examination
    Herndon, James H.
    Allan, Bassan J.
    Dyer, George
    Jawa, Andrew
    Zurakowski, David
    CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2009, 467 (09) : 2436 - 2445
  • [9] Performance of ChatGPT on a Practice Dermatology Board Certification Examination
    Joly-Chevrier, Maxine
    Nguyen, Anne Xuan-Lan
    Lesko-Krleza, Michael
    Lefrancois, Philippe
    JOURNAL OF CUTANEOUS MEDICINE AND SURGERY, 2023, 27 (04) : 409 - 412
  • [10] ChatGPT and the European Board of Hand Surgery diploma examination: Correspondence
    Kleebayoon, Amnuay
    Mungmunpuntipantip, Rujittika
    Wiwanitkit, Viroj
    HAND SURGERY & REHABILITATION, 2023, 42 (05):