Evaluation of ChatGPT's Performance in the Turkish Board of Orthopaedic Surgery Examination

被引:0
|
作者
Yigitbay, Ahmet [1 ]
机构
[1] Siverek State Hosp, Clin Orthoped & Traumatol, Sanliurfa, Turkiye
来源
关键词
Artificial intelligence; humans; orthopedics; specialty boards; ARTIFICIAL-INTELLIGENCE;
D O I
10.4274/haseki.galenos.2024.10038
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aim: Technological advances lead to significant changes in education and evaluation processes in medicine. In particular, artificial intelligence and natural language processing developments offer new opportunities in the health sector. This article evaluates Chat Generative Pre-Trained Transformer's (ChatGPT) performance in the Turkish Orthopaedics and Traumatology Education Council (TOTEK) Qualifying Written Examination and its applicability. Methods: To evaluate ChatGPT's performance, TOTEK Qualifying Written Examination questions from the last five years were entered as data. The results of ChatGPT were assessed under four parameters and compared with the actual exam results. The results were analyzed statistically. Results: Of the 500 questions, 458 were used as data in this study. Chat Generative Pre-Trained Transformer scored 40.2%, 26.3%, 37.3%, 32.9%, and 35.8% in the 2019, 2020, 2021, 2022, and 2023 TOTEK Qualifying Written Examination, respectively. When the correct answer percentages of ChatGPT according to years and the simple linear regression model applied to these data were analyzed, it was determined that there was a slightly decreasing trend in the correct answer rates as the years progressed. ChatGPT's TOTEK Qualifying Written Examination performance showed a statistically significant difference from the actual exam results. It was observed that the correct answer percentage of ChatGPT was below the general average success scores of the exam for each year. Conclusions: This analysis of artificial intelligence's applicability in the field and its role in training processes is essential to assess ChatGPT's potential uses and limitations. Chat Generative Pre-Trained Transformer can be a training tool, especially for knowledgebased and logical questions on specific topics. Still, its current performance is not at a level that can replace human decision-making in specialized medical fields.
引用
收藏
页码:243 / 249
页数:7
相关论文
共 50 条
  • [31] Utility of ChatGPT as a preparation tool for the Orthopaedic In-Training Examination
    Mendiratta, Dhruv
    Herzog, Isabel
    Singh, Rohan
    Para, Ashok
    Joshi, Tej
    Vosbikian, Michael
    Kaushal, Neil
    JOURNAL OF EXPERIMENTAL ORTHOPAEDICS, 2025, 12 (01)
  • [32] How does ChatGPT perform on the European Board of Pediatric Surgery examination? A randomized comparative study
    Azizoglu, Mustafa
    Aydogdu, Bahattin
    MEDICINA BALEAR, 2024, 39 (01): : 23 - 26
  • [33] Delay in taking the American Board of Surgery qualifying examination affects examination performance
    Malangoni, Mark A.
    Jones, Andrew T.
    Biester, Thomas W.
    Buyske, Jo
    Lewis, Frank R., Jr.
    SURGERY, 2012, 152 (04) : 738 - 746
  • [34] The opinion and recommendations of Turkish Board for Accreditation in Cardiology on Board Examination
    Yildirir, Aylin
    Altun, Armagan
    Ural, Dilek
    Ozdemir, Murat
    Aslan, Ozgur
    Muderrisoglu, Haldun
    TURK KARDIYOLOJI DERNEGI ARSIVI-ARCHIVES OF THE TURKISH SOCIETY OF CARDIOLOGY, 2019, 47 (07): : 549 - 551
  • [35] Association of Vascular Surgery Board of the American Board of Surgery Examination Performance With Clinical Outcomes: Experience Matters
    Kraiss, Larry W.
    Al-Dulaimi, Ragheed
    Presson, Angela
    Cronenwett, Jack L.
    Eidt, John F.
    Mills, Joseph L.
    Hallett, John
    Kent, K. Craig
    Goodney, Philip P.
    Brooke, Benjamin S.
    JOURNAL OF VASCULAR SURGERY, 2018, 68 (03) : E29 - E30
  • [36] Do Orthopaedic In-Training Examination Scores Predict the Likelihood of Passing the American Board of Orthopaedic Surgery Part I Examination? An Update With 2014 to 2018 Data
    Fritz, Erik
    Bednar, Michael
    Harrast, John
    Marsh, J. Lawrence
    Martin, David
    Swanson, David
    Tornetta, Paul
    Van Heest, Ann
    JOURNAL OF THE AMERICAN ACADEMY OF ORTHOPAEDIC SURGEONS, 2021, 29 (24) : E1370 - E1377
  • [37] Comparative Performance of ChatGPT 3.5 and GPT4 on Rhinology Standardized Board Examination Questions
    Patel, Evan A.
    Fleischer, Lindsay
    Filip, Peter
    Eggerstedt, Michael
    Hutz, Michael
    Michaelides, Elias
    Batra, Pete S.
    Tajudeen, Bobby A.
    OTO OPEN, 2024, 8 (02)
  • [38] The Performance of ChatGPT on the American Society for Surgery of the Hand Self-Assessment Examination
    Arango, Sebastian D.
    Flynn, Jason C.
    Zeitlin, Jacob
    Wilson, Matthew S.
    Strohl, Adam B.
    Weiss, Lawrence E.
    Weir, Tristan B.
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (04)
  • [39] Integrating artificial intelligence in orthopaedic care and surgery: the revolutionary role of ChatGPT, as written with ChatGPT
    Ghanem, Diane
    INTERNATIONAL JOURNAL OF SURGERY, 2024, 110 (12) : 7593 - 7597
  • [40] The American Board Style Practice In-Training Examination as a Predictor of Performance on the American Board of Surgery In-Training Examination
    Kantor, Rami S.
    Wise, Eric
    Morales, David
    Harris, Donald G.
    Kidd-Romero, Sarah
    Kavic, Stephen
    JOURNAL OF SURGICAL EDUCATION, 2018, 75 (04) : 895 - 900