Evaluation of ChatGPT's Performance in the Turkish Board of Orthopaedic Surgery Examination

被引：0

作者：

Yigitbay, Ahmet ^{[1
]}

机构：

[1] Siverek State Hosp, Clin Orthoped & Traumatol, Sanliurfa, Turkiye

来源：

HASEKI TIP BULTENI-MEDICAL BULLETIN OF HASEKI | 2024年 / 62卷 / 04期

关键词：

Artificial intelligence; humans; orthopedics; specialty boards; ARTIFICIAL-INTELLIGENCE;

D O I：

10.4274/haseki.galenos.2024.10038

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Aim: Technological advances lead to significant changes in education and evaluation processes in medicine. In particular, artificial intelligence and natural language processing developments offer new opportunities in the health sector. This article evaluates Chat Generative Pre-Trained Transformer's (ChatGPT) performance in the Turkish Orthopaedics and Traumatology Education Council (TOTEK) Qualifying Written Examination and its applicability. Methods: To evaluate ChatGPT's performance, TOTEK Qualifying Written Examination questions from the last five years were entered as data. The results of ChatGPT were assessed under four parameters and compared with the actual exam results. The results were analyzed statistically. Results: Of the 500 questions, 458 were used as data in this study. Chat Generative Pre-Trained Transformer scored 40.2%, 26.3%, 37.3%, 32.9%, and 35.8% in the 2019, 2020, 2021, 2022, and 2023 TOTEK Qualifying Written Examination, respectively. When the correct answer percentages of ChatGPT according to years and the simple linear regression model applied to these data were analyzed, it was determined that there was a slightly decreasing trend in the correct answer rates as the years progressed. ChatGPT's TOTEK Qualifying Written Examination performance showed a statistically significant difference from the actual exam results. It was observed that the correct answer percentage of ChatGPT was below the general average success scores of the exam for each year. Conclusions: This analysis of artificial intelligence's applicability in the field and its role in training processes is essential to assess ChatGPT's potential uses and limitations. Chat Generative Pre-Trained Transformer can be a training tool, especially for knowledgebased and logical questions on specific topics. Still, its current performance is not at a level that can replace human decision-making in specialized medical fields.

引用

页码：243 / 249

页数：7

共 50 条

[31] Utility of ChatGPT as a preparation tool for the Orthopaedic In-Training Examination
Mendiratta, Dhruv
Herzog, Isabel
Singh, Rohan
Para, Ashok
Joshi, Tej
Vosbikian, Michael
Kaushal, Neil
JOURNAL OF EXPERIMENTAL ORTHOPAEDICS, 2025, 12 (01)
[32] How does ChatGPT perform on the European Board of Pediatric Surgery examination? A randomized comparative study
Azizoglu, Mustafa
Aydogdu, Bahattin
MEDICINA BALEAR, 2024, 39 (01): : 23 - 26
[33] Delay in taking the American Board of Surgery qualifying examination affects examination performance
Malangoni, Mark A.
Jones, Andrew T.
Biester, Thomas W.
Buyske, Jo
Lewis, Frank R., Jr.
SURGERY, 2012, 152 (04) : 738 - 746
[34] The opinion and recommendations of Turkish Board for Accreditation in Cardiology on Board Examination
Yildirir, Aylin
Altun, Armagan
Ural, Dilek
Ozdemir, Murat
Aslan, Ozgur
Muderrisoglu, Haldun
TURK KARDIYOLOJI DERNEGI ARSIVI-ARCHIVES OF THE TURKISH SOCIETY OF CARDIOLOGY, 2019, 47 (07): : 549 - 551
[35] Association of Vascular Surgery Board of the American Board of Surgery Examination Performance With Clinical Outcomes: Experience Matters
Kraiss, Larry W.
Al-Dulaimi, Ragheed
Presson, Angela
Cronenwett, Jack L.
Eidt, John F.
Mills, Joseph L.
Hallett, John
Kent, K. Craig
Goodney, Philip P.
Brooke, Benjamin S.
JOURNAL OF VASCULAR SURGERY, 2018, 68 (03) : E29 - E30
[36] Do Orthopaedic In-Training Examination Scores Predict the Likelihood of Passing the American Board of Orthopaedic Surgery Part I Examination? An Update With 2014 to 2018 Data
Fritz, Erik
Bednar, Michael
Harrast, John
Marsh, J. Lawrence
Martin, David
Swanson, David
Tornetta, Paul
Van Heest, Ann
JOURNAL OF THE AMERICAN ACADEMY OF ORTHOPAEDIC SURGEONS, 2021, 29 (24) : E1370 - E1377
[37] Comparative Performance of ChatGPT 3.5 and GPT4 on Rhinology Standardized Board Examination Questions
Patel, Evan A.
Fleischer, Lindsay
Filip, Peter
Eggerstedt, Michael
Hutz, Michael
Michaelides, Elias
Batra, Pete S.
Tajudeen, Bobby A.
OTO OPEN, 2024, 8 (02)
[38] The Performance of ChatGPT on the American Society for Surgery of the Hand Self-Assessment Examination
Arango, Sebastian D.
Flynn, Jason C.
Zeitlin, Jacob
Wilson, Matthew S.
Strohl, Adam B.
Weiss, Lawrence E.
Weir, Tristan B.
CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (04)
[39] Integrating artificial intelligence in orthopaedic care and surgery: the revolutionary role of ChatGPT, as written with ChatGPT
Ghanem, Diane
INTERNATIONAL JOURNAL OF SURGERY, 2024, 110 (12) : 7593 - 7597
[40] The American Board Style Practice In-Training Examination as a Predictor of Performance on the American Board of Surgery In-Training Examination
Kantor, Rami S.
Wise, Eric
Morales, David
Harris, Donald G.
Kidd-Romero, Sarah
Kavic, Stephen
JOURNAL OF SURGICAL EDUCATION, 2018, 75 (04) : 895 - 900

← 1 2 3 4 5 →