Evaluating the accuracy of ChatGPT-4 in predicting ASA scores: A prospective multicentric study ChatGPT-4 in ASA score prediction

被引：9

作者：

Turan, Engin Ihsan ^{[1
,3
]}

Baydemir, Abdurrahman Engin ^{[2
]}

Ozcan, Funda Gumus

Sahin, Ayca Sultan ^{[1
]}

机构：

[1] Istanbul Hlth Sci Univ, Dept Anesthesiol, Kanuni Sultan Suleyman Educ & Training Hosp, Istanbul, Turkiye

[2] Basaksehir Cam ve Sakura City Hosp, Dept Anesthesiol, Istanbul, Turkiye

[3] Istanbul Hlth Sci Univ, Anesthesiol & Reanimat Dept, Dept Gastroenterol, Kanuni Sultan Suleyman Hosp, Atakent Mahallesi Turgut Ozal Bulvari 46-1, TR-34303 Istanbul, Turkiye

来源：

JOURNAL OF CLINICAL ANESTHESIA | 2024年 / 96卷

关键词：

D O I：

10.1016/j.jclinane.2024.111475

中图分类号：

R614 [麻醉学];

学科分类号：

100217 ;

摘要：

Background: This study investigates the potential of ChatGPT-4, developed by OpenAI, in enhancing medical decision-making processes, particularly in preoperative assessments using the American Society of Anesthesiologists (ASA) scoring system. The ASA score, a critical tool in evaluating patients' health status and anesthesia risks before surgery, categorizes patients from I to VI based on their overall health and risk factors. Despite its widespread use, determining accurate ASA scores remains a subjective process that may benefit from AI-supported assessments. This research aims to evaluate ChatGPT-4's capability to predict ASA scores accurately compared to expert anesthesiologists' assessments. Methods: In this prospective multicentric study, ethical board approval was obtained, and the study was registered with clinicaltrials.gov (NCT06321445). We included 2851 patients from anesthesiology outpatient clinics, spanning neonates to all age groups and genders, with ASA scores between I-IV. Exclusion criteria were set for ASA V and VI scores, emergency operations, and insufficient information for ASA score determination. Data on patients' demographics, health conditions, and ASA scores by anesthesiologists were collected and anonymized. ChatGPT-4 was then tasked with assigning ASA scores based on the standardized patient data. Results: Our results indicate a high level of concordance between ChatGPT-4 predictions and anesthesiologists' evaluations, with Cohen's kappa analysis showing a kappa value of 0.858 ( p = 0.000). While the model demonstrated over 90% accuracy in predicting ASA scores I to III, it showed a notable variance in ASA IV scores, suggesting a potential limitation in assessing patients with more complex health conditions. Discussion: The findings suggest that ChatGPT-4 can significantly contribute to the medical field by supporting anesthesiologists in preoperative assessments. This study not only demonstrates ChatGPT-4's efficacy in medical data analysis and decision-making but also opens new avenues for AI applications in healthcare, particularly in enhancing patient safety and optimizing surgical outcomes. Further research is needed to refine AI models for complex case assessments and integrate them seamlessly into clinical workflows.

引用

页数：7

共 50 条

[41] Using ChatGPT-4 in visual field test assessment
Akgun, Gulsah Gumus
Altan, Cigdem
Balci, Ali Safa
Alagoz, Nese
Cakir, Ihsan
Yasar, Tekin
CLINICAL AND EXPERIMENTAL OPTOMETRY, 2025,
[42] Evaluating ChatGPT-4 for the Interpretation of Images from Several Diagnostic Techniques in Gastroenterology
Saraiva, Miguel Mascarenhas
Ribeiro, Tiago
Agudo, Belen
Afonso, Joao
Mendes, Francisco
Martins, Miguel
Cardoso, Pedro
Mota, Joana
Almeida, Maria Joao
Costa, Antonio
Gonzalez Haba Ruiz, Mariano
Widmer, Jessica
Moura, Eduardo
Javed, Ahsan
Manzione, Thiago
Nadal, Sidney
Barroso, Luis F.
de Parades, Vincent
Ferreira, Joao
Macedo, Guilherme
JOURNAL OF CLINICAL MEDICINE, 2025, 14 (02)
[43] Using ChatGPT-4 to Teach the Design of Data Visualizations
Lear, Benjamin J.
JOURNAL OF CHEMICAL EDUCATION, 2024, 101 (07) : 2749 - 2756
[44] Diagnostic Performance of ChatGPT-4 in Patients with Retinal Pathologies
Mafi, Mostafa
Montazeri, Fateme
Moghadam, Mohammad Mehdi Johari
Mirghorbani, Masoud
Anvari, Pasha
Falavarjani, Khalil Ghasemi
Mahoney, Mohammad Delsoz
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
[45] A Study on the Efficacy of ChatGPT-4 in Enhancing Students' English Communication Skills
Wang, Ying
SAGE OPEN, 2025, 15 (01):
[46] Suicide Risk Assessments Through the Eyes of ChatGPT-3.5 Versus ChatGPT-4: Vignette Study
Levkovich, Inbar
Elyoseph, Zohar
JMIR MENTAL HEALTH, 2023, 10
[47] Revolutionizing Diagnostics: Evaluating ChatGPT-4's Performance in Ulcerative Colitis Endoscopic Assessment
Levartovsky, A.
Albshesh, A.
Grinman, A.
Shachar, E.
Lahat, A.
Eliakim, R.
Kopylov, U.
JOURNAL OF CROHNS & COLITIS, 2025, 19 : I748 - I748
[48] Uncovering the Reasons behind Willingness to Pay for ChatGPT-4 Premium
Jo, Hyeon
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2025, 41 (02) : 994 - 1009
[49] ChatGPT-4 versus human assessment in cardiology peer review
Fernandez-Cisnal, Agustin
Avanzas, Pablo
Filgueiras-Rama, David
Garcia-Pavia, Pablo
Sanchis, Laura
Sanchis, Juan
REVISTA ESPANOLA DE CARDIOLOGIA, 2024, 77 (07): : 591 - 594
[50] Assessing the Current Clinical Application of ChatGPT-4 in Radiation Oncology
Chuang, W. K.
Kao, Y. S.
Liu, Y. T.
Lee, C. Y.
INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2024, 120 (02): : E641 - E642

← 1 2 3 4 5 →