ChatGPT Solving Complex Kidney Transplant Cases: A Comparative Study With Human Respondents

被引：0

作者：

Mankowski, Michal A. ^{[1
]}

Jaffe, Ian S. ^{[1
]}

Xu, Jingzhi ^{[1
]}

Bae, Sunjae ^{[1
,2
]}

Oermann, Eric K. ^{[3
]}

Aphinyanaphongs, Yindalon ^{[2
,4
]}

McAdams-DeMarco, Mara A. ^{[1
,2
]}

Lonze, Bonnie E. ^{[1
]}

Orandi, Babak J. ^{[1
,4
]}

Stewart, Darren ^{[1
]}

Levan, Macey ^{[1
,2
]}

Massie, Allan ^{[1
,2
]}

Gentry, Sommer ^{[1
,2
]}

Segev, Dorry L. ^{[1
,2
]}

机构：

[1] NYU Grossman Sch Med, Dept Surg, New York, NY 10016 USA

[2] NYU Grossman Sch Med, Dept Populat Hlth, New York, NY USA

[3] NYU Grossman Sch Med, Dept Neurosurg, New York, NY USA

[4] NYU Grossman Sch Med, Dept Med, New York, NY USA

来源：

CLINICAL TRANSPLANTATION | 2024年 / 38卷 / 10期

基金：

美国国家卫生研究院;

关键词：

artificial intelligence; ChatGPT; generative pretrained transformer; kidney transplantation; quiz; AMERICAN SOCIETY; NEPHROLOGY QUIZ;

D O I：

10.1111/ctr.15466

中图分类号：

R61 [外科手术学];

学科分类号：

摘要：

IntroductionChatGPT has shown the ability to answer clinical questions in general medicine but may be constrained by the specialized nature of kidney transplantation. Thus, it is important to explore how ChatGPT can be used in kidney transplantation and how its knowledge compares to human respondents.MethodsWe prompted ChatGPT versions 3.5, 4, and 4 Visual (4 V) with 12 multiple-choice questions related to six kidney transplant cases from 2013 to 2015 American Society of Nephrology (ASN) fellowship program quizzes. We compared the performance of ChatGPT with US nephrology fellowship program directors, nephrology fellows, and the audience of the ASN's annual Kidney Week meeting.ResultsOverall, ChatGPT 4 V correctly answered 10 out of 12 questions, showing a performance level comparable to nephrology fellows (group majority correctly answered 9 of 12 questions) and training program directors (11 of 12). This surpassed ChatGPT 4 (7 of 12 correct) and 3.5 (5 of 12). All three ChatGPT versions failed to correctly answer questions where the consensus among human respondents was low.ConclusionEach iterative version of ChatGPT performed better than the prior version, with version 4 V achieving performance on par with nephrology fellows and training program directors. While it shows promise in understanding and answering kidney transplantation questions, ChatGPT should be seen as a complementary tool to human expertise rather than a replacement.

引用

页数：8

共 50 条

[1] ChatGPT in Occupational Medicine: A Comparative Study with Human Experts
Padovan, Martina
Cosci, Bianca
Petillo, Armando
Nerli, Gianluca
Porciatti, Francesco
Scarinci, Sergio
Carlucci, Francesco
Dell'Amico, Letizia
Meliani, Niccolo
Necciari, Gabriele
Lucisano, Vincenzo Carmelo
Marino, Riccardo
Foddis, Rudy
Palla, Alessandro
BIOENGINEERING-BASEL, 2024, 11 (01):
[2] ChatGPT (GPT-4) versus doctors on complex cases of the Swedish family medicine specialist examination: an observational comparative study
Arvidsson, Rasmus
Gunnarsson, Ronny
Entezarjou, Artin
Sundemo, David
Wikberg, Carl
BMJ OPEN, 2024, 14 (12):
[3] COMPARATIVE STUDY OF DEATH PERCENTAGES: DIALYSIS VERSUS KIDNEY TRANSPLANT
Atmane, Seba
Moufida, Hamouche
Ouaret, Rafik
Badaoui, Lynda
Daou, Rosa
Moussi, Fatiha
Noura, Ouerda
TRANSPLANT INTERNATIONAL, 2019, 32 : 369 - 369
[4] Ramadan fast in kidney transplant recipients: A prospective comparative study
Said, T
Nampoory, MRN
Haleem, MA
Nair, MP
Johny, KV
Samhan, M
Al-Mousawi, M
TRANSPLANTATION PROCEEDINGS, 2003, 35 (07) : 2614 - 2616
[5] Assessment of Complex Oncologic Cases, a Comparative Analysis Between Conversational AI (ChatGPT) and a Multidisciplinary Oncologic Board
Hernandez-Flores, Luis A.
Rosales De La Rosa, Jesus J.
Lopez Martinez, Jose B.
Contreras, Sergio
Cortes Gonzalez, Ruben
JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2024, 239 (05) : S438 - S439
[6] Detecting LLM-Generated Text in Computing Education: Comparative Study for ChatGPT Cases
Orenstrakh, Michael Sheinman
Karnalim, Oscar
Anibal Suarez, Carlos
Liut, Michael
2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 121 - 126
[7] A Comparative Study of Solving the Problem of Module Identification in a Complex Network
Lastusilta, Toni
Papageorgiou, Lazaros G.
Westerlund, Tapio
ICHEAP-10: 10TH INTERNATIONAL CONFERENCE ON CHEMICAL AND PROCESS ENGINEERING, PTS 1-3, 2011, 24 : 319 - +
[8] Rhabdoid tumor of the kidney in children: A comparative study of 21 cases
Agrons, GA
Kingsman, KD
Wagner, BJ
SoteloAvila, C
AMERICAN JOURNAL OF ROENTGENOLOGY, 1997, 168 (02) : 447 - 451
[9] Type 2 diabetes and kidney transplant: comparative study on medication adherence
de Oliveira Procopio, Fernanda
Rangel, Erika Bevilaqua
de Aguiar Roza, Bartira
de Sa, Joao Roberto
Schirmer, Janine
ACTA PAULISTA DE ENFERMAGEM, 2023, 36
[10] Type 2 diabetes and kidney transplant: comparative study on medication adherence
Procopio, Fernanda de Oliveira
Rangel, Erika Bevilaqua
Roza, Bartira de Aguiar
de Sa, Joao Roberto
Schirmer, Janine
ACTA PAULISTA DE ENFERMAGEM, 2023, 36

← 1 2 3 4 5 →