ChatGPT Solving Complex Kidney Transplant Cases: A Comparative Study With Human Respondents

Authors
Mankowski, Michal A. [1]
Jaffe, Ian S. [1]
Xu, Jingzhi [1]
Bae, Sunjae [1,2]
Oermann, Eric K. [3]
Aphinyanaphongs, Yindalon [2,4]
McAdams-DeMarco, Mara A. [1,2]
Lonze, Bonnie E. [1]
Orandi, Babak J. [1,4]
Stewart, Darren [1]
Levan, Macey [1,2]
Massie, Allan [1,2]
Gentry, Sommer [1,2]
Segev, Dorry L. [1,2]
Affiliations
[1] NYU Grossman Sch Med, Dept Surg, New York, NY 10016 USA
[2] NYU Grossman Sch Med, Dept Populat Hlth, New York, NY USA
[3] NYU Grossman Sch Med, Dept Neurosurg, New York, NY USA
[4] NYU Grossman Sch Med, Dept Med, New York, NY USA
Funding
National Institutes of Health (NIH), USA;
Keywords
artificial intelligence; ChatGPT; generative pretrained transformer; kidney transplantation; quiz; AMERICAN SOCIETY; NEPHROLOGY QUIZ;
DOI
10.1111/ctr.15466
Chinese Library Classification (CLC) number
R61 [Operative Surgery];
Discipline classification code
Abstract
Introduction: ChatGPT has shown the ability to answer clinical questions in general medicine but may be constrained by the specialized nature of kidney transplantation. Thus, it is important to explore how ChatGPT can be used in kidney transplantation and how its knowledge compares to human respondents.

Methods: We prompted ChatGPT versions 3.5, 4, and 4 Visual (4 V) with 12 multiple-choice questions related to six kidney transplant cases from 2013 to 2015 American Society of Nephrology (ASN) fellowship program quizzes. We compared the performance of ChatGPT with US nephrology fellowship program directors, nephrology fellows, and the audience of the ASN's annual Kidney Week meeting.

Results: Overall, ChatGPT 4 V correctly answered 10 out of 12 questions, showing a performance level comparable to nephrology fellows (group majority correctly answered 9 of 12 questions) and training program directors (11 of 12). This surpassed ChatGPT 4 (7 of 12 correct) and 3.5 (5 of 12). All three ChatGPT versions failed to correctly answer questions where the consensus among human respondents was low.

Conclusion: Each iterative version of ChatGPT performed better than the prior version, with version 4 V achieving performance on par with nephrology fellows and training program directors. While it shows promise in understanding and answering kidney transplantation questions, ChatGPT should be seen as a complementary tool to human expertise rather than a replacement.
Pages: 8