Quality of information and appropriateness of Open AI outputs for prostate cancer

被引:20
|
作者
Lombardo, Riccardo [1 ]
Gallo, Giacomo [1 ]
Stira, Jordi [1 ]
Turchi, Beatrice [1 ]
Santoro, Giuseppe [1 ]
Riolo, Sara [1 ]
Romagnoli, Matteo [1 ]
Cicione, Antonio [1 ]
Tema, Giorgia [1 ]
Pastore, Antonio [1 ]
Al Salhi, Yazan [1 ]
Fuschi, Andrea [1 ]
Franco, Giorgio [1 ]
Nacchia, Antonio [1 ]
Tubaro, Andrea [1 ]
De Nunzio, Cosimo [1 ]
机构
[1] Sapienza Univ Rome, Dept Urol, Rome, Italy
关键词
D O I
10.1038/s41391-024-00789-0
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Chat-GPT, a natural language processing (NLP) tool created by Open-AI, can potentially be used as a quick source for obtaining information related to prostate cancer. This study aims to analyze the quality and appropriateness of Chat-GPT's responses to inquiries related to prostate cancer compared to those of the European Urology Association's (EAU) 2023 prostate cancer guidelines. Overall, 195 questions were prepared according to the recommendations gathered in the prostate cancer section of the EAU 2023 Guideline. All questions were systematically presented to Chat-GPT's August 3 Version, and two expert urologists independently assessed and assigned scores ranging from 1 to 4 to each response (1: completely correct, 2: correct but inadequate, 3: a mix of correct and misleading information, and 4: completely incorrect). Sub-analysis per chapter and per grade of recommendation were performed. Overall, 195 recommendations were evaluated. Overall, 50/195 (26%) were completely correct, 51/195 (26%) correct but inadequate, 47/195 (24%) a mix of correct and misleading and 47/195 (24%) incorrect. When looking at different chapters Open AI was particularly accurate in answering questions on follow-up and QoL. Worst performance was recorded for the diagnosis and treatment chapters with respectively 19% and 30% of the answers completely incorrect. When looking at the strength of recommendation, no differences in terms of accuracy were recorded when comparing weak and strong recommendations (p > 0,05). Chat-GPT has a poor accuracy when answering questions on the PCa EAU guidelines recommendations. Future studies should assess its performance after adequate training.
引用
收藏
页码:229 / 231
页数:3
相关论文
共 50 条
  • [31] Quality medical data management within an open AI architecture - cancer patients case
    Ivanovic, Mirjana
    Autexier, Serge
    Kokkonidis, Miltiadis
    Rust, Johannes
    CONNECTION SCIENCE, 2023, 35 (01)
  • [32] Appropriateness of cancer screening exams: the case of the specific prostate antigen
    Giavarina, Davide
    BIOCHIMICA CLINICA, 2018, 42 (04) : 281 - 282
  • [33] ACR appropriateness criteria: Permanent source brachytherapy for prostate cancer
    Davis, Brian J.
    Taira, Al V.
    Nguyen, Paul L.
    Assimos, Dean G.
    D'Amico, Anthony V.
    Gottschalk, Alexander R.
    Gustafson, Gary S.
    Keole, Sameer R.
    Liauw, Stanley L.
    Lloyd, Shane
    McLaughlin, Patrick W.
    Movsas, Benjamin
    Prestidge, Bradley R.
    Showalter, Timothy N.
    Vapiwala, Neha
    BRACHYTHERAPY, 2017, 16 (02) : 266 - 276
  • [34] ACR Appropriateness Criteria® Postradical Prostatectomy Irradiation in Prostate Cancer
    Gustafson, Gary S.
    Nguyen, Paul L.
    Assimos, Dean G.
    D'Amico, Anthony V.
    Gottschalk, Alexander R.
    Hsu, I-Chow Joe
    Lloyd, Shane
    Mclaughlin, Patrick W.
    Merrick, Gregory
    Showalter, Timothy N.
    Taira, Al V.
    Vapiwala, Neha
    Yamada, Yoshiya
    Davis, Brian J.
    ONCOLOGY-NEW YORK, 2014, 28 (12): : 1125 - +
  • [35] ACR Appropriateness Criteria® Postradical Prostatectomy Irradiation in Prostate Cancer
    Rossi, Carl J., Jr.
    Hsu, I-Chow Joe
    Abdel-Wahab, May
    Arterbery, V. Elayne
    Ciezki, Jay P.
    Frank, Steven J.
    Hahn, Noah M.
    Moran, Brian J.
    Rosenthal, Seth A.
    Merrick, Gregory
    AMERICAN JOURNAL OF CLINICAL ONCOLOGY-CANCER CLINICAL TRIALS, 2011, 34 (01): : 92 - 98
  • [36] PROSTATE CANCER Diabetes and prostate cancer-an open debate
    De Nunzio, Cosimo
    Tubaro, Andrea
    NATURE REVIEWS UROLOGY, 2013, 10 (01) : 12 - 14
  • [37] EVALUATING THE APPROPRIATENESS OF OPEN ACCESS COLONOSCOPY FOR COLORECTAL CANCER SCREENING
    Kapila, Nikhil
    Singh, Harjinder
    Kandragunta, Kiranmayee
    McMahon, Meaghan
    Castro-Pavia, Fernando
    GASTROENTEROLOGY, 2018, 154 (06) : S571 - S571
  • [38] Patients are dissatisfied with information provision: perceived information provision and quality of life in prostate cancer patients
    Lamers, Romy E. D.
    Cuypers, Maarten
    Husson, Olga
    de Vries, Marieke
    Kil, Paul J. M.
    Bosch, J. L. H. Ruud
    van de Poll-Franse, Lonneke V.
    PSYCHO-ONCOLOGY, 2016, 25 (06) : 633 - 640
  • [39] The inclusion of social determinants of health into evaluations of quality and appropriateness of AI assistant-ChatGPT
    Hswen, Yulin
    Nguyen, Thu T.
    PROSTATE CANCER AND PROSTATIC DISEASES, 2024, 27 (01) : 157 - 157
  • [40] The inclusion of social determinants of health into evaluations of quality and appropriateness of AI assistant-ChatGPT
    Yulin Hswen
    Thu T. Nguyen
    Prostate Cancer and Prostatic Diseases, 2024, 27 : 157 - 157