Quality of information and appropriateness of Open AI outputs for prostate cancer

被引：20

作者：

Lombardo, Riccardo ^{[1
]}

Gallo, Giacomo ^{[1
]}

Stira, Jordi ^{[1
]}

Turchi, Beatrice ^{[1
]}

Santoro, Giuseppe ^{[1
]}

Riolo, Sara ^{[1
]}

Romagnoli, Matteo ^{[1
]}

Cicione, Antonio ^{[1
]}

Tema, Giorgia ^{[1
]}

Pastore, Antonio ^{[1
]}

Al Salhi, Yazan ^{[1
]}

Fuschi, Andrea ^{[1
]}

Franco, Giorgio ^{[1
]}

Nacchia, Antonio ^{[1
]}

Tubaro, Andrea ^{[1
]}

De Nunzio, Cosimo ^{[1
]}

机构：

[1] Sapienza Univ Rome, Dept Urol, Rome, Italy

来源：

PROSTATE CANCER AND PROSTATIC DISEASES | 2024年 / 28卷 / 1期

关键词：

D O I：

10.1038/s41391-024-00789-0

中图分类号：

R73 [肿瘤学];

学科分类号：

100214 ;

摘要：

Chat-GPT, a natural language processing (NLP) tool created by Open-AI, can potentially be used as a quick source for obtaining information related to prostate cancer. This study aims to analyze the quality and appropriateness of Chat-GPT's responses to inquiries related to prostate cancer compared to those of the European Urology Association's (EAU) 2023 prostate cancer guidelines. Overall, 195 questions were prepared according to the recommendations gathered in the prostate cancer section of the EAU 2023 Guideline. All questions were systematically presented to Chat-GPT's August 3 Version, and two expert urologists independently assessed and assigned scores ranging from 1 to 4 to each response (1: completely correct, 2: correct but inadequate, 3: a mix of correct and misleading information, and 4: completely incorrect). Sub-analysis per chapter and per grade of recommendation were performed. Overall, 195 recommendations were evaluated. Overall, 50/195 (26%) were completely correct, 51/195 (26%) correct but inadequate, 47/195 (24%) a mix of correct and misleading and 47/195 (24%) incorrect. When looking at different chapters Open AI was particularly accurate in answering questions on follow-up and QoL. Worst performance was recorded for the diagnosis and treatment chapters with respectively 19% and 30% of the answers completely incorrect. When looking at the strength of recommendation, no differences in terms of accuracy were recorded when comparing weak and strong recommendations (p > 0,05). Chat-GPT has a poor accuracy when answering questions on the PCa EAU guidelines recommendations. Future studies should assess its performance after adequate training.

引用

页码：229 / 231

页数：3

共 50 条

[31] Quality medical data management within an open AI architecture - cancer patients case
Ivanovic, Mirjana
Autexier, Serge
Kokkonidis, Miltiadis
Rust, Johannes
CONNECTION SCIENCE, 2023, 35 (01)
[32] Appropriateness of cancer screening exams: the case of the specific prostate antigen
Giavarina, Davide
BIOCHIMICA CLINICA, 2018, 42 (04) : 281 - 282
[33] ACR appropriateness criteria: Permanent source brachytherapy for prostate cancer
Davis, Brian J.
Taira, Al V.
Nguyen, Paul L.
Assimos, Dean G.
D'Amico, Anthony V.
Gottschalk, Alexander R.
Gustafson, Gary S.
Keole, Sameer R.
Liauw, Stanley L.
Lloyd, Shane
McLaughlin, Patrick W.
Movsas, Benjamin
Prestidge, Bradley R.
Showalter, Timothy N.
Vapiwala, Neha
BRACHYTHERAPY, 2017, 16 (02) : 266 - 276
[34] ACR Appropriateness Criteria® Postradical Prostatectomy Irradiation in Prostate Cancer
Gustafson, Gary S.
Nguyen, Paul L.
Assimos, Dean G.
D'Amico, Anthony V.
Gottschalk, Alexander R.
Hsu, I-Chow Joe
Lloyd, Shane
Mclaughlin, Patrick W.
Merrick, Gregory
Showalter, Timothy N.
Taira, Al V.
Vapiwala, Neha
Yamada, Yoshiya
Davis, Brian J.
ONCOLOGY-NEW YORK, 2014, 28 (12): : 1125 - +
[35] ACR Appropriateness Criteria® Postradical Prostatectomy Irradiation in Prostate Cancer
Rossi, Carl J., Jr.
Hsu, I-Chow Joe
Abdel-Wahab, May
Arterbery, V. Elayne
Ciezki, Jay P.
Frank, Steven J.
Hahn, Noah M.
Moran, Brian J.
Rosenthal, Seth A.
Merrick, Gregory
AMERICAN JOURNAL OF CLINICAL ONCOLOGY-CANCER CLINICAL TRIALS, 2011, 34 (01): : 92 - 98
[36] PROSTATE CANCER Diabetes and prostate cancer-an open debate
De Nunzio, Cosimo
Tubaro, Andrea
NATURE REVIEWS UROLOGY, 2013, 10 (01) : 12 - 14
[37] EVALUATING THE APPROPRIATENESS OF OPEN ACCESS COLONOSCOPY FOR COLORECTAL CANCER SCREENING
Kapila, Nikhil
Singh, Harjinder
Kandragunta, Kiranmayee
McMahon, Meaghan
Castro-Pavia, Fernando
GASTROENTEROLOGY, 2018, 154 (06) : S571 - S571
[38] Patients are dissatisfied with information provision: perceived information provision and quality of life in prostate cancer patients
Lamers, Romy E. D.
Cuypers, Maarten
Husson, Olga
de Vries, Marieke
Kil, Paul J. M.
Bosch, J. L. H. Ruud
van de Poll-Franse, Lonneke V.
PSYCHO-ONCOLOGY, 2016, 25 (06) : 633 - 640
[39] The inclusion of social determinants of health into evaluations of quality and appropriateness of AI assistant-ChatGPT
Hswen, Yulin
Nguyen, Thu T.
PROSTATE CANCER AND PROSTATIC DISEASES, 2024, 27 (01) : 157 - 157
[40] The inclusion of social determinants of health into evaluations of quality and appropriateness of AI assistant-ChatGPT
Yulin Hswen
Thu T. Nguyen
Prostate Cancer and Prostatic Diseases, 2024, 27 : 157 - 157

← 1 2 3 4 5 →