The Consistency and Quality of ChatGPT Responses Compared to Clinical Guidelines for Ovarian Cancer: A Delphi Approach

Cited by: 3

Authors:

Piazza, Dario ^{[1]}

Martorana, Federica ^{[2]}

Curaba, Annabella ^{[1]}

Sambataro, Daniela ^{[3]}

Valerio, Maria Rosaria ^{[4]}

Firenze, Alberto ^{[5]}

Pecorino, Basilio ^{[6,7]}

Scollo, Paolo ^{[6,7]}

Chiantera, Vito ^{[8]}

Scibilia, Giuseppe ^{[9]}

Vigneri, Paolo ^{[10,11]}

Gebbia, Vittorio ^{[1,12]}

Scandurra, Giuseppa ^{[13]}

Affiliations:

[1] Casa Cura Torina, Med Oncol Unit, I-90145 Palermo, Italy

[2] Univ Catania, Dept Clin & Expt Med, I-95124 Catania, Italy

[3] Osped Umberto I, Med Oncol Unit, I-94100 Enna, Italy

[4] Univ Palermo, Med Oncol Unit, Policlin P Giaccone, I-90133 Palermo, Italy

[5] Univ Palermo, Dept Hlth Promot Mother & Child Care, Occupat Hlth Sect, Internal Med & Med Specialties, I-90133 Palermo, Italy

[6] Osped Cannizzaro, Gynecol Unit, I-95126 Catania, Italy

[7] Univ Enna Kore, Fac Med & Surg, Gynecol, I-94100 Enna, Italy

[8] Univ Palermo, Gynecol, I-90133 Palermo, Italy

[9] Osped Paterno Arezzo, Gynecol Unit, I-97100 Ragusa, Italy

[10] Univ Catania, Med Oncol, I-95124 Catania, Italy

[11] Ist Clin Humanitas, Med Oncol, I-95045 Catania, Italy

[12] Univ Enna Kore, Fac Med & Surg, Med Oncol, I-94100 Enna, Italy

[13] Osped Cannizzaro, Med Oncol Unit, I-95126 Catania, Italy

Source:

CURRENT ONCOLOGY | 2024 / Volume 31 / Issue 05

Keywords:

artificial intelligence; ChatGPT; ovarian carcinoma; guidelines; RECOMMENDATIONS; CONSENSUS;

DOI:

10.3390/curroncol31050212

Chinese Library Classification (CLC) number:

R73 [Oncology];

Discipline classification code:

100214 ;

Abstract:

Introduction: In recent years, generative Artificial Intelligence models, such as ChatGPT, have increasingly been utilized in healthcare. Although AI models show high potential for quick access to sources and for formulating responses to clinical questions, the results obtained with them still require validation through comparison with established clinical guidelines. This study compares the responses of AI models to eight clinical questions with the Italian Association of Medical Oncology (AIOM) guidelines for ovarian cancer. Materials and Methods: The authors used the Delphi method to evaluate responses from ChatGPT and the AIOM guidelines. An expert panel of healthcare professionals assessed responses for clarity, consistency, comprehensiveness, usability, and quality using a five-point Likert scale. The GRADE methodology was used to assess the quality of the evidence and the strength of the recommendations. Results: A survey involving 14 physicians revealed that the AIOM guidelines received consistently higher average scores than the AI models, with a statistically significant difference. Post hoc tests showed that the AIOM guidelines differed significantly from all AI models, with no significant difference among the AI models. Conclusions: While AI models can provide rapid responses, they do not yet match established clinical guidelines in clarity, consistency, comprehensiveness, usability, and quality. These findings underscore the importance of relying on expert-developed guidelines in clinical decision-making and highlight potential areas for AI model improvement.


Pages: 2796-2804

Number of pages: 9
