The Consistency and Quality of ChatGPT Responses Compared to Clinical Guidelines for Ovarian Cancer: A Delphi Approach

被引:3
|
作者
Piazza, Dario [1 ]
Martorana, Federica [2 ]
Curaba, Annabella [1 ]
Sambataro, Daniela [3 ]
Valerio, Maria Rosaria [4 ]
Firenze, Alberto [5 ]
Pecorino, Basilio [6 ,7 ]
Scollo, Paolo [6 ,7 ]
Chiantera, Vito [8 ]
Scibilia, Giuseppe [9 ]
Vigneri, Paolo [10 ,11 ]
Gebbia, Vittorio [1 ,12 ]
Scandurra, Giuseppa [13 ]
机构
[1] Casa Cura Torina, Med Oncol Unit, I-90145 Palermo, Italy
[2] Univ Catania, Dept Clin & Expt Med, I-95124 Catania, Italy
[3] Osped Umberto I, Med Oncol Unit, I-94100 Enna, Italy
[4] Univ Palermo, Med Oncol Unit, Policlin P Giaccone, I-90133 Palermo, Italy
[5] Univ Palermo, Dept Hlth Promot Mother & Child Care, Occupat Hlth Sect, Internal Med & Med Specialties, I-90133 Palermo, Italy
[6] Osped Cannizzaro, Gynecol Unit, I-95126 Catania, Italy
[7] Univ Enna Kore, Fac Med & Surg, Gynecol, I-94100 Enna, Italy
[8] Univ Palermo, Gynecol, I-90133 Palermo, Italy
[9] Osped Paterno Arezzo, Gynecol Unit, I-97100 Ragusa, Italy
[10] Univ Catania, Med Oncol, I-95124 Catania, Italy
[11] Ist Clin Humanitas, Med Oncol, I-95045 Catania, Italy
[12] Univ Enna Kore, Fac Med & Surg, Med Oncol, I-94100 Enna, Italy
[13] Osped Cannizzaro, Med Oncol Unit, I-95126 Catania, Italy
关键词
artificial intelligence; ChatGPT; ovarian carcinoma; guidelines; RECOMMENDATIONS; CONSENSUS;
D O I
10.3390/curroncol31050212
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Introduction: In recent years, generative Artificial Intelligence models, such as ChatGPT, have increasingly been utilized in healthcare. Despite acknowledging the high potential of AI models in terms of quick access to sources and formulating responses to a clinical question, the results obtained using these models still require validation through comparison with established clinical guidelines. This study compares the responses of the AI model to eight clinical questions with the Italian Association of Medical Oncology (AIOM) guidelines for ovarian cancer. Materials and Methods: The authors used the Delphi method to evaluate responses from ChatGPT and the AIOM guidelines. An expert panel of healthcare professionals assessed responses based on clarity, consistency, comprehensiveness, usability, and quality using a five-point Likert scale. The GRADE methodology assessed the evidence quality and the recommendations' strength. Results: A survey involving 14 physicians revealed that the AIOM guidelines consistently scored higher averages compared to the AI models, with a statistically significant difference. Post hoc tests showed that AIOM guidelines significantly differed from all AI models, with no significant difference among the AI models. Conclusions: While AI models can provide rapid responses, they must match established clinical guidelines regarding clarity, consistency, comprehensiveness, usability, and quality. These findings underscore the importance of relying on expert-developed guidelines in clinical decision-making and highlight potential areas for AI model improvement.
引用
收藏
页码:2796 / 2804
页数:9
相关论文
共 50 条
  • [21] Clinical practice guidelines for ovarian cancer: an update to the Korean Society of Gynecologic Oncology guidelines
    Lee, Banghyun
    Chang, Suk-Joon
    Kwon, Byung Su
    Son, Joo-Hyuk
    Lim, Myong Cheol
    Kim, Yun Hwan
    Lee, Shin-Wha
    Choi, Chel Hun
    Eoh, Kyung Jin
    Lee, Jung-Yun
    Lee, Yoo-Young
    Suh, Dong Hoon
    Kim, Yong Beom
    JOURNAL OF GYNECOLOGIC ONCOLOGY, 2025, 36 (01)
  • [22] Hereditary cancer: guidelines in clinical practice. Breast and ovarian cancer genetics
    Eccles, DM
    ANNALS OF ONCOLOGY, 2004, 15 : 133 - 138
  • [23] Quality and consistency of clinical practice guidelines for diagnosis and management of osteoarthritis of the hip and knee: a descriptive overview of published guidelines
    Misso, Marie L.
    Pitt, Veronica J.
    Jones, Kay M.
    Barnes, Hayley N.
    Piterman, Leon
    Green, Sally E.
    MEDICAL JOURNAL OF AUSTRALIA, 2008, 189 (07) : 394 - 399
  • [24] ChatGPT Responses to Clinical Questions in the Japan Atherosclerosis Society Guidelines for Prevention of Atherosclerotic Cardiovascular Disease 2022
    Hisamatsu, Takashi
    Fukuda, Mari
    Kinuta, Minako
    Kanda, Hideyuki
    JOURNAL OF ATHEROSCLEROSIS AND THROMBOSIS, 2024,
  • [25] Quality Assessment of Cancer Pain Clinical Practice Guidelines
    Zhang, Zhigang
    Cao, Xiao
    Wang, Qi
    Yang, Qiuyu
    Sun, Mingyao
    Ge, Long
    Tian, Jinhui
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [26] Quality assessment of cancer cachexia clinical practice guidelines
    Shen, Wang-Qin
    Yao, Liang
    Wang, Xiao-qin
    Hu, Yan
    Bian, Zhao-Xiang
    CANCER TREATMENT REVIEWS, 2018, 70 : 9 - 15
  • [27] Breast cancer care compared with clinical Guidelines: an observational study in France
    Lebeau, Marie
    Mathoulin-Pelissier, Simone
    Bellera, Carine
    Tunon-de-Lara, Christine
    Daban, Alain
    Lipinski, Francis
    Jaubert, Dominique
    Ingrand, Pierre
    Migeot, Virginie
    BMC PUBLIC HEALTH, 2011, 11
  • [28] Breast cancer care compared with clinical Guidelines: an observational study in France
    Marie Lebeau
    Simone Mathoulin-Pélissier
    Carine Bellera
    Christine Tunon-de-Lara
    Alain Daban
    Francis Lipinski
    Dominique Jaubert
    Pierre Ingrand
    Virginie Migeot
    BMC Public Health, 11
  • [29] Correspondence for "Quality of ChatGPT Responses to Questions Related to Pancreatic Cancer and Its Surgical Care"
    Mungmunpuntipantip, Rujittika
    Wiwanitkit, Viroj
    ANNALS OF SURGICAL ONCOLOGY, 2023, 30 (12) : 7780 - 7780
  • [30] Correspondence for “Quality of ChatGPT Responses to Questions Related to Pancreatic Cancer and Its Surgical Care”
    Rujittika Mungmunpuntipantip
    Viroj Wiwanitkit
    Annals of Surgical Oncology, 2023, 30 : 7780 - 7780