A Comparative Analysis of AI Models in Complex Medical Decision-Making Scenarios: Evaluating ChatGPT, Claude AI, Bard, and Perplexity

Cited by: 2

Authors:

Uppalapati, Vamsi Krishna [1]

Nag, Deb Sanjay [1]

Affiliations:

[1] Tata Main Hosp, Dept Anesthesiol, Jamshedpur, India

Source:

CUREUS JOURNAL OF MEDICAL SCIENCE | 2024 / Vol. 16 / Issue 01

Keywords:

future medicine; ai efficacy; ai comparison; healthcare ai; medical decision-making;

DOI:

10.7759/cureus.52485

Chinese Library Classification (CLC) number:

R5 [Internal Medicine];

Discipline classification code:

1002 ; 100201 ;

Abstract:

This study evaluates the performance of four artificial intelligence (AI) language models (ChatGPT, Claude AI, Google Bard, and Perplexity AI) across four key metrics: accuracy, relevance, clarity, and completeness. We used a mix of research methods, gathering opinions across 14 scenarios, to help ensure that the findings were accurate and dependable. Claude AI performed better than the other models because it gave complete responses, with average scores of 3.64 for relevance and 3.43 for completeness. ChatGPT performed consistently well, whereas Google Bard's responses were unclear and varied greatly, making them difficult to understand and inconsistent. These results provide important information about where AI language models perform well or poorly when offering medical suggestions. They can guide better use of these models and inform improvements to future AI-based technology. The study shows that AI capabilities match complex medical scenarios.


Pages: 6
