Assessing online chat-based artificial intelligence models for weight loss recommendation appropriateness and bias in the presence of guideline incongruence

被引:0
|
作者
Annor, Eugene [1 ]
Atarere, Joseph [2 ]
Ubah, Nneoma [3 ]
Jolaoye, Oladoyin [1 ]
Kunkle, Bryce [4 ]
Egbo, Olachi [5 ]
Martin, Daniel K. [6 ]
机构
[1] Univ Illinois, Dept Internal Med, Peoria, IL 60607 USA
[2] MedStar Hlth, Dept Med, Baltimore, MD USA
[3] Montefiore St Lukes Cornwall Hosp, Dept Internal Med, Newburgh, NY USA
[4] Georgetown Univ Hosp, Dept Med, Washington, DC USA
[5] Aurora Med Ctr, Dept Med, Oshkosh, WI USA
[6] Univ Illinois, Dept Gastroenterol & Hepatol, Coll Med, Peoria, IL USA
关键词
OVERWEIGHT; OBESITY;
D O I
10.1038/s41366-025-01717-5
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background and aimManaging obesity requires a comprehensive approach that involves therapeutic lifestyle changes, medications, or metabolic surgery. Many patients seek health information from online sources and artificial intelligence models like ChatGPT, Google Gemini, and Microsoft Copilot before consulting health professionals. This study aims to evaluate the appropriateness of the responses of Google Gemini and Microsoft Copilot to questions on pharmacologic and surgical management of obesity and assess for bias in their responses to either the ADA or AACE guidelines.MethodsTen questions were compiled into a set and posed separately to the free editions of Google Gemini and Microsoft Copilot. Recommendations for the questions were extracted from the ADA and the AACE websites, and the responses were graded by reviewers for appropriateness, completeness, and bias to any of the guidelines.ResultsAll responses from Microsoft Copilot and 8/10 (80%) responses from Google Gemini were appropriate. There were no inappropriate responses. Google Gemini refused to respond to two questions and insisted on consulting a physician. Microsoft Copilot (10/10; 100%) provided a higher proportion of complete responses than Google Gemini (5/10; 50%). Of the eight responses from Google Gemini, none were biased towards any of the guidelines, while two of the responses from Microsoft Copilot were biased.ConclusionThe study highlights the role of Microsoft Copilot and Google Gemini in weight loss management. The differences in their responses may be attributed to the variation in the quality and scope of their training data and design.
引用
收藏
页数:6
相关论文
共 20 条
  • [1] Assessing Online Chat-Based Artificial Intelligence Models for Weight Loss Recommendation Appropriateness and Bias in the Presence of Guideline Incongruence
    Annor, Eugene
    Atarere, Joseph O.
    Ubah, Nneoma R.
    Jolaoye, Oladoyin
    Kunkle, Bryce
    Hasan, Fariha
    Egbo, Olachi
    Martin, Daniel K.
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2024, 119 (10S): : S1496 - S1496
  • [2] Applicability of Online Chat-Based Artificial Intelligence Models to Colorectal Cancer Screening
    Atarere, Joseph
    Naqvi, Haider
    Haas, Christopher
    Adewunmi, Comfort
    Bandaru, Sumanth
    Allamneni, Rakesh
    Ugonabo, Onyinye
    Egbo, Olachi
    Umoren, Mfoniso
    Kanth, Priyanka
    DIGESTIVE DISEASES AND SCIENCES, 2024, 69 (03) : 791 - 797
  • [3] Applicability of Online Chat-Based Artificial Intelligence Models to Colorectal Cancer Screening
    Joseph Atarere
    Haider Naqvi
    Christopher Haas
    Comfort Adewunmi
    Sumanth Bandaru
    Rakesh Allamneni
    Onyinye Ugonabo
    Olachi Egbo
    Mfoniso Umoren
    Priyanka Kanth
    Digestive Diseases and Sciences, 2024, 69 : 791 - 797
  • [4] Appropriateness of Online Chat-Based Artificial Intelligence (ChatGPT) Answers to Common Questions on Inguinal Hernia Repair
    Lima, Diego Laurentino
    Nogueira, Raquel
    Chin, Ryan
    Claus, Christiano
    Malcher, Flavio
    Sreeramoju, Prashanth
    Cavazzola, Leandro Totti
    JOURNAL OF LAPAROENDOSCOPIC & ADVANCED SURGICAL TECHNIQUES, 2024, 34 (02): : 141 - 143
  • [5] Appropriateness of Cardiovascular Disease Prevention Recommendations Obtained From a Popular Online Chat-Based Artificial Intelligence Model
    Sarraju, Ashish
    Bruemmer, Dennis
    Van Iterson, Erik
    Cho, Leslie
    Rodriguez, Fatima
    Laffin, Luke
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2023, 329 (10): : 842 - 844
  • [6] Evaluating the appropriateness of skin cancer prevention recommendations obtained from an online chat-based artificial intelligence model
    Cai, Elise M.
    Pathmarajah, Pirunthan
    Daneshjou, Roxana
    Ko, Justin M.
    Chiou, Albert S.
    JAAD INTERNATIONAL, 2025, 18 : 145 - 147
  • [7] Association Between Price and Appropriateness of Cancer-Related Medical Information Generated by Chat-Based Artificial Intelligence Models
    Patel, Vishal R.
    Tsai, Alexander K.
    Blaes, Anne
    Parikh, Ravi B.
    Adamson, Adewole S.
    Gupta, Arjun
    JCO CLINICAL CANCER INFORMATICS, 2023, 7 : e2300041
  • [8] Association Between Price and Appropriateness of Cancer-Related Medical Information Generated by Chat-Based Artificial Intelligence Models
    Patel, Vishal R.
    Tsai, Alexander K.
    Blaes, Anne
    Parikh, Ravi B.
    Adamson, Adewole S.
    Gupta, Arjun
    JCO CLINICAL CANCER INFORMATICS, 2023, 7
  • [9] Racial Disparities in Knowledge of Cardiovascular Disease by Chat-Based Artificial Intelligence Models
    Eromosele, Oseiwe
    Ughagwu, Kelechukwu
    Johnson, Ayodeji
    Das, Herlyne
    Sobodu, Temitope
    Ouyang, David
    CIRCULATION, 2024, 150
  • [10] Evaluation of online chat-based artificial intelligence responses about inflammatory bowel disease and diet
    Naqvi, Haider A.
    Delungahawatta, Thilini
    Atarere, Joseph O.
    Bandaru, Sumanth Kumar
    Barrow, Jasmine B.
    Mattar, Mark C.
    EUROPEAN JOURNAL OF GASTROENTEROLOGY & HEPATOLOGY, 2024, 36 (09) : 1109 - 1112