Performance of large language model artificial intelligence on dermatology board exam questions

Cited by: 3

Authors:

Park, Lily [1,2]

Ehlert, Brittany [3]

Susla, Lyudmyla [4]

Lum, Zachary C. [2]

Lee, Patrick K. [5]

Affiliations:

[1] Larkin Community Hosp, Dept Dermatol, South Miami, FL 33143 USA

[2] Nova Southeastern Univ, Sch Osteopath Med, Davie, FL 33328 USA

[3] Ohio Univ, Heritage Coll Osteopath Med, Cleveland, OH USA

[4] New York Inst Technol, Coll Osteopath Med, Old Westbury, NY USA

[5] Univ Calif Irvine, Dept Dermatol, Irvine, CA USA

Source:

CLINICAL AND EXPERIMENTAL DERMATOLOGY | 2023 / Vol. 49 / Issue 07

DOI:

10.1093/ced/llad355

Chinese Library Classification (CLC) number:

R75 [Dermatology and Venereology];

Discipline classification code:

100206;

Abstract:

Our study attempted to assess the performance of two large language models, OpenAI's ChatGPT and Google's Bard, on dermatology board exam-style questions. In our study, Google Bard outperformed ChatGPT and achieved the highest scores in general dermatology among dermatology disciplines.

Pages: 733-734

Page count: 2
