Performance of large language model artificial intelligence on dermatology board exam questions

被引:3
|
作者
Park, Lily [1 ,2 ]
Ehlert, Brittany [3 ]
Susla, Lyudmyla [4 ]
Lum, Zachary C. [2 ]
Lee, Patrick K. [5 ]
机构
[1] Larkin Community Hosp, Dept Dermatol, South Miami, FL 33143 USA
[2] Nova Southeastern Univ, Sch Osteopath Med, Davie, FL 33328 USA
[3] Ohio Univ, Heritage Coll Osteopath Med, Cleveland, OH USA
[4] New York Inst Technol, Coll Osteopath Med, Old Westbury, NY USA
[5] Univ Calif Irvine, Dept Dermatol, Irvine, CA USA
关键词
D O I
10.1093/ced/llad355
中图分类号
R75 [皮肤病学与性病学];
学科分类号
100206 ;
摘要
Our study attempted to assess the performance of two large language models: Open AI's ChatGPT and Google's Bard on dermatology board exam-style questions. Based on our study, Google Bard outperformed ChatGPT and achieved the highest scores in general dermatology among dermatology disciplines.
引用
收藏
页码:733 / 734
页数:2
相关论文
共 50 条
  • [31] Authors' Reply: Assessing the utility of ChatGPT as an artificial intelligence-based large language model for information to answer questions on myopia
    Biswas, Sayantan
    Logan, Nicola S.
    Davies, Leon N.
    Sheppard, Amy L.
    Wolffsohn, James S.
    OPHTHALMIC AND PHYSIOLOGICAL OPTICS, 2024, 44 (01) : 233 - 234
  • [32] Would Uro_Chat, a Newly Developed Generative Artificial Intelligence Large Language Model, Have Successfully Passed the In-Service Assessment Questions of the European Board of Urology in 2022?
    May, Matthias
    Koerner-Riffard, Katharina
    Marszalek, Martin
    Eredics, Klaus
    EUROPEAN UROLOGY ONCOLOGY, 2024, 7 (01): : 155 - 156
  • [33] Artificial intelligence, large language models, and you
    Marquardt, Charles
    JOURNAL OF VASCULAR SURGERY CASES INNOVATIONS AND TECHNIQUES, 2023, 9 (04):
  • [34] Performance of Large Language Models in Rheumatology Board-Like Questions: Accuracy, Quality, and Safety
    Gouyonnet, Jaime Flores
    Gonzalez-Trevino, Mariana
    Crowson, Cynthia
    Lennon, Ryan
    Sanchez-Rodriguez, Alain
    Figueroa-Parra, Gabriel
    Joerns, Elena
    Kimbrough, Bradly
    Cuellar-Gutierrez, Maria
    Navarro-Mendoza, Erika
    Duarte-Garcia, Ali
    ARTHRITIS & RHEUMATOLOGY, 2024, 76 : 3552 - 3554
  • [35] Performance of Publicly Available Large Language Models on Internal Medicine Board-style Questions
    Tarabanis, Constantine
    Zahid, Sohail
    Mamalis, Marios
    Zhang, Kevin
    Kalampokis, Evangelos
    Jankelson, Lior
    PLOS DIGITAL HEALTH, 2024, 3 (09):
  • [36] Comparing the Performance of Popular Large Language Models on the National Board of Medical Examiners Sample Questions
    Abbas, Ali
    Rehman, Mahad S.
    Rehman, Syed S.
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (03)
  • [37] How reliable is the artificial intelligence product large language model ChatGPT in orthodontics?
    Demirsoy, Kevser Kurt
    Buyuk, Suleyman Kutalmis
    Bicer, Tayyip
    ANGLE ORTHODONTIST, 2024, 94 (06) : 602 - 607
  • [38] Designing an artificial intelligence-enabled large language model for financial decisions
    Saxena, Anshul
    Rishi, Bikramjit
    MANAGEMENT DECISION, 2025,
  • [39] Performance of artificial intelligence in answering cardiovascular textual questions
    Skalidis, Ioannis
    Cagnina, Aurelien
    Fournier, Stephane
    EUROPEAN HEART JOURNAL - DIGITAL HEALTH, 2023, 4 (05): : 364 - 365
  • [40] ChatGPT: performance of artificial intelligence in the dermatology specialty certificate examination
    Jabour, Thais Barros Felippe
    Ribeiro Junior, Jose Paulo
    Fernandes, Alexandre Chaves
    Honorato, Cecilia Mirelle Almeida
    Queiroz, Maria do Carmo Araujo Palmeira
    ANAIS BRASILEIROS DE DERMATOLOGIA, 2024, 99 (02) : 277 - 279