Performance of large language models in oral and maxillofacial surgery examinations

被引:2
|
作者
Quah, B. [1 ,2 ]
Yong, C. W. [1 ,2 ]
Lai, C. W. M. [1 ]
Islam, I. [1 ,2 ]
机构
[1] Natl Univ Singapore, Fac Dent, 9 Lower Kent Ridge Rd, Singapore 119085, Singapore
[2] Natl Univ Ctr Oral Hlth, Discipline Oral & Maxillofacial Surg, Singapore, Singapore
关键词
Artificial intelligence; Oral surgery; Dental education; Academic performance; Dentistry;
D O I
10.1016/j.ijom.2024.06.003
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
This study aimed to determine the accuracy of large language models (LLMs) in answering oral and maxillofacial surgery (OMS) multiple choice questions. A total of 259 questions from the university's question bank were answered by the LLMs (GPT-3.5, GPT-4, Llama 2, Gemini, and Copilot). The scores per category as well as the total score out of 259 were recorded and evaluated, with the passing score set at 50%. The mean overall score amongst all LLMs was 62.5%. GPT-4 performed the best (76.8%, 95% confidence interval (CI) 71.4-82.2%), followed by Copilot (72.6%, 95% CI 67.2-78.0%), GPT-3.5 (62.2%, 95% CI 56.4-68.0%), Gemini (58.7%, 95% CI 52.9-64.5%), and Llama 2 (42.5%, 95% CI 37.1-48.6%). There was a statistically significant difference between the scores of the five LLMs overall (chi(2) = 79.9, df = 4, P < 0.001) and within all categories except 'basic sciences' (P = 0.129), 'dentoalveolar and implant surgery' (P = 0.052), and 'oral medicine/pathology/radiology' (P = 0.801). The LLMs performed best in 'basic sciences' (68.9%) and poorest in 'pharmacology' (45.9%). The LLMs can be used as adjuncts in teaching, but should not be used for clinical decision-making until the models are further developed and validated.
引用
收藏
页码:881 / 886
页数:6
相关论文
共 50 条
  • [41] Authorship in Oral and Maxillofacial Surgery
    Gadjradj, Pravesh S.
    Jalimsing, Mamta
    Jalimsing, Sandhia
    Voigt, Istifari
    JOURNAL OF MAXILLOFACIAL & ORAL SURGERY, 2021, 20 (02): : 330 - 335
  • [42] THE FUTURE OF ORAL AND MAXILLOFACIAL SURGERY
    GURALNICK, W
    BRITISH DENTAL JOURNAL, 1984, 157 (02) : 51 - 54
  • [43] INTERNATIONALISM IN ORAL AND MAXILLOFACIAL SURGERY
    BANKS, P
    BRITISH JOURNAL OF ORAL & MAXILLOFACIAL SURGERY, 1995, 33 (05): : 273 - 275
  • [44] Globalization of oral and maxillofacial surgery
    Stoelinga, PJ
    Haers, PE
    INTERNATIONAL JOURNAL OF ORAL AND MAXILLOFACIAL SURGERY, 1999, 28 (01) : 1 - 1
  • [45] ARTERIOGRAPHY IN ORAL AND MAXILLOFACIAL SURGERY
    TATELMAN, M
    JOURNAL OF ORAL SURGERY, 1976, 34 (09): : 795 - 804
  • [46] THE FUTURE OF ORAL AND MAXILLOFACIAL SURGERY
    MCCALLUM, CA
    JOURNAL OF ORAL AND MAXILLOFACIAL SURGERY, 1989, 47 (06) : 610 - 615
  • [47] Piezosurgery in oral and maxillofacial surgery
    Pavlikova, G.
    Foltan, R.
    Horka, M.
    Hanzelka, T.
    Borunska, H.
    Sedy, J.
    INTERNATIONAL JOURNAL OF ORAL AND MAXILLOFACIAL SURGERY, 2011, 40 (05) : 451 - 457
  • [48] Atlas of Oral & Maxillofacial Surgery
    Emma Brown
    British Dental Journal, 2015, 219 (10) : 474 - 474
  • [49] Academic oral and maxillofacial surgery
    Precious, David S.
    ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY, 2014, 118 (04): : 385 - 386
  • [50] DIVERSITY IN ORAL AND MAXILLOFACIAL SURGERY
    Lawrence, Martha M.
    JOURNAL OF ORAL AND MAXILLOFACIAL SURGERY, 2011, 69 (02) : 573 - 573