Performance of large language models in oral and maxillofacial surgery examinations

被引:2
|
作者
Quah, B. [1 ,2 ]
Yong, C. W. [1 ,2 ]
Lai, C. W. M. [1 ]
Islam, I. [1 ,2 ]
机构
[1] Natl Univ Singapore, Fac Dent, 9 Lower Kent Ridge Rd, Singapore 119085, Singapore
[2] Natl Univ Ctr Oral Hlth, Discipline Oral & Maxillofacial Surg, Singapore, Singapore
关键词
Artificial intelligence; Oral surgery; Dental education; Academic performance; Dentistry;
D O I
10.1016/j.ijom.2024.06.003
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
This study aimed to determine the accuracy of large language models (LLMs) in answering oral and maxillofacial surgery (OMS) multiple choice questions. A total of 259 questions from the university's question bank were answered by the LLMs (GPT-3.5, GPT-4, Llama 2, Gemini, and Copilot). The scores per category as well as the total score out of 259 were recorded and evaluated, with the passing score set at 50%. The mean overall score amongst all LLMs was 62.5%. GPT-4 performed the best (76.8%, 95% confidence interval (CI) 71.4-82.2%), followed by Copilot (72.6%, 95% CI 67.2-78.0%), GPT-3.5 (62.2%, 95% CI 56.4-68.0%), Gemini (58.7%, 95% CI 52.9-64.5%), and Llama 2 (42.5%, 95% CI 37.1-48.6%). There was a statistically significant difference between the scores of the five LLMs overall (chi(2) = 79.9, df = 4, P < 0.001) and within all categories except 'basic sciences' (P = 0.129), 'dentoalveolar and implant surgery' (P = 0.052), and 'oral medicine/pathology/radiology' (P = 0.801). The LLMs performed best in 'basic sciences' (68.9%) and poorest in 'pharmacology' (45.9%). The LLMs can be used as adjuncts in teaching, but should not be used for clinical decision-making until the models are further developed and validated.
引用
收藏
页码:881 / 886
页数:6
相关论文
共 50 条
  • [31] Authorship in Oral and Maxillofacial Surgery
    Pravesh S. Gadjradj
    Mamta Jalimsing
    Sandhia Jalimsing
    Istifari Voigt
    Journal of Maxillofacial and Oral Surgery, 2021, 20 : 330 - 335
  • [32] Dentistry and oral and maxillofacial surgery
    Peterson, LJ
    ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY AND ENDODONTICS, 1998, 86 (01): : 1 - 1
  • [33] ORAL AND MAXILLOFACIAL SURGERY - AN UPDATE
    GURALNICK, W
    CHUONG, R
    BRITISH DENTAL JOURNAL, 1984, 156 (08) : 281 - 285
  • [34] Robotics in oral and maxillofacial surgery
    Borumandi, F.
    Cascarini, L.
    ANNALS OF THE ROYAL COLLEGE OF SURGEONS OF ENGLAND, 2018, 100 (06) : 19 - 22
  • [35] Anesthesiology and oral and maxillofacial surgery
    Peterson, LJ
    ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY AND ENDODONTICS, 2001, 91 (02): : 131 - 132
  • [36] Volunteerism in Oral and Maxillofacial Surgery
    Aghaloo, Tara L.
    JOURNAL OF ORAL AND MAXILLOFACIAL SURGERY, 2022, 80 (02) : 203 - 204
  • [37] Sedation in oral and maxillofacial surgery
    Luepertz, M.
    Martini, M.
    Teschke, M.
    Heugel, P. C.
    Mathers, F.
    Reich, R.
    MKG-CHIRURG, 2011, 4 (04): : 314 - 321
  • [38] ORAL AND MAXILLOFACIAL SURGERY IN CHINA
    YING, L
    BRUCE, RA
    ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY AND ENDODONTICS, 1987, 63 (03): : 300 - 303
  • [39] THE FUTURE OF ORAL AND MAXILLOFACIAL SURGERY
    DAVIS, WM
    JOURNAL OF ORAL AND MAXILLOFACIAL SURGERY, 1989, 47 (05) : 547 - 547
  • [40] Reconstructive Oral and Maxillofacial Surgery
    Hoelzle, F.
    Mohr, C.
    Wolff, K.
    DEUTSCHES ARZTEBLATT INTERNATIONAL, 2008, 105 (47): : 815 - 822