Can a large language model create acceptable dental board-style examination questions? A cross-sectional prospective study

Authors
Kim, Hak-Sun [1]
Kim, Gyu-Tae [2]
Affiliations
[1] Kyung Hee Univ, Dept Oral & Maxillofacial Radiol, Dent Hosp, Seoul, South Korea
[2] Kyung Hee Univ, Coll Dent, Dept Oral & Maxillofacial Surg, 26 Kyungheedae Ro, Seoul 02447, South Korea
Keywords
Dental education; Examination questions; Professional competence; Artificial intelligence; Natural language processing
DOI
10.1016/j.jds.2024.08.020
Chinese Library Classification number
R78 [Stomatology]
Discipline classification code
1003
Abstract
Background/purpose: Numerous studies have shown that large language models (LLMs) can score above the passing grade on various board examinations. Therefore, this study aimed to evaluate national dental board-style examination questions created by an LLM versus those created by human experts using item analysis. Materials and methods: This study was conducted in June 2024 and included senior dental students (n = 30) who participated voluntarily. An LLM, ChatGPT 4o, was used to generate 44 national dental board-style examination questions based on textbook content. Twenty questions for the LLM set were randomly selected after removing false questions. Two experts created another set of 20 questions based on the same content and in the same style as the LLM. Participating students simultaneously answered a total of 40 questions divided into two sets using Google Forms in the classroom. The responses were analyzed to assess difficulty, discrimination index, and distractor efficiency. Statistical comparisons were performed using the Wilcoxon signed rank test or linear-by-linear association test, with a confidence level of 95%. Results: The response rate was 100%. The median difficulty indices of the LLM and human sets were 55.00% and 50.00%, respectively, both within the "excellent" range. The median discrimination indices were 0.29 for the LLM set and 0.14 for the human set. Both sets had a median distractor efficiency of 80.00%. The differences in all criteria were not statistically significant (P > 0.050). Conclusion: The LLM can create national board-style examination questions of equivalent quality to those created by human experts. © 2025 Association for Dental Sciences of the Republic of China. Publishing services by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Pages: 895-900
Number of pages: 6