Transforming Assessment: The Impacts and Implications of Large Language Models and Generative AI

被引:7
|
作者
Hao, Jiangang [1 ]
von Davier, Alina A. [2 ]
Yaneva, Victoria [3 ]
Lottridge, Susan [4 ]
von Davier, Matthias [5 ]
Harris, Deborah J. [6 ]
机构
[1] Educ Testing Serv, Princeton, NJ 08541 USA
[2] Duolingo Inc, Pittsburgh, PA USA
[3] Natl Board Med Examiners, Philadelphia, PA USA
[4] Cambium Assessment Inc, Washington, DC USA
[5] Boston Coll, Chestnut Hill, MA 02467 USA
[6] Univ Iowa, Iowa City, IA USA
关键词
assessment; generative AI; LLMs; SUPPORT; TIME;
D O I
10.1111/emip.12602
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely, these innovations raise significant concerns regarding validity, reliability, transparency, fairness, equity, and test security, necessitating careful thinking when applying them in assessments. In this article, we discuss the impacts and implications of LLMs and generative AI on critical dimensions of assessment with example use cases and call for a community effort to equip assessment professionals with the needed AI literacy to harness the potential effectively.
引用
收藏
页码:16 / 29
页数:14
相关论文
共 50 条
  • [41] Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models
    Abdullahi, Tassallah
    Singh, Ritambhara
    Eickhoff, Carsten
    JMIR MEDICAL EDUCATION, 2024, 10
  • [42] Machine learning techniques for IoT security: Current research and future vision with generative AI and large language models
    Alwahedi F.
    Aldhaheri A.
    Ferrag M.A.
    Battah A.
    Tihanyi N.
    Internet of Things and Cyber-Physical Systems, 2024, 4 : 167 - 185
  • [43] AI ENTERS PUBLIC DISCOURSE: A HABERMASIAN ASSESSMENT OF THE MORAL STATUS OF LARGE LANGUAGE MODELS
    Monti, Paolo
    ETICA & POLITICA, 2024, 26 (01): : 61 - 80
  • [44] Diversity in Deep Generative Models and Generative AI
    Turinici, Gabriel
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT II, 2024, 14506 : 84 - 93
  • [45] Diffusion Models in Generative AI
    Sazara, Cem
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9705 - 9706
  • [46] Fake news detection: comparative evaluation of BERT-like models and large language models with generative AI-annotated data
    Shaina Raza
    Drai Paulen-Patterson
    Chen Ding
    Knowledge and Information Systems, 2025, 67 (4) : 3267 - 3292
  • [47] The promise of AI Large Language Models for Epilepsy care
    Landais, Raphaelle
    Sultan, Mustafa
    Thomas, Rhys H.
    EPILEPSY & BEHAVIOR, 2024, 154
  • [48] LAraBench: Benchmarking Arabic AI with Large Language Models
    Qatar Computing Research Institute, HBKU, Qatar
    不详
    arXiv, 1600,
  • [50] Large language models make AI usable for everyone!
    Bause, Fabian
    Konstruktion, 2024, 76 (04): : 3 - 5