Practical and ethical challenges of large language models in education: A systematic scoping review

被引:49
|
作者
Yan, Lixiang [1 ,2 ]
Sha, Lele [1 ]
Zhao, Linxuan [1 ]
Li, Yuheng [1 ]
Martinez-Maldonado, Roberto [1 ]
Chen, Guanliang [1 ]
Li, Xinyu [1 ]
Jin, Yueqiao [1 ]
Gasevic, Dragan [1 ]
机构
[1] Monash Univ, Fac Informat Technol, Ctr Learning Analyt Monash, Clayton, Vic, Australia
[2] Monash Univ, Fac Informat Technol, Ctr Learning Analyt Monash, 20 Exhibit Walk, Clayton, Vic 3800, Australia
基金
澳大利亚研究理事会;
关键词
artificial intelligence; BERT; ChatGPT; education; GPT-3; large language models; pre-trained language models; systematic scoping review; GENERATION; PRINCIPLES;
D O I
10.1111/bjet.13370
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (eg, question generation, feedback provision, and essay grading), there are concerns regarding the practicality and ethicality of these innovations. Such concerns may hinder future research and the adoption of LLMs-based innovations in authentic educational contexts. To address this, we conducted a systematic scoping review of 118 peer-reviewed papers published since 2017 to pinpoint the current state of research on using LLMs to automate and support educational tasks. The findings revealed 53 use cases for LLMs in automating education tasks, categorised into nine main categories: profiling/labelling, detection, grading, teaching support, prediction, knowledge representation, feedback, content generation, and recommendation. Additionally, we also identified several practical and ethical challenges, including low technological readiness, lack of replicability and transparency and insufficient privacy and beneficence considerations. The findings were summarised into three recommendations for future studies, including updating existing innovations with state-of-the-art models (eg, GPT-3/4), embracing the initiative of open-sourcing models/systems, and adopting a human-centred approach throughout the developmental process. As the intersection of AI and education is continuously evolving, the findings of this study can serve as an essential reference point for researchers, allowing them to leverage the strengths, learn from the limitations, and uncover potential research opportunities enabled by ChatGPT and other generative AI models.
引用
收藏
页码:90 / 112
页数:23
相关论文
共 50 条
  • [21] Large language models to process, analyze, and synthesize biomedical texts: a scoping review
    Doneva, Simona Emilova
    Qin, Sijing
    Sick, Beate
    Ellendorff, Tilia
    Goldman, Jean-Philippe
    Schneider, Gerold
    Ineichen, Benjamin Victor
    [J]. Discover Artificial Intelligence, 2024, 4 (01):
  • [22] Assessing the research landscape and clinical utility of large language models: a scoping review
    Ye-Jean Park
    Abhinav Pillai
    Jiawen Deng
    Eddie Guo
    Mehul Gupta
    Mike Paget
    Christopher Naugler
    [J]. BMC Medical Informatics and Decision Making, 24
  • [23] Large language models for code completion: A systematic literature review
    Husein, Rasha Ahmad
    Aburajouh, Hala
    Catal, Cagatay
    [J]. COMPUTER STANDARDS & INTERFACES, 2025, 92
  • [24] Large language models for generating medical examinations: systematic review
    Artsi, Yaara
    Sorin, Vera
    Konen, Eli
    Glicksberg, Benjamin S.
    Nadkarni, Girish
    Klang, Eyal
    [J]. BMC MEDICAL EDUCATION, 2024, 24 (01)
  • [25] Achieving Peak Performance for Large Language Models: A Systematic Review
    Rostam, Zhyar Rzgar K.
    Szenasi, Sandor
    Kertesz, Gabor
    [J]. IEEE ACCESS, 2024, 12 : 96017 - 96050
  • [26] A scoping review on flipped classroom approach in language education: challenges, implications and an interaction model
    Jiang, Michael Yi-chao
    Jong, Morris Siu-yung
    Lau, Wilfred Wing-fat
    Chai, Ching-sing
    Liu, Kelly Shu-xia
    Park, Moonyoung
    [J]. COMPUTER ASSISTED LANGUAGE LEARNING, 2022, 35 (5-6) : 1218 - 1249
  • [27] Clinical and Surgical Applications of Large Language Models: A Systematic Review
    Pressman, Sophia M.
    Borna, Sahar
    Gomez-Cabello, Cesar A.
    Haider, Syed Ali
    Haider, Clifton R.
    Forte, Antonio Jorge
    [J]. JOURNAL OF CLINICAL MEDICINE, 2024, 13 (11)
  • [28] Large Language Models for Software Engineering: A Systematic Literature Review
    Hou, Xinyi
    Zhao, Yanjie
    Liu, Yue
    Yang, Zhou
    Wang, Kailong
    Li, Li
    Luo, Xiapu
    Lo, David
    Grundy, John
    Wang, Haoyu
    [J]. ACM Transactions on Software Engineering and Methodology, 2024, 33 (08)
  • [29] Application of Large Language Models in Cybersecurity: A Systematic Literature Review
    Hasanov, Ismayil
    Virtanen, Seppo
    Hakkala, Antti
    Isoaho, Jouni
    [J]. IEEE Access, 2024, 12 : 176751 - 176778
  • [30] A Systematic Scoping Review of Ethical Issues in Mentoring in Surgery
    Lee, Fion Qian Hui
    Chua, Wen Jie
    Cheong, Clarissa Wei Shuen
    Tay, Kuang Teck
    Hian, Eugene Koh Yong
    Chin, Annelissa Mien Chew
    Toh, Ying Pin
    Mason, Stephen
    Krishna, Lalit Kumar Radha
    [J]. JOURNAL OF MEDICAL EDUCATION AND CURRICULAR DEVELOPMENT, 2019, 6