Deep Reinforcement Learning for Adaptive Learning Systems

Cited by: 5

Authors:

Li, Xiao [1]

Xu, Hanchen [2]

Zhang, Jinming [1]

Chang, Hua-hua [3]

Affiliations:

[1] Univ Illinois, Dept Educ Psychol, 236A Educ Bldg,1310 S Sixth St, Champaign, IL 61820 USA

[2] Univ Illinois, Dept Elect & Comp Engn, 306 N Wright St MC 702, Urbana, IL 61801 USA

[3] Purdue Univ, Dept Educ Studies, Steven C Beering Hall Liberal Arts & Educ, W Lafayette, IN 47907 USA

Source:

JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS | 2023 / Vol. 48 / No. 02

Keywords:

adaptive learning system; transition model estimator; Markov decision process; deep reinforcement learning; deep Q-learning; neural networks; model free; HIDDEN MARKOV MODEL; COGNITIVE DIAGNOSIS; ABILITY;

DOI:

10.3102/10769986221129847

Chinese Library Classification (CLC) number:

G40 [Education];

Discipline classification code:

040101 ; 120403 ;

Abstract:

The adaptive learning problem concerns how to create an individualized learning plan (also referred to as a learning policy) that chooses the most appropriate learning materials based on a learner's latent traits. In this article, we study an important yet less-addressed adaptive learning problem: one that assumes continuous latent traits. Specifically, we formulate the adaptive learning problem as a Markov decision process. We assume latent traits to be continuous with an unknown transition model and apply a model-free deep reinforcement learning algorithm, the deep Q-learning algorithm, that can effectively find the optimal learning policy from data on learners' learning process without knowing the actual transition model of the learners' continuous latent traits. To efficiently utilize available data, we also develop a transition model estimator that emulates the learner's learning process using neural networks. The transition model estimator can be used in the deep Q-learning algorithm so that it can more efficiently discover the optimal learning policy for a learner. Numerical simulation studies verify that the proposed algorithm is very efficient in finding a good learning policy. In particular, with the aid of a transition model estimator, it can find the optimal learning policy after training using a small number of learners.


Pages: 220-243

Page count: 24

