Deep Reinforcement Learning for Adaptive Learning Systems

被引:5
|
作者
Li, Xiao [1 ]
Xu, Hanchen [2 ]
Zhang, Jinming [1 ]
Chang, Hua-hua [3 ]
机构
[1] Univ Illinois, Dept Educ Psychol, 236A Educ Bldg,1310 S Sixth St, Champaign, IL 61820 USA
[2] Univ Illinois, Dept Elect & Comp Engn, 306 N Wright St MC 702, Urbana, IL 61801 USA
[3] Purdue Univ, Dept Educ Studies, Steven C Beering Hall Liberal Arts & Educ, W Lafayette, IN 47907 USA
关键词
adaptive learning system; transition model estimator; Markov decision process; deep reinforcement learning; deep Q-learning; neural networks; model free; HIDDEN MARKOV MODEL; COGNITIVE DIAGNOSIS; ABILITY;
D O I
10.3102/10769986221129847
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The adaptive learning problem concerns how to create an individualized learning plan (also referred to as a learning policy) that chooses the most appropriate learning materials based on a learner's latent traits. In this article, we study an important yet less-addressed adaptive learning problem-one that assumes continuous latent traits. Specifically, we formulate the adaptive learning problem as a Markov decision process. We assume latent traits to be continuous with an unknown transition model and apply a model-free deep reinforcement learning algorithm-the deep Q-learning algorithm-that can effectively find the optimal learning policy from data on learners' learning process without knowing the actual transition model of the learners' continuous latent traits. To efficiently utilize available data, we also develop a transition model estimator that emulates the learner's learning process using neural networks. The transition model estimator can be used in the deep Q-learning algorithm so that it can more efficiently discover the optimal learning policy for a learner. Numerical simulation studies verify that the proposed algorithm is very efficient in finding a good learning policy. Especially with the aid of a transition model estimator, it can find the optimal learning policy after training using a small number of learners.
引用
收藏
页码:220 / 243
页数:24
相关论文
共 50 条
  • [1] Simulation and deep reinforcement learning for adaptive dispatching in semiconductor manufacturing systems
    Sakr, Ahmed H.
    Aboelhassan, Ayman
    Yacout, Soumaya
    Bassetto, Samuel
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2023, 34 (03) : 1311 - 1324
  • [2] Simulation and deep reinforcement learning for adaptive dispatching in semiconductor manufacturing systems
    Ahmed H. Sakr
    Ayman Aboelhassan
    Soumaya Yacout
    Samuel Bassetto
    [J]. Journal of Intelligent Manufacturing, 2023, 34 : 1311 - 1324
  • [3] Adaptive Client Selection in Resource Constrained Federated Learning Systems: A Deep Reinforcement Learning Approach
    Zhang, Hangjia
    Xie, Zhijun
    Zarei, Roozbeh
    Wu, Tao
    Chen, Kewei
    [J]. IEEE ACCESS, 2021, 9 : 98423 - 98432
  • [4] Deep reinforcement learning for adaptive mesh refinement
    Foucart, Corbin
    Charous, Aaron
    Lermusiaux, Pierre F. J.
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2023, 491
  • [5] Adaptive Slope Locomotion with Deep Reinforcement Learning
    Jones, William
    Blum, Tamir
    Yoshida, Kazuya
    [J]. 2020 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2020, : 546 - 550
  • [6] Deep reinforcement learning for swarm systems
    Hüttenrauch, Maximilian
    Oic, Adrian
    Neumann, Gerhard
    [J]. Journal of Machine Learning Research, 2019, 20
  • [7] Adaptive beamforming based on the deep reinforcement learning
    Hao, Chuanhui
    Sun, Xubao
    Liu, Yidong
    [J]. ICNSC 2022 - Proceedings of 2022 IEEE International Conference on Networking, Sensing and Control: Autonomous Intelligent Systems, 2022,
  • [8] Deep Reinforcement Learning for Swarm Systems
    Huettenrauch, Maximilian
    Sosic, Adrian
    Neumann, Gerhard
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [9] Adaptive deep Q learning network with reinforcement learning for crime prediction
    J. Vimala Devi
    K. S. Kavitha
    [J]. Evolutionary Intelligence, 2023, 16 : 685 - 696
  • [10] Adaptive deep Q learning network with reinforcement learning for crime prediction
    Devi, J. Vimala
    Kavitha, K. S.
    [J]. EVOLUTIONARY INTELLIGENCE, 2023, 16 (02) : 685 - 696