Adaptive Learning Recommendation Strategy Based on Deep Q-learning

被引:16
|
作者
Tan, Chunxi [1 ]
Han, Ruijian [1 ]
Ye, Rougang [1 ]
Chen, Kani [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China
关键词
adaptive learning; Markov decision process; recommendation system; reinforcement learning;
D O I
10.1177/0146621619858674
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Personalized recommendation system has been widely adopted in E-learning field that is adaptive to each learner's own learning pace. With full utilization of learning behavior data, psychometric assessment models keep track of the learner's proficiency on knowledge points, and then, the well-designed recommendation strategy selects a sequence of actions to meet the objective of maximizing learner's learning efficiency. This article proposes a novel adaptive recommendation strategy under the framework of reinforcement learning. The proposed strategy is realized by the deep Q-learning algorithms, which are the techniques that contributed to the success of AlphaGo Zero to achieve the super-human level in playing the game of go. The proposed algorithm incorporates an early stopping to account for the possibility that learners may choose to stop learning. It can properly deal with missing data and can handle more individual-specific features for better recommendations. The recommendation strategy guides individual learners with efficient learning paths that vary from person to person. The authors showcase concrete examples with numeric analysis of substantive learning scenarios to further demonstrate the power of the proposed method.
引用
收藏
页码:251 / 266
页数:16
相关论文
共 50 条
  • [1] An adaptive deep Q-learning strategy for handwritten digit recognition
    Qiao, Junfei
    Wang, Gongming
    Li, Wenjing
    Chen, Min
    [J]. NEURAL NETWORKS, 2018, 107 : 61 - 71
  • [2] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning
    Tan, Fuxiao
    Yan, Pengfei
    Guan, Xinping
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
  • [3] Scalable Deep Q-Learning for Session-Based Slate Recommendation
    Roy, Aayush Singha
    D'Amico, Edoardo
    Tragos, Elias
    Lawlor, Aonghus
    Hurley, Neil
    [J]. PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 877 - 882
  • [4] Cooperative strategy based on adaptive Q-learning for robot soccer systems
    Hwang, KS
    Tan, SW
    Chen, CC
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (04) : 569 - 576
  • [5] A Novel Behavioral Strategy for RoboCode Platform Based on Deep Q-Learning
    Kayakoku, Hakan
    Guzel, Mehmet Serdar
    Bostanci, Erkan
    Medeni, Ihsan Tolga
    Mishra, Deepti
    [J]. COMPLEXITY, 2021, 2021
  • [6] Trading Strategy of the Cryptocurrency Market Based on Deep Q-Learning Agents
    Huang, Chester S. J.
    Su, Yu-Sheng
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [7] Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning
    Ohnishi, Shota
    Uchibe, Eiji
    Yamaguchi, Yotaro
    Nakanishi, Kosuke
    Yasui, Yuji
    Ishii, Shin
    [J]. FRONTIERS IN NEUROROBOTICS, 2019, 13
  • [8] Q-Learning Based Adaptive Frequency Hopping Strategy Under Probabilistic Jamming
    Wang, Yutao
    Niu, Yingtao
    Chen, Jianzhong
    Fang, Fang
    Han, Chen
    [J]. 2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [9] Q-learning for adaptive traffic signal control based on delay minimization strategy
    Lu Shoufeng
    Liu Ximin
    Dai Shiqiang
    [J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 687 - +
  • [10] Adaptive job shop scheduling strategy based on weighted Q-learning algorithm
    Wang, Yu-Fang
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2020, 31 (02) : 417 - 432