Adaptive Learning Recommendation Strategy Based on Deep Q-learning

被引:16
|
作者
Tan, Chunxi [1 ]
Han, Ruijian [1 ]
Ye, Rougang [1 ]
Chen, Kani [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China
关键词
adaptive learning; Markov decision process; recommendation system; reinforcement learning;
D O I
10.1177/0146621619858674
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Personalized recommendation system has been widely adopted in E-learning field that is adaptive to each learner's own learning pace. With full utilization of learning behavior data, psychometric assessment models keep track of the learner's proficiency on knowledge points, and then, the well-designed recommendation strategy selects a sequence of actions to meet the objective of maximizing learner's learning efficiency. This article proposes a novel adaptive recommendation strategy under the framework of reinforcement learning. The proposed strategy is realized by the deep Q-learning algorithms, which are the techniques that contributed to the success of AlphaGo Zero to achieve the super-human level in playing the game of go. The proposed algorithm incorporates an early stopping to account for the possibility that learners may choose to stop learning. It can properly deal with missing data and can handle more individual-specific features for better recommendations. The recommendation strategy guides individual learners with efficient learning paths that vary from person to person. The authors showcase concrete examples with numeric analysis of substantive learning scenarios to further demonstrate the power of the proposed method.
引用
收藏
页码:251 / 266
页数:16
相关论文
共 50 条
  • [21] Q-learning and LSTM based deep active learning strategy for malware defense in industrial IoT applications
    Khowaja, Sunder Ali
    Khuwaja, Parus
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (10) : 14637 - 14663
  • [22] Convergence Improvement of Q-learning Based on a Personalized Recommendation System
    Chiang, Chia-Ling
    Cheng, Ming-Yang
    Ye, Ting-Yu
    Chen, Ya-Ling
    Huang, Pin-Hsuan
    [J]. 2019 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2019,
  • [23] Adaptive Traffic Signal Control with Deep Recurrent Q-learning
    Zeng, Jinghong
    Hu, Jianming
    Zhang, Yi
    [J]. 2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1215 - 1220
  • [24] Comparison of Deep Q-Learning, Q-Learning and SARSA Reinforced Learning for Robot Local Navigation
    Anas, Hafiq
    Ong, Wee Hong
    Malik, Owais Ahmed
    [J]. ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 443 - 454
  • [25] A novel deep learning driven robot path planning strategy: Q-learning approach
    Hu, Junli
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2023, 71 (03) : 237 - 243
  • [26] Adaptive PID controller based on Q-learning algorithm
    Shi, Qian
    Lam, Hak-Keung
    Xiao, Bo
    Tsai, Shun-Hung
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2018, 3 (04) : 235 - 244
  • [27] QLAR: A Q-Learning based Adaptive Routing for MANETs
    Serhani, Abdellatif
    Naja, Najib
    Jamali, Abdellah
    [J]. 2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [28] Q-learning for adaptive, load based routing.
    Nowe, A
    Steenhaut, K
    Fakir, M
    Verbeeck, K
    [J]. 1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 3965 - 3970
  • [29] Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
    Zhou, Jian
    Gong, Xiaotian
    Sun, Lijuan
    Xie, Yong
    Yan, Xiaoyong
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [30] AFSndn: A novel adaptive forwarding strategy in named data networking based on Q-learning
    Zhang, Mingchuan
    Wang, Xin
    Liu, Tingting
    Zhu, Junlong
    Wu, Qingtao
    [J]. PEER-TO-PEER NETWORKING AND APPLICATIONS, 2020, 13 (04) : 1176 - 1184