Adaptive Learning Recommendation Strategy Based on Deep Q-learning

被引：16

作者：

Tan, Chunxi ^{[1
]}

Han, Ruijian ^{[1
]}

Ye, Rougang ^{[1
]}

Chen, Kani ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China

来源：

APPLIED PSYCHOLOGICAL MEASUREMENT | 2020年 / 44卷 / 04期

关键词：

adaptive learning; Markov decision process; recommendation system; reinforcement learning;

D O I：

10.1177/0146621619858674

中图分类号：

O1 [数学]; C [社会科学总论];

学科分类号：

03 ; 0303 ; 0701 ; 070101 ;

摘要：

Personalized recommendation system has been widely adopted in E-learning field that is adaptive to each learner's own learning pace. With full utilization of learning behavior data, psychometric assessment models keep track of the learner's proficiency on knowledge points, and then, the well-designed recommendation strategy selects a sequence of actions to meet the objective of maximizing learner's learning efficiency. This article proposes a novel adaptive recommendation strategy under the framework of reinforcement learning. The proposed strategy is realized by the deep Q-learning algorithms, which are the techniques that contributed to the success of AlphaGo Zero to achieve the super-human level in playing the game of go. The proposed algorithm incorporates an early stopping to account for the possibility that learners may choose to stop learning. It can properly deal with missing data and can handle more individual-specific features for better recommendations. The recommendation strategy guides individual learners with efficient learning paths that vary from person to person. The authors showcase concrete examples with numeric analysis of substantive learning scenarios to further demonstrate the power of the proposed method.

引用

页码：251 / 266

页数：16

共 50 条

[21] Q-learning and LSTM based deep active learning strategy for malware defense in industrial IoT applications
Khowaja, Sunder Ali
Khuwaja, Parus
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (10) : 14637 - 14663
[22] Convergence Improvement of Q-learning Based on a Personalized Recommendation System
Chiang, Chia-Ling
Cheng, Ming-Yang
Ye, Ting-Yu
Chen, Ya-Ling
Huang, Pin-Hsuan
[J]. 2019 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2019,
[23] Adaptive Traffic Signal Control with Deep Recurrent Q-learning
Zeng, Jinghong
Hu, Jianming
Zhang, Yi
[J]. 2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1215 - 1220
[24] Comparison of Deep Q-Learning, Q-Learning and SARSA Reinforced Learning for Robot Local Navigation
Anas, Hafiq
Ong, Wee Hong
Malik, Owais Ahmed
[J]. ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 443 - 454
[25] A novel deep learning driven robot path planning strategy: Q-learning approach
Hu, Junli
[J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2023, 71 (03) : 237 - 243
[26] Adaptive PID controller based on Q-learning algorithm
Shi, Qian
Lam, Hak-Keung
Xiao, Bo
Tsai, Shun-Hung
[J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2018, 3 (04) : 235 - 244
[27] QLAR: A Q-Learning based Adaptive Routing for MANETs
Serhani, Abdellatif
Naja, Najib
Jamali, Abdellah
[J]. 2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
[28] Q-learning for adaptive, load based routing.
Nowe, A
Steenhaut, K
Fakir, M
Verbeeck, K
[J]. 1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 3965 - 3970
[29] Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
Zhou, Jian
Gong, Xiaotian
Sun, Lijuan
Xie, Yong
Yan, Xiaoyong
[J]. SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
[30] AFSndn: A novel adaptive forwarding strategy in named data networking based on Q-learning
Zhang, Mingchuan
Wang, Xin
Liu, Tingting
Zhu, Junlong
Wu, Qingtao
[J]. PEER-TO-PEER NETWORKING AND APPLICATIONS, 2020, 13 (04) : 1176 - 1184

← 1 2 3 4 5 →