Adaptive Learning Recommendation Strategy Based on Deep Q-learning

被引：16

作者：

Tan, Chunxi ^{[1
]}

Han, Ruijian ^{[1
]}

Ye, Rougang ^{[1
]}

Chen, Kani ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China

来源：

APPLIED PSYCHOLOGICAL MEASUREMENT | 2020年 / 44卷 / 04期

关键词：

adaptive learning; Markov decision process; recommendation system; reinforcement learning;

D O I：

10.1177/0146621619858674

中图分类号：

O1 [数学]; C [社会科学总论];

学科分类号：

03 ; 0303 ; 0701 ; 070101 ;

摘要：

Personalized recommendation system has been widely adopted in E-learning field that is adaptive to each learner's own learning pace. With full utilization of learning behavior data, psychometric assessment models keep track of the learner's proficiency on knowledge points, and then, the well-designed recommendation strategy selects a sequence of actions to meet the objective of maximizing learner's learning efficiency. This article proposes a novel adaptive recommendation strategy under the framework of reinforcement learning. The proposed strategy is realized by the deep Q-learning algorithms, which are the techniques that contributed to the success of AlphaGo Zero to achieve the super-human level in playing the game of go. The proposed algorithm incorporates an early stopping to account for the possibility that learners may choose to stop learning. It can properly deal with missing data and can handle more individual-specific features for better recommendations. The recommendation strategy guides individual learners with efficient learning paths that vary from person to person. The authors showcase concrete examples with numeric analysis of substantive learning scenarios to further demonstrate the power of the proposed method.

引用

页码：251 / 266

页数：16

共 50 条

[1] An adaptive deep Q-learning strategy for handwritten digit recognition
Qiao, Junfei
Wang, Gongming
Li, Wenjing
Chen, Min
[J]. NEURAL NETWORKS, 2018, 107 : 61 - 71
[2] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning
Tan, Fuxiao
Yan, Pengfei
Guan, Xinping
[J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
[3] Scalable Deep Q-Learning for Session-Based Slate Recommendation
Roy, Aayush Singha
D'Amico, Edoardo
Tragos, Elias
Lawlor, Aonghus
Hurley, Neil
[J]. PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 877 - 882
[4] Cooperative strategy based on adaptive Q-learning for robot soccer systems
Hwang, KS
Tan, SW
Chen, CC
[J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (04) : 569 - 576
[5] A Novel Behavioral Strategy for RoboCode Platform Based on Deep Q-Learning
Kayakoku, Hakan
Guzel, Mehmet Serdar
Bostanci, Erkan
Medeni, Ihsan Tolga
Mishra, Deepti
[J]. COMPLEXITY, 2021, 2021
[6] Trading Strategy of the Cryptocurrency Market Based on Deep Q-Learning Agents
Huang, Chester S. J.
Su, Yu-Sheng
[J]. APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
[7] Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning
Ohnishi, Shota
Uchibe, Eiji
Yamaguchi, Yotaro
Nakanishi, Kosuke
Yasui, Yuji
Ishii, Shin
[J]. FRONTIERS IN NEUROROBOTICS, 2019, 13
[8] Q-Learning Based Adaptive Frequency Hopping Strategy Under Probabilistic Jamming
Wang, Yutao
Niu, Yingtao
Chen, Jianzhong
Fang, Fang
Han, Chen
[J]. 2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
[9] Q-learning for adaptive traffic signal control based on delay minimization strategy
Lu Shoufeng
Liu Ximin
Dai Shiqiang
[J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 687 - +
[10] Adaptive job shop scheduling strategy based on weighted Q-learning algorithm
Wang, Yu-Fang
[J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2020, 31 (02) : 417 - 432

← 1 2 3 4 5 →