Dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning

被引:0
|
作者
Xiankun Zhang
Yuhu Shang
Yimeng Ren
Kun Liang
机构
[1] Tianjin University of Science and Technology,College of Artificial Intelligence
来源
关键词
Sequence-wise recommendation; Domain-specific objectives; Actor–critic network; State representation; [inline-graphic not available: see fulltext]; [inline-graphic not available: see fulltext]; [inline-graphic not available: see fulltext]; [inline-graphic not available: see fulltext];
D O I
暂无
中图分类号
学科分类号
摘要
Sequence-wise recommendation, where recommend exercises to each student step by step, is one of the most exciting tasks in the field of intelligent tutoring systems (ITS). It is important to develop a personalized sequence-wise recommendation framework that immerses students in learning and helps them acquire as much necessary knowledge as possible, rather than merely focusing on providing non-mastered exercises, which is referred to optimize a single objective. However, due to the different knowledge levels of students and the large scale of exercise banks, it is difficult to generate a personalized exercise recommendation for each student. To fully exploit the multifaceted beneficial information collected from e-learning platforms, we design a dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning, i.e., DMoSwR-DRL, which automatically select the most suitable exercises for each student based on the well-designed domain-objective rewards. Within this framework, the interaction between students and exercises can be explicitly modeled by integrating the actor–critic network and the state representation component, which can greatly help the agent perform effective reinforcement learning. Specifically, we carefully design a state representation module with dynamic recurrent mechanism, which integrates concept information and exercise difficulty level, thus generating a continuous state representation of the student. Subsequently, a flexible reward function is designed to simultaneously optimize the four domain-specific objectives of difficulty, novelty, coverage, and diversity, providing the students with a trade-off sequence-wise recommendation. To set up the online evaluation, we test DMoSwR-DRL on a simulated environment which can model qualitative development of knowledge level and predicts their performance for a given exercise. Comprehensive experiments are conducted on four classical exercise-answer datasets, and the results show the effectiveness and advantages of DMoSwR-DRL in terms of recommendation quality.
引用
收藏
页码:1891 / 1911
页数:20
相关论文
共 50 条
  • [41] Multi-Objective Network Congestion Control via Constrained Reinforcement Learning
    Liu, Qiong
    Yang, Peng
    Lyu, Feng
    Zhang, Ning
    Yu, Li
    [J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [42] Flexible Traffic Signal Control via Multi-Objective Reinforcement Learning
    Saiki, Takumi
    Arai, Sachiyo
    [J]. IEEE ACCESS, 2023, 11 : 75875 - 75883
  • [43] MARRGM: Learning Framework for Multi-Agent Reinforcement Learning via Reinforcement Recommendation and Group Modification
    Wu, Peiliang
    Tian, Liqiang
    Zhang, Qian
    Mao, Bingyi
    Chen, Wenbai
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (06) : 5385 - 5392
  • [44] Multi-Agent Deep Reinforcement Learning for Resource Allocation in the Multi-Objective HetNet
    Nie, Hongrui
    Li, Shaosheng
    Liu, Yong
    [J]. IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 116 - 121
  • [45] Self-Learning Multi-Objective Service Coordination Using Deep Reinforcement Learning
    Schneider, Stefan
    Khalili, Ramin
    Manzoor, Adnan
    Qarawlus, Haydar
    Schellenberg, Rafael
    Karl, Holger
    Hecker, Artur
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (03): : 3829 - 3842
  • [46] A deep reinforcement learning algorithm framework for solving multi-objective traveling salesman problem based on feature transformation
    Zhao, Shijie
    Gu, Shenshen
    [J]. NEURAL NETWORKS, 2024, 176
  • [47] Meta-Learning for Multi-objective Reinforcement Learning
    Chen, Xi
    Ghadirzadeh, Ali
    Bjorkman, Marten
    Jensfelt, Patric
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 977 - 983
  • [48] Multi-Objective Dynamic Dispatch Optimisation using Multi-Agent Reinforcement Learning
    Mannion, Patrick
    Mason, Karl
    Devlin, Sam
    Duggan, Jim
    Howley, Enda
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1345 - 1346
  • [49] Multi-objective pruning of dense neural networks using deep reinforcement learning
    Hirsch, Lior
    Katz, Gilad
    [J]. INFORMATION SCIENCES, 2022, 610 : 381 - 400
  • [50] Investigating the multi-objective optimization of quality and efficiency using deep reinforcement learning
    Wang, Zhenhui
    Lu, Juan
    Chen, Chaoyi
    Ma, Junyan
    Liao, Xiaoping
    [J]. APPLIED INTELLIGENCE, 2022, 52 (11) : 12873 - 12887