Dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning

被引:5
|
作者
Zhang, Xiankun [1 ]
Shang, Yuhu [1 ]
Ren, Yimeng [1 ]
Liang, Kun [1 ]
机构
[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin 300457, Peoples R China
基金
中国国家自然科学基金;
关键词
Sequence-wise recommendation; Domain-specific objectives; Actor-critic network; State representation; NEURAL-NETWORKS; DIFFICULTY; MODEL;
D O I
10.1007/s40747-022-00871-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequence-wise recommendation, where recommend exercises to each student step by step, is one of the most exciting tasks in the field of intelligent tutoring systems (ITS). It is important to develop a personalized sequence-wise recommendation framework that immerses students in learning and helps them acquire as much necessary knowledge as possible, rather than merely focusing on providing non-mastered exercises, which is referred to optimize a single objective. However, due to the different knowledge levels of students and the large scale of exercise banks, it is difficult to generate a personalized exercise recommendation for each student. To fully exploit the multifaceted beneficial information collected from e-learning platforms, we design a dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning, i.e., DMoSwR-DRL, which automatically select the most suitable exercises for each student based on the well-designed domain-objective rewards. Within this framework, the interaction between students and exercises can be explicitly modeled by integrating the actor-critic network and the state representation component, which can greatly help the agent perform effective reinforcement learning. Specifically, we carefully design a state representation module with dynamic recurrent mechanism, which integrates concept information and exercise difficulty level, thus generating a continuous state representation of the student. Subsequently, a flexible reward function is designed to simultaneously optimize the four domain-specific objectives of difficulty, novelty, coverage, and diversity, providing the students with a trade-off sequence-wise recommendation. To set up the online evaluation, we test DMoSwR-DRL on a simulated environment which can model qualitative development of knowledge level and predicts their performance for a given exercise. Comprehensive experiments are conducted on four classical exercise-answer datasets, and the results show the effectiveness and advantages of DMoSwR-DRL in terms of recommendation quality.
引用
收藏
页码:1891 / 1911
页数:21
相关论文
共 50 条
  • [1] Dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning
    Xiankun Zhang
    Yuhu Shang
    Yimeng Ren
    Kun Liang
    Complex & Intelligent Systems, 2023, 9 : 1891 - 1911
  • [2] A multi-objective deep reinforcement learning framework
    Thanh Thi Nguyen
    Ngoc Duy Nguyen
    Vamplew, Peter
    Nahavandi, Saeid
    Dazeley, Richard
    Lim, Chee Peng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 96
  • [3] Dynamic Weights in Multi-Objective Deep Reinforcement Learning
    Abels, Axel
    Roijers, Diederik M.
    Lenaerts, Tom
    Nowe, Ann
    Steckelmacher, Denis
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [4] Dynamic scheduling for multi-objective flexible job shop via deep reinforcement learning
    Yuan, Erdong
    Wang, Liejun
    Song, Shiji
    Cheng, Shuli
    Fan, Wei
    APPLIED SOFT COMPUTING, 2025, 171
  • [5] A Two-Stage Multi-Objective Deep Reinforcement Learning Framework
    Chen, Diqi
    Wang, Yizhou
    Gao, Wen
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1063 - 1070
  • [6] Multi-objective reinforcement learning approach for trip recommendation
    Chen, Lei
    Zhu, Guixiang
    Liang, Weichao
    Wang, Youquan
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 226
  • [7] A Constrained Multi-Objective Reinforcement Learning Framework
    Huang, Sandy H.
    Abdolmaleki, Abbas
    Vezzani, Giulia
    Brakel, Philemon
    Mankowitz, Daniel J.
    Neunert, Michael
    Bohez, Steven
    Tassa, Yuval
    Heess, Nicolas
    Riedmiller, Martin
    Hadsell, Raia
    CONFERENCE ON ROBOT LEARNING, VOL 164, 2021, 164 : 883 - 893
  • [8] Multi-Objective Dynamic Path Planning with Multi-Agent Deep Reinforcement Learning
    Tao, Mengxue
    Li, Qiang
    Yu, Junxi
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2025, 13 (01)
  • [9] Multi-objective Dynamic AGV Scheduling Method Based on Deep Reinforcement Learning
    Wang, Gaoshang
    Zou, Yuanyuan
    Li, Shaoyuan
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1450 - 1455
  • [10] Dynamic multi-objective scheduling for flexible job shop by deep reinforcement learning
    Luo, Shu
    Zhang, Linxuan
    Fan, Yushun
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 159