Dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning

Cited by: 4
Authors
Zhang, Xiankun [1]
Shang, Yuhu [1]
Ren, Yimeng [1]
Liang, Kun [1]
Affiliations
[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin 300457, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Sequence-wise recommendation; Domain-specific objectives; Actor-critic network; State representation; NEURAL-NETWORKS; DIFFICULTY; MODEL;
DOI
10.1007/s40747-022-00871-x
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sequence-wise recommendation, which recommends exercises to each student step by step, is one of the most exciting tasks in the field of intelligent tutoring systems (ITS). It is important to develop a personalized sequence-wise recommendation framework that immerses students in learning and helps them acquire as much necessary knowledge as possible, rather than merely providing non-mastered exercises, which amounts to optimizing a single objective. However, due to the different knowledge levels of students and the large scale of exercise banks, it is difficult to generate a personalized exercise recommendation for each student. To fully exploit the multifaceted information collected from e-learning platforms, we design a dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning, i.e., DMoSwR-DRL, which automatically selects the most suitable exercises for each student based on well-designed domain-objective rewards. Within this framework, the interaction between students and exercises is explicitly modeled by integrating the actor-critic network and the state representation component, which helps the agent perform effective reinforcement learning. Specifically, we design a state representation module with a dynamic recurrent mechanism, which integrates concept information and exercise difficulty level, thus generating a continuous state representation of the student. Subsequently, a flexible reward function is designed to simultaneously optimize the four domain-specific objectives of difficulty, novelty, coverage, and diversity, providing students with a sequence-wise recommendation that trades off these objectives. To set up the online evaluation, we test DMoSwR-DRL in a simulated environment that models the qualitative development of a student's knowledge level and predicts the student's performance on a given exercise. Comprehensive experiments are conducted on four classical exercise-answer datasets, and the results show the effectiveness and advantages of DMoSwR-DRL in terms of recommendation quality.
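The abstract names four objectives (difficulty, novelty, coverage, diversity) but does not give their formulas. As an illustration only, the sketch below shows one way a scalar reward could combine four such scores with a weighted sum. The score definitions, the `Exercise` type, the function names, the `target_difficulty` parameter, and the weights are all assumptions made for this example, not the definitions used in DMoSwR-DRL.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Exercise:
    concepts: frozenset  # knowledge concepts the exercise touches (hypothetical field)
    difficulty: float    # assumed to be normalized to [0, 1]


def objective_scores(candidate, history, all_concepts, target_difficulty):
    """Return four illustrative scores in [0, 1]; these definitions are assumptions."""
    # Concepts already practiced in the recommended sequence so far.
    seen = set().union(*(e.concepts for e in history))

    # Difficulty: closeness of the exercise difficulty to a target level.
    difficulty = 1.0 - abs(candidate.difficulty - target_difficulty)

    # Novelty: share of the candidate's concepts not practiced before.
    novelty = (
        len(candidate.concepts - seen) / len(candidate.concepts)
        if candidate.concepts else 0.0
    )

    # Coverage: share of all concepts covered once the candidate is added.
    coverage = (
        len((seen | candidate.concepts) & all_concepts) / len(all_concepts)
        if all_concepts else 0.0
    )

    # Diversity: one minus Jaccard similarity with the previous exercise.
    if history:
        last = history[-1].concepts
        union = last | candidate.concepts
        jaccard = len(last & candidate.concepts) / len(union) if union else 1.0
        diversity = 1.0 - jaccard
    else:
        diversity = 1.0

    return {
        "difficulty": difficulty,
        "novelty": novelty,
        "coverage": coverage,
        "diversity": diversity,
    }


def scalar_reward(scores, weights):
    """Weighted sum of the objective scores (one simple scalarization choice)."""
    return sum(weights[name] * value for name, value in scores.items())


if __name__ == "__main__":
    all_concepts = {"a", "b", "c", "d"}
    history = [Exercise(frozenset({"a", "b"}), 0.4)]
    candidate = Exercise(frozenset({"b", "c"}), 0.5)
    scores = objective_scores(candidate, history, all_concepts, target_difficulty=0.5)
    weights = {name: 0.25 for name in scores}  # equal weights, chosen arbitrarily
    print(scores, scalar_reward(scores, weights))
```

A weighted sum is only one possible scalarization; changing the weights shifts the trade-off among the four objectives, which is the kind of flexibility the abstract attributes to its reward function.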
Pages: 1891-1911
Number of pages: 21
Related papers
50 in total
  • [31] Multi-objective deep inverse reinforcement learning for weight estimation of objectives
    Naoya Takayama
    Sachiyo Arai
    [J]. Artificial Life and Robotics, 2022, 27 : 594 - 602
  • [32] Multi-objective deep inverse reinforcement learning for weight estimation of objectives
    Takayama, Naoya
    Arai, Sachiyo
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2022, 27 (03) : 594 - 602
  • [33] Examining multi-objective deep reinforcement learning frameworks for molecular design
    Al-Jumaily, Aws
    Mukaidaisi, Muhetaer
    Vu, Andrew
    Tchagang, Alain
    Li, Yifeng
    [J]. BIOSYSTEMS, 2023, 232
  • [34] Multi-Objective Deep Reinforcement Learning for Crowd Route Guidance Optimization
    Nishida, Ryo
    Tanigaki, Yuki
    Onishi, Masaki
    Hashimoto, Koichi
    [J]. TRANSPORTATION RESEARCH RECORD, 2024, 2678 (05) : 617 - 633
  • [35] A Stable Deep Reinforcement Learning Framework for Recommendation
    Liu, Ruochen
    Jiang, Dawei
    Zhang, Xilong
    [J]. IEEE INTELLIGENT SYSTEMS, 2022, 37 (03) : 76 - 84
  • [36] Multi-objective reinforcement learning framework for dynamic flexible job shop scheduling problem with uncertain events
    Wang, Hao
    Cheng, Junfu
    Liu, Chang
    Zhang, Yuanyuan
    Hu, Shunfang
    Chen, Liangyin
    [J]. APPLIED SOFT COMPUTING, 2022, 131
  • [37] Multi-objective deep reinforcement learning for crowd-aware robot navigation with dynamic human preference
    Guangran Cheng
    Yuanda Wang
    Lu Dong
    Wenzhe Cai
    Changyin Sun
    [J]. Neural Computing and Applications, 2023, 35 : 16247 - 16265
  • [38] A Data-Driven Reinforcement Learning Based Multi-Objective Route Recommendation System
    Sarker, Ankur
    Shen, Haiying
    Kowsari, Kamran
    [J]. 2020 IEEE 17TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2020), 2020 : 103 - 111
  • [39] Multi-objective deep reinforcement learning for crowd-aware robot navigation with dynamic human preference
    Cheng, Guangran
    Wang, Yuanda
    Dong, Lu
    Cai, Wenzhe
    Sun, Changyin
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (22) : 16247 - 16265
  • [40] Personalized robotic control via constrained multi-objective reinforcement learning
    He, Xiangkun
    Hu, Zhongxu
    Yang, Haohan
    Lv, Chen
    [J]. NEUROCOMPUTING, 2024, 565