Dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning

被引:5
|
作者
Zhang, Xiankun [1 ]
Shang, Yuhu [1 ]
Ren, Yimeng [1 ]
Liang, Kun [1 ]
机构
[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin 300457, Peoples R China
基金
中国国家自然科学基金;
关键词
Sequence-wise recommendation; Domain-specific objectives; Actor-critic network; State representation; NEURAL-NETWORKS; DIFFICULTY; MODEL;
D O I
10.1007/s40747-022-00871-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequence-wise recommendation, where recommend exercises to each student step by step, is one of the most exciting tasks in the field of intelligent tutoring systems (ITS). It is important to develop a personalized sequence-wise recommendation framework that immerses students in learning and helps them acquire as much necessary knowledge as possible, rather than merely focusing on providing non-mastered exercises, which is referred to optimize a single objective. However, due to the different knowledge levels of students and the large scale of exercise banks, it is difficult to generate a personalized exercise recommendation for each student. To fully exploit the multifaceted beneficial information collected from e-learning platforms, we design a dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning, i.e., DMoSwR-DRL, which automatically select the most suitable exercises for each student based on the well-designed domain-objective rewards. Within this framework, the interaction between students and exercises can be explicitly modeled by integrating the actor-critic network and the state representation component, which can greatly help the agent perform effective reinforcement learning. Specifically, we carefully design a state representation module with dynamic recurrent mechanism, which integrates concept information and exercise difficulty level, thus generating a continuous state representation of the student. Subsequently, a flexible reward function is designed to simultaneously optimize the four domain-specific objectives of difficulty, novelty, coverage, and diversity, providing the students with a trade-off sequence-wise recommendation. To set up the online evaluation, we test DMoSwR-DRL on a simulated environment which can model qualitative development of knowledge level and predicts their performance for a given exercise. Comprehensive experiments are conducted on four classical exercise-answer datasets, and the results show the effectiveness and advantages of DMoSwR-DRL in terms of recommendation quality.
引用
收藏
页码:1891 / 1911
页数:21
相关论文
共 50 条
  • [21] A Multi-objective Optimization Planning Framework for Active Distribution System Via Reinforcement Learning
    Li H.
    Wang C.
    Tian H.
    Ren Z.
    Zhao E.
    Xu L.
    Distributed Generation and Alternative Energy Journal, 2023, 38 (06): : 1741 - 1762
  • [22] Multi-Objective Reinforcement Learning Based on Decomposition: A Taxonomy and Framework
    Felten, Florian
    Talbi, El-Ghazali
    Danoy, Gregoire
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 79 : 679 - 723
  • [23] Multi-objective path planning based on deep reinforcement learning
    Xu, Jian
    Huang, Fei
    Cui, Yunfei
    Du, Xue
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3273 - 3279
  • [24] Modular Multi-Objective Deep Reinforcement Learning with Decision Values
    Tajmajer, Tomasz
    PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 85 - 93
  • [25] Dynamic Multi-Objective Optimization Framework With Interactive Evolution for Sequential Recommendation
    Zhou, Wei
    Liu, Yong
    Li, Min
    Wang, Yu
    Shen, Zhiqi
    Feng, Liang
    Zhu, Zexuan
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (04): : 1228 - 1241
  • [26] Deep reinforcement learning for multi-objective game strategy selection
    Jiang, Ruhao
    Deng, Yanchen
    Chen, Yingying
    Luo, He
    An, Bo
    COMPUTERS & OPERATIONS RESEARCH, 2024, 168
  • [27] Collaborative Ground-Space Communications via Evolutionary Multi-Objective Deep Reinforcement Learning
    Li, Jiahui
    Sun, Geng
    Wu, Qingqing
    Niyato, Dusit
    Kang, Jiawen
    Jamalipour, Abbas
    Leung, Victor C. M.
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2024, 42 (12) : 3395 - 3411
  • [28] Joint Optimization of Microservice Deployment and Routing in Edge via Multi-Objective Deep Reinforcement Learning
    Hu, Menglan
    Wang, Hao
    Xu, Xiaohui
    He, Jianwen
    Hu, Yi
    Deng, Tianping
    Peng, Kai
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (06): : 6364 - 6381
  • [29] Multi-condition multi-objective optimization using deep reinforcement learning
    Kim, Sejin
    Kim, Innyoung
    You, Donghyun
    JOURNAL OF COMPUTATIONAL PHYSICS, 2022, 462
  • [30] Multi-objective ω-Regular Reinforcement Learning
    Hahn, Ernst Moritz
    Perez, Mateo
    Schewe, Sven
    Somenzi, Fabio
    Trivedi, Ashutosh
    Wojtczak, Dominik
    FORMAL ASPECTS OF COMPUTING, 2023, 35 (02)