Learning Options in Multiobjective Reinforcement Learning

被引:0
|
作者
Bonini, Rodrigo Cesar [1 ]
da Silva, Felipe Leno [1 ]
Reali Costa, Anna Helena [1 ]
机构
[1] Univ Sao Paulo, Escola Politecn, Engn Eletr, Av Prof Luciano Gualberto 158, BR-05508970 Sao Paulo, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement Learning (RL) is a successful technique to train autonomous agents. However, the classical RL methods take a long time to learn how to solve tasks. Option-based solutions can be used to accelerate learning and transfer learned behaviors across tasks by encapsulating a partial policy into an action. However, the literature report only single-agent and single-objective option-based methods, but many RL tasks, especially real-world problems, are better described through multiple objectives. We here propose a method to learn options in Multiobjective Reinforcement Learning domains in order to accelerate learning and reuse knowledge across tasks. Our initial experiments in the Goldmine Domain show that our proposal learn useful options that accelerate learning in multiobjective domains. Our next steps are to use the learned options to transfer knowledge across tasks and evaluate this method with stochastic policies.
引用
收藏
页码:4907 / 4908
页数:2
相关论文
共 50 条
  • [21] Q-Managed: A new algorithm for a multiobjective reinforcement learning
    Freire de Oliveira, Thiago Henrique
    de Souza Medeiros, Luiz Paulo
    Doria Neto, Adriao Duarte
    Melo, Jorge Dantas
    [J]. SOFTWARE IMPACTS, 2021, 9
  • [22] Solving biobjective traveling thief problems with multiobjective reinforcement learning
    Santiyuda, Gemilang
    Wardoyo, Retantyo
    Pulungan, Reza
    [J]. APPLIED SOFT COMPUTING, 2024, 161
  • [23] Q-Managed: A new algorithm for a multiobjective reinforcement learning
    Freire de Oliveira, Thiago Henrique
    de Souza Medeiros, Luiz Paulo
    Doria Neto, Adriao Duarte
    Melo, Jorge Dantas
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [24] Multiobjective Battery Charging Strategy Based on Deep Reinforcement Learning
    Xiong, Zheng
    Luo, Biao
    Wang, Bing-Chuan
    Xu, Xiaodong
    Huang, Tingwen
    [J]. IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2024, 10 (03): : 6893 - 6903
  • [25] Successor Options: An Option Discovery Framework for Reinforcement Learning
    Ramesh, Rahul
    Tomar, Manan
    Ravindran, Balaraman
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3304 - 3310
  • [26] Building Portable Options: Skill Transfer in Reinforcement Learning
    Konidaris, George
    Barto, Andrew
    [J]. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 895 - 900
  • [27] Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks
    Vamplew, Peter
    Dazeley, Richard
    Barker, Ewan
    Kelarev, Andrei
    [J]. AI 2009: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5866 : 340 - 349
  • [28] The impact of environmental stochasticity on value-based multiobjective reinforcement learning
    Vamplew, Peter
    Foale, Cameron
    Dazeley, Richard
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (03): : 1783 - 1799
  • [29] Reinforcement Learning of Pareto-Optimal Multiobjective Policies Using Steering
    Vamplew, Peter
    Issabekov, Rustam
    Dazeley, Richard
    Foale, Cameron
    [J]. AI 2015: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2015, 9457 : 596 - 608
  • [30] Multiobjective Offloading Optimization in Fog Computing Using Deep Reinforcement Learning
    Department of Computer Engineering Science and Research Branch, Islamic Azad University, Tehran, Iran
    不详
    [J]. J. Comput. Netw. Commun, 2024,