A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning

被引:0
|
作者
Garcia, Francisco M. [1 ]
Thomas, Philip S. [1 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
关键词
Reinforcement Learning; Hierarchical RL; Exploration;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we consider the problem of how a reinforcement learning agent that is tasked with solving a sequence of reinforcement learning problems (Markov decision processes) can use knowledge acquired early in its lifetime to improve its ability to solve new problems. Specifically, we focus on the question of how the agent should explore when faced with a new environment. We show that the search for an optimal exploration strategy can be formulated as a reinforcement learning problem itself, albeit with a different timescale. We conclude with experiments that show the benefits of optimizing an exploration strategy using our proposed approach.
引用
收藏
页码:1976 / 1978
页数:3
相关论文
共 50 条
  • [1] A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
    Garcia, Francisco M.
    Thomas, Philip S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [2] A Meta-MDP Approach for Information Gathering Heterogeneous Multi-agent Systems
    Gandois, Alvin
    Mouaddib, Abdel-Illah
    Le Gloannec, Simon
    Alfalou, Ayman
    ROBOTICS, COMPUTER VISION AND INTELLIGENT SYSTEMS, ROBOVIS 2024, 2024, 2077 : 345 - 360
  • [3] Intrinsically Motivated Lifelong Exploration in Reinforcement Learning
    Bougie, Nicolas
    Ichise, Ryutaro
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 1357 : 109 - 120
  • [4] Using MDP characteristics to guide exploration in reinforcement learning
    Ratitch, B
    Precup, D
    MACHINE LEARNING: ECML 2003, 2003, 2837 : 313 - 324
  • [5] Model-based Lifelong Reinforcement Learning with Bayesian Exploration
    Fu, Haotian
    Yu, Shangqun
    Littman, Michael
    Konidaris, George
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] REACTIVE EXPLORATION TO COPE WITH NON-STATIONARITY IN LIFELONG REINFORCEMENT LEARNING
    Steinparz, Christian
    Schmied, Thomas
    Paischer, Fabian
    Dinu, Marius-Constantin
    Patil, Vihang
    Bitto-Nemling, Angela
    Eghbal-zadeh, Hamid
    Hochreiter, Sepp
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [7] Exploration With Task Information for Meta Reinforcement Learning
    Jiang, Peng
    Song, Shiji
    Huang, Gao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4033 - 4046
  • [8] Scalable lifelong reinforcement learning
    Zhan, Yusen
    Ammar, Haitham Bou
    Taylor, Matthew E.
    PATTERN RECOGNITION, 2017, 72 : 407 - 418
  • [9] Lifelong Inverse Reinforcement Learning
    Mendez, Jorge A.
    Shivkumar, Shashank
    Eaton, Eric
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [10] Dream to Adapt: Meta Reinforcement Learning by Latent Context Imagination and MDP Imagination
    Wen, Lu
    Tseng, Eric H.
    Peng, Huei
    Zhang, Songan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9701 - 9708