A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning

被引:0
|
作者
Garcia, Francisco M. [1 ]
Thomas, Philip S. [1 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
关键词
Reinforcement Learning; Hierarchical RL; Exploration;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we consider the problem of how a reinforcement learning agent that is tasked with solving a sequence of reinforcement learning problems (Markov decision processes) can use knowledge acquired early in its lifetime to improve its ability to solve new problems. Specifically, we focus on the question of how the agent should explore when faced with a new environment. We show that the search for an optimal exploration strategy can be formulated as a reinforcement learning problem itself, albeit with a different timescale. We conclude with experiments that show the benefits of optimizing an exploration strategy using our proposed approach.
引用
收藏
页码:1976 / 1978
页数:3
相关论文
共 50 条
  • [21] Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
    Zintgraf, Luisa
    Feng, Leo
    Lu, Cong
    Igl, Maximilian
    Hartikainen, Kristian
    Hofmann, Katja
    Whiteson, Shimon
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [22] Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices
    Liu, Evan Zheran
    Raghunathan, Aditi
    Liang, Percy
    Finn, Chelsea
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [23] Exploration in Interactive Personalized Music Recommendation: A Reinforcement Learning Approach
    Wang, Xinxi
    Wang, Yi
    Hsu, David
    Wang, Ye
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2014, 11 (01)
  • [24] HLifeRL: A hierarchical lifelong reinforcement learning framework
    Ding, Fan
    Zhu, Fei
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (07) : 4312 - 4321
  • [25] Variational Policy Chaining for Lifelong Reinforcement Learning
    Doyle, Christopher
    Gueriau, Maxime
    Dusparic, Ivana
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1546 - 1550
  • [26] A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud
    Xue, Siqiao
    Qu, Chao
    Shi, Xiaoming
    Liao, Cong
    Zhu, Shiyi
    Tan, Xiaoyu
    Ma, Lintao
    Wang, Shiyu
    Wang, Shijun
    Hu, Yun
    Lei, Lei
    Zhen, Yangfei
    Li, Jianguo
    Zhang, James
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4290 - 4299
  • [27] A Meta-Reinforcement Learning Approach to Process Control
    McClement, Daniel G.
    Lawrence, Nathan P.
    Loewen, Philip D.
    Forbes, Michael G.
    Backstrom, Johan U.
    Gopaluni, R. Bhushan
    IFAC PAPERSONLINE, 2021, 54 (03): : 685 - 692
  • [28] Model primitives for hierarchical lifelong reinforcement learning
    Bohan Wu
    Jayesh K. Gupta
    Mykel Kochenderfer
    Autonomous Agents and Multi-Agent Systems, 2020, 34
  • [29] Model Primitive Hierarchical Lifelong Reinforcement Learning
    Wu, Bohan
    Gupta, Jayesh K.
    Kochenderfer, Mykel J.
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 34 - 42
  • [30] Model primitives for hierarchical lifelong reinforcement learning
    Wu, Bohan
    Gupta, Jayesh K.
    Kochenderfer, Mykel
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)