Model-based Lifelong Reinforcement Learning with Bayesian Exploration

Cited by: 0

Authors:

Fu, Haotian [1]

Yu, Shangqun [1]

Littman, Michael [1]

Konidaris, George [1]

Affiliations:

[1] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022

Keywords:

ENTROPY;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose a model-based lifelong reinforcement-learning approach that estimates a hierarchical Bayesian posterior distilling the common structure shared across different tasks. The learned posterior combined with a sample-based Bayesian exploration procedure increases the sample efficiency of learning across a family of related tasks. We first derive an analysis of the relationship between the sample complexity and the initialization quality of the posterior in the finite MDP setting. We next scale the approach to continuous-state domains by introducing a Variational Bayesian Lifelong Reinforcement Learning algorithm that can be combined with recent model-based deep RL methods, and that exhibits backward transfer. Experimental results on several challenging domains show that our algorithms achieve both better forward and backward transfer performance than state-of-the-art lifelong RL methods.

Pages: 14

50 items in total

[31] Incremental model-based reinforcement learning with model constraint
Yang, Zhiyou
Fu, Mingsheng
Qu, Hong
Li, Fan
Shi, Shuqing
Hu, Wang
NEURAL NETWORKS, 2025, 185
[32] Objective Mismatch in Model-based Reinforcement Learning
Lambert, Nathan
Amos, Brandon
Yadan, Omry
Calandra, Roberto
LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 761 - 770
[33] Model-based reinforcement learning with dimension reduction
Tangkaratt, Voot
Morimoto, Jun
Sugiyama, Masashi
NEURAL NETWORKS, 2016, 84 : 1 - 16
[34] On Effective Scheduling of Model-based Reinforcement Learning
Lai, Hang
Shen, Jian
Zhang, Weinan
Huang, Yimin
Zhang, Xing
Tang, Ruiming
Yu, Yong
Li, Zhenguo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[35] Transferring Instances for Model-Based Reinforcement Learning
Taylor, Matthew E.
Jong, Nicholas K.
Stone, Peter
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 488 - 505
[36] MOReL: Model-Based Offline Reinforcement Learning
Kidambi, Rahul
Rajeswaran, Aravind
Netrapalli, Praneeth
Joachims, Thorsten
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[37] Modeling Survival in model-based Reinforcement Learning
Moazami, Saeed
Doerschuk, Peggy
2020 SECOND INTERNATIONAL CONFERENCE ON TRANSDISCIPLINARY AI (TRANSAI 2020), 2020 : 17 - 24
[38] Model-Based Reinforcement Learning With Isolated Imaginations
Pan, Minting
Zhu, Xiangming
Zheng, Yitao
Wang, Yunbo
Yang, Xiaokang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 2788 - 2803
[39] Model-based average reward reinforcement learning
Tadepalli, P
Ok, D
ARTIFICIAL INTELLIGENCE, 1998, 100 (1-2) : 177 - 224
[40] Model-Based Reinforcement Learning in Robotics: A Survey
Sun S.
Lan X.
Zhang H.
Zheng N.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (01) : 1 - 16
