Asynchronous Methods for Model-Based Reinforcement Learning

Cited by: 0
Authors
Zhang, Yunzhi [1]
Clavera, Ignasi [1]
Tsai, Boren [1]
Abbeel, Pieter [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
Keywords
Reinforcement Learning; Model-Based; Asynchronous Learning
DOI
Not available
CLC number
TP39 [Computer Applications]
Subject classification codes
081203; 0835
Abstract
Significant progress has been made in model-based reinforcement learning. State-of-the-art algorithms now match the asymptotic performance of model-free methods while being significantly more data efficient. However, this success has come at a price: state-of-the-art model-based methods require substantial computation interleaved with data collection, resulting in run times of days even when the agent's interaction with the environment amounts to only hours or minutes. For the goal of learning in real time on real robots, these state-of-the-art model-based algorithms therefore remain impractical. In this work, we propose an asynchronous framework for model-based reinforcement learning that reduces the run time of these algorithms to just the data collection time. We evaluate our asynchronous framework on a range of standard MuJoCo benchmarks and on three real-world robotic manipulation tasks. We show that asynchronous learning not only speeds up learning in wall-clock time through parallelization, but also further reduces the sample complexity of model-based approaches by improving exploration and by preventing the policy from overfitting to the deficiencies of the learned dynamics models.
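The framework described in the abstract decouples the three stages of a model-based RL loop so that model fitting and policy optimization run in parallel with environment interaction rather than alternating with it. The sketch below illustrates that architecture; it is a hypothetical minimal example, not the authors' implementation — the names collect_data, learn_model, learn_policy, and the queue-based hand-offs are our own assumptions, with stand-in placeholders where real rollouts, model fitting, and policy updates would go.

```python
# Minimal sketch of an asynchronous model-based RL loop (hypothetical; not the
# paper's actual implementation). Three workers run in parallel and communicate
# only through shared queues: a collector rolls out the latest policy, a model
# learner fits a dynamics model to all data so far, and a policy learner
# improves the policy against the newest model.
import queue
import threading
import time

data_queue = queue.Queue()      # trajectories flow: collector -> model learner
model_queue = queue.Queue()     # fitted models flow: model learner -> policy learner
policy_lock = threading.Lock()
latest_policy = {"version": 0}  # stand-in for shared policy parameters
stop = threading.Event()

def collect_data():
    # Roll out the most recent policy; never block on model or policy training.
    while not stop.is_set():
        with policy_lock:
            version = latest_policy["version"]
        trajectory = {"policy_version": version, "transitions": 100}  # stand-in rollout
        data_queue.put(trajectory)
        time.sleep(0.01)  # stand-in for real environment interaction time

def learn_model():
    # Refit the dynamics model on all trajectories gathered so far.
    dataset = []
    while not stop.is_set():
        try:
            dataset.append(data_queue.get(timeout=0.1))
        except queue.Empty:
            continue
        model_queue.put({"trained_on": len(dataset)})  # stand-in for a fitted model

def learn_policy():
    # Improve the policy against the newest available learned model.
    while not stop.is_set():
        try:
            model = model_queue.get(timeout=0.1)
        except queue.Empty:
            continue
        with policy_lock:
            latest_policy["version"] += 1  # stand-in for a policy update using `model`

workers = [threading.Thread(target=f) for f in (collect_data, learn_model, learn_policy)]
for w in workers:
    w.start()
time.sleep(1.0)  # let collection, model learning, and policy learning overlap
stop.set()
for w in workers:
    w.join()
print("policy updates completed during collection:", latest_policy["version"])
```

Because the collector never blocks on training, total run time is dominated by environment interaction — the property the abstract highlights as bringing run time down to just the data collection time.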
Pages: 10