High-accuracy model-based reinforcement learning, a survey

Cited by: 0
Authors
Aske Plaat
Walter Kosters
Mike Preuss
Affiliations
[1] Leiden University, Computer Science
Keywords
Model-based reinforcement learning; Latent models; Deep learning; Machine learning; Planning
DOI: not available
Abstract
Deep reinforcement learning has shown remarkable success in the past few years. Highly complex sequential decision making problems from game playing and robotics have been solved with deep model-free methods. Unfortunately, the sample complexity of model-free methods is often high. Model-based reinforcement learning, in contrast, can reduce the number of environment samples by learning an explicit internal model of the environment dynamics. However, achieving good model accuracy in high dimensional problems is challenging. In recent years, a diverse landscape of model-based methods has been introduced to improve model accuracy, using methods such as probabilistic inference, model-predictive control, latent models, and end-to-end learning and planning. Some of these methods succeed in achieving high accuracy at low sample complexity in typical benchmark applications. In this paper, we survey these methods; we explain how they work and what their strengths and weaknesses are. We conclude with a research agenda for future work to make the methods more robust and applicable to a wider range of applications.
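The abstract's core trade-off (spending extra computation on a learned dynamics model in exchange for fewer environment samples) can be illustrated with the classic tabular Dyna-Q algorithm. This is a minimal sketch, not one of the deep-learning methods the survey covers; the chain environment, function names, and hyperparameters are illustrative choices only:

```python
import random

def dyna_q(env_step, start, states, actions, episodes=60, planning_steps=10,
           alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Dyna-Q: every real environment sample both updates Q directly
    and is stored in a learned model, which is then replayed for extra
    'imagined' updates that cost no environment samples."""
    rng = random.Random(seed)
    Q = {(s, a): 0.0 for s in states for a in actions}
    model = {}                                  # (s, a) -> (reward, next_state, done)
    for _ in range(episodes):
        s, done = start, False
        while not done:
            # epsilon-greedy behaviour policy
            a = (rng.choice(actions) if rng.random() < epsilon
                 else max(actions, key=lambda b: Q[(s, b)]))
            r, s2, done = env_step(s, a)        # one real environment sample
            target = r if done else r + gamma * max(Q[(s2, b)] for b in actions)
            Q[(s, a)] += alpha * (target - Q[(s, a)])
            model[(s, a)] = (r, s2, done)       # update the learned model
            # planning: replay model transitions, no environment interaction
            for _ in range(planning_steps):
                (ps, pa), (pr, ps2, pd) = rng.choice(list(model.items()))
                pt = pr if pd else pr + gamma * max(Q[(ps2, b)] for b in actions)
                Q[(ps, pa)] += alpha * (pt - Q[(ps, pa)])
            s = s2
    return Q

# Toy 5-state chain: 'right' walks toward a terminal reward at state 4.
def chain_step(s, a):
    s2 = min(s + 1, 4) if a == "right" else max(s - 1, 0)
    return (1.0 if s2 == 4 else 0.0), s2, s2 == 4

Q = dyna_q(chain_step, start=0, states=range(5), actions=["left", "right"])
```

Each real step buys `planning_steps` extra "imagined" updates, which is why model-based methods can reach good policies from far fewer environment samples, provided the model is accurate; keeping the model accurate in high-dimensional problems is exactly the challenge the survey addresses.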
Pages: 9541-9573 (32 pages)
Related papers (50 in total)
  • [31] Transferring Instances for Model-Based Reinforcement Learning
    Taylor, Matthew E.
    Jong, Nicholas K.
    Stone, Peter
Machine Learning and Knowledge Discovery in Databases, Part II, Proceedings, 2008, 5212: 488-505
  • [32] Consistency of Fuzzy Model-Based Reinforcement Learning
    Busoniu, Lucian
    Ernst, Damien
    De Schutter, Bart
    Babuska, Robert
2008 IEEE International Conference on Fuzzy Systems, 2008: 518+
  • [33] Abstraction Selection in Model-Based Reinforcement Learning
    Jiang, Nan
    Kulesza, Alex
    Singh, Satinder
International Conference on Machine Learning, Vol. 37, 2015: 179-188
  • [34] Asynchronous Methods for Model-Based Reinforcement Learning
    Zhang, Yunzhi
    Clavera, Ignasi
    Tsai, Boren
    Abbeel, Pieter
Conference on Robot Learning, Vol. 100, 2019
  • [35] Online Constrained Model-based Reinforcement Learning
    van Niekerk, Benjamin
    Damianou, Andreas
    Rosman, Benjamin
Conference on Uncertainty in Artificial Intelligence (UAI 2017), 2017
  • [36] Calibrated Model-Based Deep Reinforcement Learning
    Malik, Ali
    Kuleshov, Volodymyr
    Song, Jiaming
    Nemer, Danny
    Seymour, Harlan
    Ermon, Stefano
International Conference on Machine Learning, Vol. 97, 2019
  • [37] Iterative Learning Procedure With Reinforcement for High-Accuracy Force Tracking in Robotized Tasks
    Roveda, Loris
    Pallucca, Giacomo
    Pedrocchi, Nicola
    Braghin, Francesco
    Tosatti, Lorenzo Molinari
IEEE Transactions on Industrial Informatics, 2018, 14(4): 1753-1763
  • [38] Model gradient: unified model and policy learning in model-based reinforcement learning
    Jia, Chengxing
    Zhang, Fuxiang
    Xu, Tian
    Pang, Jing-Cheng
    Zhang, Zongzhang
    Yu, Yang
Frontiers of Computer Science, 2024, 18(4)
  • [40] Incremental Learning of Planning Actions in Model-Based Reinforcement Learning
    Ng, Jun Hao Alvin
    Petrick, Ronald P. A.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019: 3195-3201