Real-World Reinforcement Learning via Multifidelity Simulators

Cited by: 33

Authors:

Cutler, Mark [1]

Walsh, Thomas J. [1]

How, Jonathan P. [1]

Affiliations:

[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA

Source:

IEEE TRANSACTIONS ON ROBOTICS | 2015 / Vol. 31 / No. 3

Funding:

National Science Foundation (USA);

Keywords:

Animation and simulation; autonomous agents; learning and adaptive systems; reinforcement learning (RL); robotics;

DOI:

10.1109/TRO.2015.2419431

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Subject classification codes:

080202 ; 1405 ;

Abstract:

Reinforcement learning (RL) can be a tool for designing policies and controllers for robotic systems. However, the cost of real-world samples remains prohibitive as many RL algorithms require a large number of samples before learning useful policies. Simulators are one way to decrease the number of required real-world samples, but imperfect models make deciding when and how to trust samples from a simulator difficult. We present a framework for efficient RL in a scenario where multiple simulators of a target task are available, each with varying levels of fidelity. The framework is designed to limit the number of samples used in each successively higher-fidelity/cost simulator by allowing a learning agent to choose to run trajectories at the lowest level simulator that will still provide it with useful information. Theoretical proofs of the framework's sample complexity are given and empirical results are demonstrated on a remote-controlled car with multiple simulators. The approach enables RL algorithms to find near-optimal policies in a physical robot domain with fewer expensive real-world samples than previous transfer approaches or learning without simulators.
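The core idea in the abstract, spending samples at the cheapest simulator that can still be informative, can be illustrated with a small sketch. This is not the authors' algorithm: it swaps the RL task for a toy multi-armed bandit and uses simple one-way filtering, where each level only evaluates the options that the cheaper level could not rule out. All names (`multifidelity_search`, `make_simulator`, `tolerance`, the reward values) are hypothetical and chosen only for illustration.

```python
import random


def make_simulator(true_means, noise):
    """Build a hypothetical simulator: reward = fixed mean + Gaussian noise."""
    def simulate(arm, rng):
        return true_means[arm] + rng.gauss(0.0, noise)
    return simulate


def multifidelity_search(simulators, n_arms, samples_per_arm=30,
                         tolerance=0.2, seed=0):
    """Evaluate arms level by level, cheapest simulator first.

    Each level only samples arms that the previous level left as
    candidates, so the most expensive level sees the fewest arms.
    """
    rng = random.Random(seed)
    candidates = list(range(n_arms))
    samples_used = []
    means = {}
    for simulate in simulators:
        means = {}
        for arm in candidates:
            rewards = [simulate(arm, rng) for _ in range(samples_per_arm)]
            means[arm] = sum(rewards) / samples_per_arm
        best = max(means.values())
        # Keep arms within `tolerance` of the best estimate; the slack
        # allows for the lower-fidelity model being somewhat wrong.
        candidates = [a for a in candidates if means[a] >= best - tolerance]
        samples_used.append(samples_per_arm * len(means))
    best_arm = max(candidates, key=lambda a: means[a])
    return best_arm, samples_used


# Assumed toy setup: the coarse simulator misranks arms 3 and 4,
# while the "real world" (last level) gives the true ordering.
coarse = make_simulator([0.2, 0.3, 0.4, 0.8, 0.85], noise=0.05)
real = make_simulator([0.1, 0.3, 0.5, 0.9, 0.8], noise=0.05)

best_arm, samples_used = multifidelity_search([coarse, real], n_arms=5)
print(best_arm, samples_used)
```

In this toy setup the coarse level eliminates the three clearly poor arms, so the expensive level only has to separate the two arms the coarse model confused. The framework described in the abstract lets the agent itself choose which level to run at, which this fixed bottom-up pass does not capture.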

Pages: 655-671

Page count: 17
