Real-World Reinforcement Learning via Multifidelity Simulators

被引:33
|
作者
Cutler, Mark [1 ]
Walsh, Thomas J. [1 ]
How, Jonathan P. [1 ]
机构
[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
Animation and simulation; autonomous agents; learning and adaptive systems; reinforcement learning (RL); ROBOTICS;
D O I
10.1109/TRO.2015.2419431
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Reinforcement learning (RL) can be a tool for designing policies and controllers for robotic systems. However, the cost of real-world samples remains prohibitive as many RL algorithms require a large number of samples before learning useful policies. Simulators are one way to decrease the number of required real-world samples, but imperfect models make deciding when and how to trust samples from a simulator difficult. We present a framework for efficient RL in a scenario where multiple simulators of a target task are available, each with varying levels of fidelity. The framework is designed to limit the number of samples used in each successively higher-fidelity/cost simulator by allowing a learning agent to choose to run trajectories at the lowest level simulator that will still provide it with useful information. Theoretical proofs of the framework's sample complexity are given and empirical results are demonstrated on a remote-controlled car with multiple simulators. The approach enables RL algorithms to find near-optimal policies in a physical robot domain with fewer expensive real-world samples than previous transfer approaches or learning without simulators.
引用
收藏
页码:655 / 671
页数:17
相关论文
共 50 条
  • [31] Real-World Implementation of Reinforcement Learning Based Energy Coordination for a Cluster of Households
    Gokhale, Gargya
    Tiben, Niels
    Verwee, Marie-Sophie
    Lahariya, Manu
    Claessens, Bert
    Develder, Chris
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 347 - 351
  • [32] Controlling Aluminum Strip Thickness by Clustered Reinforcement Learning With Real-World Dataset
    Xiao, Ziqi
    He, Zhili
    Liang, Huanghuang
    Hu, Chuang
    Cheng, Dazhao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (08) : 9928 - 9938
  • [33] Exploring Applications of Deep Reinforcement Learning for Real-world Autonomous Driving Systems
    Talpaert, Victor
    Sobh, Ibrahim
    Kiran, B. Ravi
    Mannion, Patrick
    Yogamani, Senthil
    El-Sallab, Ahmad
    Perez, Patrick
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 564 - 572
  • [34] Guided Reinforcement Learning A Review and Evaluation for Efficient and Effective Real-World Robotics
    Esser, Julian
    Bach, Nicolas
    Jestel, Christian
    Urbann, Oliver
    Kerner, Soren
    IEEE ROBOTICS & AUTOMATION MAGAZINE, 2023, 30 (02) : 67 - 85
  • [35] A Survey on Reproducibility by Evaluating Deep Reinforcement Learning Algorithms on Real-World Robots
    Lynnerup, Nicolai A.
    Nolling, Laura
    Hasle, Rasmus
    Hallam, John
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [36] Learning With Real-World Data
    不详
    IEEE CONTROL SYSTEMS MAGAZINE, 2023, 43 (05): : 158 - 159
  • [37] A REFUGE FOR REAL-WORLD LEARNING
    MCFADEN, D
    NELSON, B
    EDUCATIONAL LEADERSHIP, 1995, 52 (08) : 11 - 13
  • [38] Train a real-world local path planner in one hour via partially decoupled reinforcement learning and vectorized diversity
    Xin, Jinghao
    Kim, Jinwoo
    Li, Zhi
    Li, Ning
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 141
  • [39] DeepEmo: Real-world Facial Expression Analysis via Deep Learning
    Deng, Weihong
    Hu, Jiani
    Zhang, Shuo
    Gao, Jun
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [40] Underwater Image Restoration via Contrastive Learning and a Real-World Dataset
    Han, Junlin
    Shoeiby, Mehrdad
    Malthus, Tim
    Botha, Elizabeth
    Anstee, Janet
    Anwar, Saeed
    Wei, Ran
    Armin, Mohammad Ali
    Li, Hongdong
    Petersson, Lars
    REMOTE SENSING, 2022, 14 (17)