Real-World Reinforcement Learning via Multifidelity Simulators

被引:33
|
作者
Cutler, Mark [1 ]
Walsh, Thomas J. [1 ]
How, Jonathan P. [1 ]
机构
[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
Animation and simulation; autonomous agents; learning and adaptive systems; reinforcement learning (RL); ROBOTICS;
D O I
10.1109/TRO.2015.2419431
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Reinforcement learning (RL) can be a tool for designing policies and controllers for robotic systems. However, the cost of real-world samples remains prohibitive as many RL algorithms require a large number of samples before learning useful policies. Simulators are one way to decrease the number of required real-world samples, but imperfect models make deciding when and how to trust samples from a simulator difficult. We present a framework for efficient RL in a scenario where multiple simulators of a target task are available, each with varying levels of fidelity. The framework is designed to limit the number of samples used in each successively higher-fidelity/cost simulator by allowing a learning agent to choose to run trajectories at the lowest level simulator that will still provide it with useful information. Theoretical proofs of the framework's sample complexity are given and empirical results are demonstrated on a remote-controlled car with multiple simulators. The approach enables RL algorithms to find near-optimal policies in a physical robot domain with fewer expensive real-world samples than previous transfer approaches or learning without simulators.
引用
收藏
页码:655 / 671
页数:17
相关论文
共 50 条
  • [41] Real-world Cross-modal Retrieval via Sequential Learning
    Song, Ge
    Tan, Xiaoyang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1708 - 1721
  • [42] Adaptive internal state space construction method for Reinforcement learning of a real-world agent
    Samejima, K
    Omori, T
    NEURAL NETWORKS, 1999, 12 (7-8) : 1143 - 1155
  • [43] Deep reinforcement learning towards real-world dynamic thermal management of data centers
    Zhang, Qingang
    Zeng, Wei
    Lin, Qinjie
    Chng, Chin-Boon
    Chui, Chee-Kong
    Lee, Poh-Seng
    APPLIED ENERGY, 2023, 333
  • [44] Application of Reinforcement Learning with Continuous State Space to Ramp Metering in Real-world Conditions
    Rezaee, Kasra
    Abdulhai, Baher
    Abdelgawad, Hossam
    2012 15TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2012, : 1590 - 1595
  • [45] Reinforcement Learning for Semi-Active Vertical Dynamics Control with Real-World Tests
    Ultsch, Johannes
    Pfeiffer, Andreas
    Ruggaber, Julian
    Kamp, Tobias
    Brembeck, Jonathan
    Tobolar, Jakub
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [46] First steps towards real-world traffic signal control optimisation by reinforcement learning
    Meess, Henri
    Gerner, Jeremias
    Hein, Daniel
    Schmidtner, Stefanie
    Elger, Gordon
    Bogenberger, Klaus
    JOURNAL OF SIMULATION, 2024, 18 (06) : 957 - 972
  • [47] Uli-RL: A Real-World Deep Reinforcement Learning Pedagogical Agent for Children
    Riedmann, Anna
    Goetz, Julia
    D'Eramo, Carlo
    Lugrin, Birgit
    KI 2024: ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2024, 2024, 14992 : 316 - 323
  • [48] Virtual-Taobao: Virtualizing Real-World Online Retail Environment for Reinforcement Learning
    Shi, Jing-Cheng
    Yu, Yang
    Da, Qing
    Chen, Shi-Yong
    Zeng, An-Xiang
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4902 - 4909
  • [49] Real-world ride-hailing vehicle repositioning using deep reinforcement learning
    Jiao, Yan
    Tang, Xiaocheng
    Qin, Zhiwei
    Li, Shuaiji
    Zhang, Fan
    Zhu, Hongtu
    Ye, Jieping
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 130
  • [50] Differentiable Physics Models for Real-world Offline Model-based Reinforcement Learning
    Lutter, Michael
    Silberbauer, Johannes
    Watson, Joe
    Peters, Jan
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4163 - 4170