STASIS: Reinforcement Learning Simulators for Human-Centric Real-World Environments

被引:0
|
作者
Efstathiadis, Georgios [1 ]
Emedom-Nnamdi, Patrick [1 ]
Kolbeinsson, Arinbjorn [2 ]
Onnela, Jukka-Pekka [1 ]
Lu, Junwei [1 ]
机构
[1] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Evidat Hlth, London, England
关键词
reinforcement learning; health care; real-world simulators;
D O I
10.1007/978-3-031-39539-0_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present on-going work toward building Stasis, a suite of reinforcement learning (RL) environments that aim to maintain realism for human-centric agents operating in real-world settings. Through representation learning and alignment with real-world offline data, Stasis allows for the evaluation of RL algorithms in offline environments with adjustable characteristics, such as observability, heterogeneity and levels of missing data. We aim to introduce environments the encourage training RL agents that are capable of maintaining a level of performance and robustness comparable to agents trained in real-world online environments, while avoiding the high cost and risks associated with making mistakes during online training. We provide examples of two environments that will be part of Stasis and discuss its implications for the deployment of RL-based systems in sensitive and high-risk areas of application.
引用
收藏
页码:85 / 92
页数:8
相关论文
共 50 条
  • [21] Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
    Dulac-Arnold, Gabriel
    Levine, Nir
    Mankowitz, Daniel J.
    Li, Jerry
    Paduraru, Cosmin
    Gowal, Sven
    Hester, Todd
    MACHINE LEARNING, 2021, 110 (09) : 2419 - 2468
  • [22] Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
    Gabriel Dulac-Arnold
    Nir Levine
    Daniel J. Mankowitz
    Jerry Li
    Cosmin Paduraru
    Sven Gowal
    Todd Hester
    Machine Learning, 2021, 110 : 2419 - 2468
  • [23] Toward the confident deployment of real-world reinforcement learning agents
    Hanna, Josiah P.
    AI MAGAZINE, 2024, 45 (03) : 396 - 403
  • [24] ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation
    Pendyala, Abhijeet
    Dettmer, Justin
    Glasmachers, Tobias
    Atamna, Asma
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT I, 2024, 14505 : 78 - 92
  • [25] A Real-World Quadrupedal Locomotion Benchmark for Offline Reinforcement Learning
    Zhang, Hongyin
    Yang, Shuyu
    Wang, Donglin
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [26] Learning in a Real-World Context and Exploring Innovative Digital Learning Environments
    Oksanen, Lea
    PROCEEDINGS OF THE 16TH EUROPEAN CONFERENCE ON INNOVATION AND ENTREPRENEURSHIP (ECIE 2021), VOL 2, 2021, : 1182 - 1189
  • [27] A human-centric framework for robotic task learning and optimization
    Roveda, Loris
    Veerappan, Palaniappan
    Maccarini, Marco
    Bucca, Giuseppe
    Ajoudani, Arash
    Piga, Dario
    JOURNAL OF MANUFACTURING SYSTEMS, 2023, 67 : 68 - 79
  • [28] Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks
    Kreutzer, Julia
    Riezler, Stefan
    Lawrence, Carolin
    SPNLP 2021: THE 5TH WORKSHOP ON STRUCTURED PREDICTION FOR NLP, 2021, : 37 - 43
  • [29] Real-world Robot Reaching Skill Learning Based on Deep Reinforcement Learning
    Liu, Naijun
    Lu, Tao
    Cai, Yinghao
    Wang, Rui
    Wang, Shuo
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4780 - 4784
  • [30] MAX: Wide area human-centric search of the physical world
    Yap, Kok-Kiong
    Srinivasan, Vikram
    Motani, Mehul
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2008, 4 (04)