STASIS: Reinforcement Learning Simulators for Human-Centric Real-World Environments

被引:0
|
作者
Efstathiadis, Georgios [1 ]
Emedom-Nnamdi, Patrick [1 ]
Kolbeinsson, Arinbjorn [2 ]
Onnela, Jukka-Pekka [1 ]
Lu, Junwei [1 ]
机构
[1] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Evidat Hlth, London, England
关键词
reinforcement learning; health care; real-world simulators;
D O I
10.1007/978-3-031-39539-0_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present on-going work toward building Stasis, a suite of reinforcement learning (RL) environments that aim to maintain realism for human-centric agents operating in real-world settings. Through representation learning and alignment with real-world offline data, Stasis allows for the evaluation of RL algorithms in offline environments with adjustable characteristics, such as observability, heterogeneity and levels of missing data. We aim to introduce environments the encourage training RL agents that are capable of maintaining a level of performance and robustness comparable to agents trained in real-world online environments, while avoiding the high cost and risks associated with making mistakes during online training. We provide examples of two environments that will be part of Stasis and discuss its implications for the deployment of RL-based systems in sensitive and high-risk areas of application.
引用
收藏
页码:85 / 92
页数:8
相关论文
共 50 条
  • [41] A Real-World Reinforcement Learning Framework for Safe and Human-Like Tactical Decision-Making
    Yavas, Muharrem Ugur
    Kumbasar, Tufan
    Ure, Nazim Kemal
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) : 11773 - 11784
  • [42] Learning to Generate Unambiguous Spatial Referring Expressions for Real-World Environments
    Dagan, Fethiye Irmak
    Kalkan, Sinan
    Leite, Iolanda
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4992 - 4999
  • [43] Real-time human-centric segmentation for complex video scenes
    Yu, Ran
    Tian, Chenyu
    Xia, Weihao
    Zhao, Xinyuan
    Wang, Liejun
    Yang, Yujiu
    IMAGE AND VISION COMPUTING, 2022, 126
  • [44] Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities
    Zadaianchuk, Andrii
    Seitzer, Maximilian
    Martius, Georg
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [45] Evaluating theories for managing imperfect knowledge in human-centric database reengineering environments
    Jahnke, JH
    Walenstein, A
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2002, 12 (01) : 77 - 102
  • [46] A Novel Methodology Analyzing the Influence of Micro-Stresses on Human-Centric Environments
    Shakhovska, Nataliya
    Kaminskyy, Roman
    Khudoba, Bohdan
    Mykhailyshyn, Vladyslav
    Helzhynskyi, Ihor
    COMPUTATION, 2023, 11 (11)
  • [47] Practical Implementation of Visual Navigation Based on Semantic Segmentation for Human-Centric Environments
    Adachi, Miho
    Honda, Kazufumi
    Xue, Junfeng
    Sudo, Hiroaki
    Ueda, Yuriko
    Yuda, Yuki
    Wada, Marin
    Miyamoto, Ryusuke
    JOURNAL OF ROBOTICS AND MECHATRONICS, 2023, 35 (06) : 1419 - 1434
  • [48] Domain Adapting Deep Reinforcement Learning for Real-World Speech Emotion Recognition
    Rajapakshe, Thejan
    Rana, Rajib
    Khalifa, Sara
    Schuller, Bjoern W.
    IEEE ACCESS, 2024, 12 : 193101 - 193114
  • [49] Optimizing Reinforcement Learning Control Model in Furuta Pendulum and Transferring it to Real-World
    Hong, Myung Rae
    Kang, Sanghun
    Lee, Jingoo
    Seo, Sungchul
    Han, Seungyong
    Koh, Je-Sung
    Kang, Daeshik
    IEEE ACCESS, 2023, 11 : 95195 - 95200
  • [50] Real-World Implementation of Reinforcement Learning Based Energy Coordination for a Cluster of Households
    Gokhale, Gargya
    Tiben, Niels
    Verwee, Marie-Sophie
    Lahariya, Manu
    Claessens, Bert
    Develder, Chris
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 347 - 351