SocNavGym: A Reinforcement Learning Gym for Social Navigation

Authors
Kapoor, Aditya [1]
Swamy, Sushant [2]
Bachiller, Pilar [3]
Manso, Luis J. [4]
Affiliations
[1] Tata Consultancy Serv, Res & Innovat, Mumbai, Maharashtra, India
[2] Birla Inst Technol & Sci, Sancoale, Goa, India
[3] Univ Extremadura, Comp & Commun Technol Dept, Badajoz, Spain
[4] Aston Univ, Dept Comp Sci, Autonomous Robot & Percept Lab, Birmingham, W Midlands, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
MODEL;
DOI
10.1109/RO-MAN57019.2023.10309591
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
It is essential for autonomous robots to be socially compliant while navigating in human-populated environments. Machine Learning and, especially, Deep Reinforcement Learning (DRL) have recently gained considerable traction in the field of Social Navigation. This can be partially attributed to the resulting policies not being bound by human limitations in terms of code complexity or the number of variables that are handled. Unfortunately, the lack of safety guarantees and the large data requirements of DRL algorithms make learning in the real world unfeasible. To bridge this gap, simulation environments are frequently used. We propose SocNavGym, an advanced simulation environment for social navigation that can generate a wide variety of social navigation scenarios and facilitates the development of intelligent social agents. SocNavGym is lightweight, fast, easy to use, and can be effortlessly configured to generate different types of social navigation scenarios. It can also be configured to work with different hand-crafted and data-driven social reward signals and to yield a variety of evaluation metrics to benchmark agents' performance. Further, we also provide a case study where a Dueling-DQN agent is trained to learn social-navigation policies using SocNavGym. The results provide evidence that SocNavGym can be used to train an agent from scratch to navigate in simple as well as complex social scenarios. Our experiments also show that agents trained using the data-driven reward function display more advanced social compliance than agents trained using the heuristic-based reward function.
Pages: 2010-2017
Number of pages: 8