SocNavGym: A Reinforcement Learning Gym for Social Navigation

被引：0

作者：

Kapoor, Aditya ^{[1
]}

Swamy, Sushant ^{[2
]}

Bachiller, Pilar ^{[3
]}

Manso, Luis J. ^{[4
]}

机构：

[1] Tata Consultancy Serv, Res & Innovat, Mumbai, Maharashtra, India

[2] Birla Inst Technol & Sci, Sancoale, Goa, India

[3] Univ Extremadura, Comp & Commun Technol Dept, Badajoz, Spain

[4] Aston Univ, Dept Comp Sci, Autonomous Robot & Percept Lab, Birmingham, W Midlands, England

来源：

2023 32ND IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, RO-MAN | 2023年

基金：

英国工程与自然科学研究理事会;

关键词：

MODEL;

D O I：

10.1109/RO-MAN57019.2023.10309591

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

It is essential for autonomous robots to be socially compliant while navigating in human-populated environments. Machine Learning and, especially, Deep Reinforcement Learning have recently gained considerable traction in the field of Social Navigation. This can be partially attributed to the resulting policies not being bound by human limitations in terms of code complexity or the number of variables that are handled. Unfortunately, the lack of safety guarantees and the large data requirements by DRL algorithms make learning in the real world unfeasible. To bridge this gap, simulation environments are frequently used. We propose SocNavGym, an advanced simulation environment for social navigation that can generate a wide variety of social navigation scenarios and facilitates the development of intelligent social agents. SocNavGym is lightweight, fast, easy to use, and can be effortlessly configured to generate different types of social navigation scenarios. It can also be configured to work with different hand-crafted and data-driven social reward signals and to yield a variety of evaluation metrics to benchmark agents' performance. Further, we also provide a case study where a Dueling-DQN agent is trained to learn social-navigation policies using SocNavGym. The results provide evidence that SocNavGym can be used to train an agent from scratch to navigate in simple as well as complex social scenarios. Our experiments also show that the agents trained using the data-driven reward function display more advanced social compliance in comparison to the heuristic-based reward function.

引用

页码：2010 / 2017

页数：8

共 50 条

[41] The Health Gym: synthetic health-related datasets for the development of reinforcement learning algorithms
Nicholas I-Hsien Kuo
Mark N. Polizzotto
Simon Finfer
Federico Garcia
Anders Sönnerborg
Maurizio Zazzi
Michael Böhm
Rolf Kaiser
Louisa Jorm
Sebastiano Barbieri
[J]. Scientific Data, 9
[42] Reinforcement Learning in Multi-agent Games: Open AI Gym Diplomacy Environment
Cruz, Diogo
Cruz, Jose Aleixo
Cardoso, Henrique Lopes
[J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2019, PT I, 2019, 11804 : 49 - 60
[43] Improving reinforcement learning with human assistance: an argument for human subject studies with HIPPO Gym
Matthew E. Taylor
Nicholas Nissen
Yuan Wang
Neda Navidi
[J]. Neural Computing and Applications, 2023, 35 : 23429 - 23439
[44] The Health Gym: synthetic health-related datasets for the development of reinforcement learning algorithms
Kuo, Nicholas I-Hsien
Polizzotto, Mark N.
Finfer, Simon
Garcia, Federico
Sonnerborg, Anders
Zazzi, Maurizio
Boehm, Michael
Kaiser, Rolf
Jorm, Louisa
Barbieri, Sebastiano
[J]. SCIENTIFIC DATA, 2022, 9 (01)
[45] Improving reinforcement learning with human assistance: an argument for human subject studies with HIPPO Gym
Taylor, Matthew E.
Nissen, Nicholas
Wang, Yuan
Navidi, Neda
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (32): : 23429 - 23439
[46] NetAI-Gym: Customized Environment for Network to Evaluate Agent Algorithm using Reinforcement Learning in Open-AI Gym Platform
Vidyadhar, Varshini
Nagaraj, R.
Ashoka, D., V
[J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (04) : 169 - 176
[47] Deep Reinforcement Learning for Mapless Robot Navigation Systems
Oliveira, Iure Rosa L.
Brandao, Alexandre S.
[J]. 2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE, 2023, : 331 - 336
[48] Flock Navigation by Coordinated Shepherds via Reinforcement Learning
Hasan, Yazied
Baxter, John E. G.
Salcedo, Cesar A.
Delgado, Elena
Tapia, Lydia
[J]. ALGORITHMIC FOUNDATIONS OF ROBOTICS XV, 2023, 25 : 454 - 469
[49] Quantum Deep Reinforcement Learning for Robot Navigation Tasks
Hohenfeld, Hans
Heimann, Dirk
Wiebe, Felix
Kirchner, Frank
[J]. IEEE ACCESS, 2024, 12 : 87217 - 87236
[50] A comparison of reinforcement learning models of human spatial navigation
Qiliang He
Jancy Ling Liu
Lou Eschapasse
Elizabeth H. Beveridge
Thackery I. Brown
[J]. Scientific Reports, 12

← 1 2 3 4 5 →