Inverse reinforcement learning-based time-dependent A* planner for human-aware robot navigation with local vision

被引:15
|
作者
Sun Shiying [1 ]
Zhao Xiaoguang [1 ]
Li Qianzhong [1 ]
Tan Min [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Human-aware navigation; inverse reinforcement learning; path planning; service robot;
D O I
10.1080/01691864.2020.1753569
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In an environment where robots coexist with humans, mobile robots should be human-aware and comply with humans' behavioural norms so as to not disturb humans' personal space and activities. In this work, we propose an inverse reinforcement learning-based time-dependent A* planner for human-aware robot navigation with local vision. In this method, the planning process of time-dependent A* is regarded as a Markov decision process and the cost function of the time-dependent A* is learned using the inverse reinforcement learning via capturing humans' demonstration trajectories. With this method, a robot can plan a path that complies with humans' behaviour patterns and the robot's kinematics. When constructing feature vectors of the cost function, considering the local vision characteristics, we propose a visual coverage feature for enabling robots to learn from how humans move in a limited visual field. The effectiveness of the proposed method has been validated by experiments in real-world scenarios: using this approach robots can effectively mimic human motion patterns when avoiding pedestrians; furthermore, in a limited visual field, robots can learn to choose a path that enables them to have the larger visual coverage which shows a better navigation performance.
引用
收藏
页码:888 / 901
页数:14
相关论文
共 50 条
  • [31] Memory-based crowd-aware robot navigation using deep reinforcement learning
    Sunil Srivatsav Samsani
    Husna Mutahira
    Mannan Saeed Muhammad
    [J]. Complex & Intelligent Systems, 2023, 9 : 2147 - 2158
  • [32] Travel Time-dependent Maximum Entropy Inverse Reinforcement Learning for Seabird Trajectory Prediction
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Yoda, Ken
    Tamaki, Toru
    Fujiyoshi, Hironobu
    [J]. PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 430 - 435
  • [33] Memory-based crowd-aware robot navigation using deep reinforcement learning
    Samsani, Sunil Srivatsav
    Mutahira, Husna
    Muhammad, Mannan Saeed
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (02) : 2147 - 2158
  • [34] Fast Adaptation of Deep Reinforcement Learning-Based Navigation Skills to Human Preference
    Choi, Jinyoung
    Dance, Christopher
    Kim, Jung-eun
    Park, Kyung-sik
    Han, Jaehun
    Seo, Joonho
    Kim, Minsu
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 3363 - 3370
  • [35] Vision-Based Autonomous Navigation Approach for a Tracked Robot Using Deep Reinforcement Learning
    Ejaz, Muhammad Mudassir
    Tang, Tong Boon
    Lu, Cheng-Kai
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (02) : 2230 - 2240
  • [36] Adaptive Risk-Based Replanning For Human-Aware Multi-Robot Task Allocation With Local Perception
    Talebpour, Zeynab
    Martinoli, Alcherio
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04): : 3790 - 3797
  • [37] Reinforcement Learning-Based Energy-Aware Area Coverage for Reconfigurable hRombo Tiling Robot
    Le, Anh Vu
    Parween, Rizuwana
    Kyaw, Phone Thiha
    Mohan, Rajesh Elara
    Minh, Tran Hoang Quang
    Borusu, Charan Satya Chandra Sairam
    [J]. IEEE ACCESS, 2020, 8 : 209750 - 209761
  • [38] Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning
    Chen, Changan
    Liu, Yuejiang
    Kreiss, Sven
    Alahi, Alexandre
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 6015 - 6022
  • [39] Context-Conditional Navigation with a Learning-Based Terrain- and Robot-Aware Dynamics Model
    Guttikonda, Suresh
    Achterhold, Jan
    Li, Haolong
    Boedecker, Joschka
    Stueckler, Joerg
    [J]. 2023 EUROPEAN CONFERENCE ON MOBILE ROBOTS, ECMR, 2023, : 186 - 192
  • [40] Multi-objective deep reinforcement learning for crowd-aware robot navigation with dynamic human preference
    Guangran Cheng
    Yuanda Wang
    Lu Dong
    Wenzhe Cai
    Changyin Sun
    [J]. Neural Computing and Applications, 2023, 35 : 16247 - 16265