Multi-objective deep reinforcement learning for crowd-aware robot navigation with dynamic human preference

被引:0
|
作者
Cheng, Guangran [1 ,2 ]
Wang, Yuanda [1 ,2 ]
Dong, Lu [3 ]
Cai, Wenzhe [1 ,2 ]
Sun, Changyin [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China
[3] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 22期
基金
中国国家自然科学基金;
关键词
Crowd-aware navigation; Multi-objective deep reinforcement learning; Mobile robot; Path planning; Path tracking; ENVIRONMENT;
D O I
10.1007/s00521-023-08385-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing development of autonomous systems is driving the application of mobile robots in crowded environments. These scenarios often require robots to satisfy multiple conflicting objectives with different relative preferences, such as work efficiency, safety, and smoothness, which inherently cause robots' poor exploration in seeking policies optimizing several performance criteria. In this paper, we propose a multi-objective deep reinforcement learning framework for crowd-aware robot navigation problems to learn policies over multiple competing objectives whose relative importance preference is dynamic to the robot. First, a two-stream structure is introduced to separately extract the spatial and temporal features of pedestrian motion characteristics. Second, to learn navigation policies for each possible preference, a multi-objective deep reinforcement learning method is proposed to maximize a weighted-sum scalarization of different objective functions. We consider path planning and path tracking tasks, which focus on conflicting objectives of collision avoidance, target reaching, and path following. Experimental results demonstrate that our method can effectively navigate through crowds in simulated environments while satisfying different task requirements.
引用
收藏
页码:16247 / 16265
页数:19
相关论文
共 50 条
  • [31] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Horie, Naoto
    Matsui, Tohgoroh
    Moriyama, Koichi
    Mutoh, Atsuko
    Inuzuka, Nobuhiro
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (03) : 352 - 359
  • [32] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Naoto Horie
    Tohgoroh Matsui
    Koichi Moriyama
    Atsuko Mutoh
    Nobuhiro Inuzuka
    [J]. Artificial Life and Robotics, 2019, 24 : 352 - 359
  • [33] Multi-Objective Reinforcement Learning for Autonomous Drone Navigation in Urban Area
    Wu, Jiahao
    Ye, Yang
    Du, Jing
    [J]. CONSTRUCTION RESEARCH CONGRESS 2024: ADVANCED TECHNOLOGIES, AUTOMATION, AND COMPUTER APPLICATIONS IN CONSTRUCTION, 2024, : 707 - 716
  • [34] Multi-objective path planning based on deep reinforcement learning
    Xu, Jian
    Huang, Fei
    Cui, Yunfei
    Du, Xue
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3273 - 3279
  • [35] Modular Multi-Objective Deep Reinforcement Learning with Decision Values
    Tajmajer, Tomasz
    [J]. PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 85 - 93
  • [36] Deep reinforcement learning for multi-objective game strategy selection
    Jiang, Ruhao
    Deng, Yanchen
    Chen, Yingying
    Luo, He
    An, Bo
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2024, 168
  • [37] Crowd-Comfort Robot Navigation Among Dynamic Environment Based on Social-Stressed Deep Reinforcement Learning
    Zhengxi Hu
    Yingli Zhao
    Sen Zhang
    Lei Zhou
    Jingtai Liu
    [J]. International Journal of Social Robotics, 2022, 14 : 913 - 929
  • [38] Crowd-Comfort Robot Navigation Among Dynamic Environment Based on Social-Stressed Deep Reinforcement Learning
    Hu, Zhengxi
    Zhao, Yingli
    Zhang, Sen
    Zhou, Lei
    Liu, Jingtai
    [J]. INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2022, 14 (04) : 913 - 929
  • [39] Emotion Regulation Based on Multi-objective Weighted Reinforcement Learning for Human-robot Interaction
    Hao, Man
    Cao, Weihua
    Liu, Zhentao
    Wu, Min
    Yuan, Yan
    [J]. 2019 12TH ASIAN CONTROL CONFERENCE (ASCC), 2019, : 1402 - 1406
  • [40] Dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning
    Xiankun Zhang
    Yuhu Shang
    Yimeng Ren
    Kun Liang
    [J]. Complex & Intelligent Systems, 2023, 9 : 1891 - 1911