Multirobot coordination with deep reinforcement learning in complex environments

被引:18
|
作者
Wang, Di [1 ]
Deng, Hongbin [1 ]
机构
[1] Beijing Inst Technol, Sch Mechatron Engn, Beijing 10081, Peoples R China
基金
中国国家自然科学基金;
关键词
Multirobot coordination; Reinforcement learning; Deep learning; Visual perception; MANIPULATION;
D O I
10.1016/j.eswa.2021.115128
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the multiple autonomous robot system, it is very important to complete path planning coordinately and effectively in the processes of interference avoidance, resource allocation and information sharing. In traditional multirobot coordination algorithms, most of the solutions are in known environments, the target position that each robot needs to move to and the robot priority are set, which limits the autonomy of the robot. Only using visual information to solve the problem of multirobot coordination is still less. This paper proposes a multi-robot cooperative algorithm based on deep reinforcement learning to make the robot more autonomous in the process of selecting target positions and moving. We use the end-to-end approach, using only the top view, that is, a robot-centered top view, and the first-person view, that is, the image information collected from the first-person perspective of the robot, as input. The proposed algorithm, which includes a dueling neural network structure, can solve task allocation and path planning; we call the algorithm TFDueling. Through its perception and understanding of the environment, the robot can reach the target position without collision, and the robot can move to any target position. We compare the proposed algorithm, TFDueling, with different input structure algorithms, TDueling and FDueling, and with different neural network structures, TFDQN and TFDDQN. Experiments show that the proposed TFDueling algorithm has the highest accuracy and robustness.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments
    Aleksandrowicz, Maciej
    Jaworek-Korjakowska, Joanna
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2023, 14 (01) : 45 - 61
  • [42] Deep Reinforcement Learning for Workload Prediction in Federated Cloud Environments
    Ahamed, Zaakki
    Khemakhem, Maher
    Eassa, Fathy
    Alsolami, Fawaz
    Basuhail, Abdullah
    Jambi, Kamal
    SENSORS, 2023, 23 (15)
  • [43] Adaptive deep reinforcement learning for non-stationary environments
    Zhu, Jin
    Wei, Yutong
    Kang, Yu
    Jiang, Xiaofeng
    Dullerud, Geir E.
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (10)
  • [44] Adaptive deep reinforcement learning for non-stationary environments
    Jin Zhu
    Yutong Wei
    Yu Kang
    Xiaofeng Jiang
    Geir E. Dullerud
    Science China Information Sciences, 2022, 65
  • [45] Benchmarking Deep and Non-deep Reinforcement Learning Algorithms for Discrete Environments
    Duarte, Fernando F.
    Lau, Nuno
    Pereira, Artur
    Reis, Luis P.
    FOURTH IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, ROBOT 2019, VOL 2, 2020, 1093 : 263 - 275
  • [46] Multiagent reinforcement learning for a planetary exploration multirobot system
    Zhang Zheng
    Ma Shu-gen
    Cao Bing-gang
    Zhang Li-ping
    Li Bin
    AGENT COMPUTING AND MULTI-AGENT SYSTEMS, 2006, 4088 : 339 - 350
  • [47] HeterPS: Distributed deep learning with reinforcement learning based scheduling in heterogeneous environments
    Liu, Ji
    Wu, Zhihua
    Feng, Danlei
    Zhang, Minxu
    Wu, Xinxuan
    Yao, Xuefeng
    Yu, Dianhai
    Ma, Yanjun
    Zhao, Feng
    Dou, Dejing
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 148 : 106 - 117
  • [48] Modeling Complex Networks Based on Deep Reinforcement Learning
    Song, Wenbo
    Sheng, Wei
    Li, Dong
    Wu, Chong
    Ma, Jun
    FRONTIERS IN PHYSICS, 2022, 9
  • [49] Multi-constraint reinforcement learning in complex robot environments
    Sheng Han
    Hengrui Zhang
    Hao Wu
    Youfang Lin
    Kai Lv
    Frontiers of Computer Science, 2025, 19 (8)
  • [50] Reinforcement learning in complex environments through multiple adaptive partitions
    Bonarini, Andrea
    Lazaric, Alessandro
    Restelli, Marcello
    AI(ASTERISK)IA 2007: ARTIFICIAL INTELLIGENCE AND HUMAN-ORIENTED COMPUTING, 2007, 4733 : 531 - 542