Multirobot coordination with deep reinforcement learning in complex environments

Cited by: 18

Authors:

Wang, Di [1]

Deng, Hongbin [1]

Affiliations:

[1] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2021 / Vol. 180

Funding:

National Natural Science Foundation of China;

Keywords:

Multirobot coordination; Reinforcement learning; Deep learning; Visual perception; MANIPULATION;

DOI:

10.1016/j.eswa.2021.115128

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In a system of multiple autonomous robots, path planning must be completed in a coordinated and efficient way while the robots avoid interfering with one another, allocate resources and share information. Most traditional multirobot coordination algorithms assume a known environment and fix in advance both the target position of each robot and the robot priorities, which limits the robots' autonomy. Few approaches solve multirobot coordination using visual information alone. This paper proposes a multirobot cooperative algorithm based on deep reinforcement learning that gives each robot more autonomy in selecting a target position and moving to it. The approach is end-to-end and takes only two images as input: a robot-centered top view and a first-person view captured from the robot's own perspective. The proposed algorithm, which we call TFDueling, uses a dueling neural network structure and handles both task allocation and path planning. By perceiving and understanding its environment, a robot can move to any of the target positions and reach it without collision. We compare TFDueling with algorithms that use different inputs (TDueling and FDueling) and with algorithms that use different neural network structures (TFDQN and TFDDQN). Experiments show that TFDueling achieves the highest accuracy and robustness among the compared algorithms.
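
As a rough illustration of the dueling structure named in the abstract, the sketch below shows the commonly used dueling aggregation, in which a state value V(s) and per-action advantages A(s, a) are combined as Q(s, a) = V(s) + A(s, a) - mean(A). It is not the authors' implementation: the function names and the numbers are hypothetical, and the image encoders for the top view and the first-person view are omitted entirely.

```python
from typing import List


def dueling_q_values(state_value: float, advantages: List[float]) -> List[float]:
    """Combine V(s) and A(s, a) into Q(s, a) = V(s) + A(s, a) - mean(A).

    Subtracting the mean advantage keeps V and A identifiable.
    """
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + a - mean_advantage for a in advantages]


def greedy_action(q_values: List[float]) -> int:
    """Return the index of the action with the largest Q-value."""
    return max(range(len(q_values)), key=lambda i: q_values[i])


# Hypothetical outputs of the value and advantage streams for one state.
state_value = 1.0
advantages = [0.5, -0.5, 0.0, 1.0]

q = dueling_q_values(state_value, advantages)
best = greedy_action(q)
```

In a full agent, `state_value` and `advantages` would come from two heads of a network fed with image features; here they are plain numbers so that only the aggregation step is shown.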


Pages: 9

