Formation Control With Collision Avoidance Through Deep Reinforcement Learning Using Model-Guided Demonstration

被引:0
|
作者
Sui, Zezhi [1 ,2 ]
Pu, Zhiqiang [1 ,2 ,3 ]
Yi, Jianqiang [1 ,2 ,3 ]
Wu, Shiguang [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Taizhou Inst Intelligent Mfg, Taizhou 225300, Peoples R China
关键词
Collision avoidance; Training; Maintenance engineering; Machine learning; Multi-agent systems; Task analysis; deep reinforcement learning (DRL); formation control; leader-follower; FOLLOWER FORMATION CONTROL; MOBILE ROBOTS; ENVIRONMENT; CONSENSUS; VEHICLES; SYSTEMS;
D O I
10.1109/TNNLS.2020.3004893
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating collision-free, time-efficient paths in an uncertain dynamic environment poses huge challenges for the formation control with collision avoidance (FCCA) problem in a leader-follower structure. In particular, the followers have to take both formation maintenance and collision avoidance into account simultaneously. Unfortunately, most of the existing works are simple combinations of methods dealing with the two problems separately. In this article, a new method based on deep reinforcement learning (RL) is proposed to solve the problem of FCCA. Especially, the learning-based policy is extended to the field of formation control, which involves a two-stage training framework: an imitation learning (IL) and later an RL. In the IL stage, a model-guided method consisting of a consensus theory-based formation controller and an optimal reciprocal collision avoidance strategy is designed to speed up training and increase efficiency. In the RL stage, a compound reward function is presented to guide the training. In addition, we design a formation-oriented network structure to perceive the environment. Long short-term memory is adopted to enable the network structure to perceive the information of obstacles of an uncertain number, and a transfer training approach is adopted to improve the generalization of the network in different scenarios. Numerous representative simulations are conducted, and our method is further deployed to an experimental platform based on a multiomnidirectional-wheeled car system. The effectiveness and practicability of our proposed method are validated through both the simulation and experiment results.
引用
收藏
页码:2358 / 2372
页数:15
相关论文
共 50 条
  • [1] Formation Control with Collision Avoidance through Deep Reinforcement Learning
    Sui, Zezhi
    Pu, Zhiqiang
    Yi, Jianqiang
    Xiong, Tianyi
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [2] Vision-guided Collision Avoidance through Deep Reinforcement Learning
    Song, Sirui
    Zhang, Yuanhang
    Qin, Xi
    Saunders, Kirk
    Liu, Jundong
    [J]. PROCEEDINGS OF THE 2021 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2021, : 191 - 194
  • [3] Space Manipulator Collision Avoidance Using a Deep Reinforcement Learning Control
    Blaise, James
    Bazzocchi, Michael C. F.
    [J]. AEROSPACE, 2023, 10 (09)
  • [4] Vehicles Control: Collision Avoidance using Federated Deep Reinforcement Learning
    Ben Elallid, Badr
    Abouaomar, Amine
    Benamar, Nabil
    Kobbane, Abdellatif
    [J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 4369 - 4374
  • [5] Pedestrian Collision Avoidance Using Deep Reinforcement Learning
    Alireza Rafiei
    Amirhossein Oliaei Fasakhodi
    Farshid Hajati
    [J]. International Journal of Automotive Technology, 2022, 23 : 613 - 622
  • [6] Pedestrian Collision Avoidance Using Deep Reinforcement Learning
    Rafiei, Alireza
    Fasakhodi, Amirhossein Oliaei
    Hajati, Farshid
    [J]. INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2022, 23 (03) : 613 - 622
  • [7] A learning method for AUV collision avoidance through deep reinforcement learning
    Xu, Jian
    Huang, Fei
    Wu, Di
    Cui, Yunfei
    Yan, Zheping
    Du, Xue
    [J]. OCEAN ENGINEERING, 2022, 260
  • [8] Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning
    Song, Sirui
    Saunders, Kirk
    Yue, Ye
    Liu, Jundong
    [J]. 2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 914 - 919
  • [9] A Deep Reinforcement Learning Approach Combined With Model-Based Paradigms for Multiagent Formation Control With Collision Avoidance
    Pu, Zhiqiang
    Zhang, Tianle
    Ai, Xiaolin
    Qiu, Tenghai
    Yi, Jianqiang
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (07): : 4189 - 4204
  • [10] Ship Collision Avoidance Using Constrained Deep Reinforcement Learning
    Zhang, Rui
    Wang, Xiao
    Liu, Kezhong
    Wu, Xiaolie
    Lu, Tianyou
    Chao Zhaohui
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC, AND SOCIO-CULTURAL COMPUTING (BESC), 2018, : 115 - 120