Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method

被引:0
|
作者
Xu, Zhao [1 ]
Lyu, Yang [2 ]
Pan, Quan [2 ]
Hu, Jinwen [2 ]
Zhao, Chunhui [2 ]
Liu, Shuai [3 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Sch Automat, Minist Educ, Key Lab Informat Fus, Xian 710072, Shaanxi, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Shandong, Peoples R China
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Flocking control has been studied extensively along with the wide application of multi-vehicle systems. In this paper the Multi-vehicles System (MVS) flocking control with collision avoidance and communication preserving is considered based on the deep reinforcement learning framework. Specifically the deep deterministic policy gradient (DDPG) with centralized training and distributed execution process is implemented to obtain the flocking control policy. First, to avoid the dynamically changed observation of state, a three layers tensor based representation of the observation is used so that the state remains constant although the observation dimension is changing. A reward function is designed to guide the way-points tracking, collision avoidance and communication preserving. The reward function is augmented by introducing the local reward function of neighbors. Finally, a centralized training process which trains the shared policy based on common training set among all agents. The proposed method is tested under simulated scenarios with different setup.
引用
收藏
页码:306 / 311
页数:6
相关论文
共 50 条
  • [1] Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
    Berahman, Mehran
    Rostami-Shahrbabaki, Majid
    Bogenberger, Klaus
    [J]. FUTURE TRANSPORTATION, 2022, 2 (04): : 1028 - 1046
  • [2] Flocking of multi-vehicle systems with a leader
    Liu, Bo
    Chu, Tianguang
    Wang, Long
    [J]. 2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 5948 - +
  • [3] Research on deep deterministic policy gradient guidance method for reentry vehicle
    Guo, Dongzi
    Huang, Rong
    Xu, Hechuan
    Sun, Liwei
    Cui, Naigang
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2022, 44 (06): : 1942 - 1949
  • [4] A Method of Attitude Control Based on Deep Deterministic Policy Gradient
    Zhang, Jian
    Wu, Fengge
    Zhao, Junsuo
    Xu, Fanjiang
    [J]. COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II, 2019, 1006 : 197 - 207
  • [5] A Multi-Agent Deep Deterministic Policy Gradient Method for Multi-Zone HVAC Control
    Liu, Xuebo
    Wu, Yingying
    Liu, Bo
    Wu, Hongyu
    [J]. 2023 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, PESGM, 2023,
  • [6] Multi-vehicle flocking: Scalability of cooperative control algorithms using pairwise potentials
    Chuang, Yao-Li
    Huang, Yuan R.
    D'Orsogna, Maria R.
    Bertozzi, Andrea L.
    [J]. PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, : 2292 - +
  • [7] A Deep Deterministic Policy Gradient Approach for Vehicle Speed Tracking Control With a Robotic Driver
    Hao, Gaofeng
    Fu, Zhuang
    Feng, Xin
    Gong, Zening
    Chen, Peng
    Wang, Dan
    Wang, Weibin
    Si, Yang
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (03) : 2514 - 2525
  • [8] AUTONOMOUS VEHICLE DRIVING VIA DEEP DETERMINISTIC POLICY GRADIENT
    Huang, Wenhui
    Braghin, Francesco
    Arrigoni, Stefano
    [J]. PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 3, 2020,
  • [9] Control Method for PEMFC Using Improved Deep Deterministic Policy Gradient Algorithm
    Li, Jiawen
    Li, Yaping
    Yu, Tao
    [J]. FRONTIERS IN ENERGY RESEARCH, 2021, 9
  • [10] Deep Recurrent Deterministic Policy Gradient for Physical Control
    Zhang, Lei
    Han, Shuai
    Zhang, Zhiruo
    Li, Lefan
    Lu, Shuai
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 257 - 268