A Deep Reinforcement Learning Approach Using Asymmetric Self-Play for Robust Multirobot Flocking

Authors
Jia, Yunjie [1]
Song, Yong [1]
Cheng, Jiyu [2]
Jin, Jiong [3]
Zhang, Wei [2]
Yang, Simon X. [4]
Kwong, Sam [5]
Affiliations
[1] Shandong Univ, Sch Mech Elect & Informat Engn, Shandong Key Lab Intelligent Elect Packaging Testi, Weihai 264209, Peoples R China
[2] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[3] Swinburne Univ Technol, Sch Sci Comp & Engn Technol, Hawthorn, VIC 3122, Australia
[4] Univ Guelph, Adv Robot & Intelligent Syst Lab, Guelph, ON N1G 2W1, Canada
[5] Lingnan Univ, Sch Data Sci, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Robots; Adaptation models; Training; Collision avoidance; Navigation; Multi-robot systems; Uncertainty; Robot sensing systems; Robustness; Vehicle dynamics; Adversarial training; Flocking; Multiagent deep reinforcement learning (MADRL); Autonomous vehicles; Nonlinear multiagent systems; Output regulation; Enhancement; UAVs
DOI
10.1109/TII.2024.3523576
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Flocking control, an essential approach for survivable navigation of multirobot systems, has been widely applied in fields such as logistics, service delivery, and search and rescue. However, realistic environments are typically complex, dynamic, and even aggressive, posing considerable threats to the safety of flocking robots. In this article, an Asymmetric Self-play-empowered Flocking Control framework based on deep reinforcement learning is proposed to address this concern. Specifically, the flocking robots are trained concurrently with learnable adversarial interferers to stimulate the intelligence of the flocking strategy. A two-stage self-play training paradigm is developed to improve the robustness and generalization of the model. Furthermore, an auxiliary training module for learning the transition dynamics is designed, dramatically enhancing adaptability to environmental uncertainties. Feature-level and agent-level attention are implemented for action and value generation, respectively. Both extensive comparative experiments and real-world deployment demonstrate the superiority and practicality of the proposed framework.
Pages: 10