Autonomous Cooperative Hunting with Rule-Based and Self-Learning Control for Multiagent Systems

被引:0
|
作者
Luo, Jiaxiang [1 ,2 ]
Xu, Bozhe [1 ]
Li, Xiangyang [1 ,3 ]
Yao, Zhannan [1 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China
[2] Minist Educ, Engn Ctr Precis Elect Mfg Equipment, Guangzhou, Peoples R China
[3] Minist Educ, Key Lab Autonomous Syst & Networked Control, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Multiagent system; Cooperative control; Reinforcement learning; Imitation learning; Collision avoidance; GROUP-SIZE; PURSUIT; SUCCESS;
D O I
10.1007/s10846-024-02177-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers the problem of autonomous cooperative hunting in an unknown dynamic environment, where a group of mobile agents collaborate to capture a moving target. Due to the decentralized decision-making nature of multi-agent systems and the presence of real-world constraints, it is a challenging task. To solve this problem, an artificial rule based hunting algorithm (AR-HA) is firstly developed based on the principles of attraction and repulsion with heading adjustment, and each agent is controlled by the designed rules. Then, to further enhance the stability of cooperative hunting, a self-learning algorithm based on Twin Delayed Deep Deterministic policy gradient (SL-TD3) is proposed. Each agent is governed by its own SL-TD3 controller and learns independently from its interaction with the environment, taking advantage of the reward function designed based on the control rules of AR-HA. Besides, in order to improve training efficiency, imitation learning is employed to initialize the actor network. Experiments on both virtual and real robots demonstrate the effectiveness of the proposed algorithms for autonomous cooperative hunting.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] NEURAL NETWORKS FOR SELF-LEARNING CONTROL-SYSTEMS
    NGUYEN, DH
    WIDROW, B
    INTERNATIONAL JOURNAL OF CONTROL, 1991, 54 (06) : 1439 - 1451
  • [32] A self-learning human-machine cooperative control method based on driver intention recognition
    Jiang, Yan
    Ding, Yuyan
    Zhang, Xinglong
    Xu, Xin
    Huang, Junwen
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (05) : 1101 - 1115
  • [33] Towards design of complete rule-based control systems
    Ligeza, A
    ARTIFICIAL INTELLIGENCE IN REAL-TIME CONTROL 1995 (AIRTC'95), 1996, : 167 - 172
  • [34] Optimal Self-Learning Cooperative Control for Continuous-Time Heterogeneous Multi-Agent Systems
    Wei Qinglai
    Liu Derong
    Song Ruizhuo
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3005 - 3010
  • [35] Rule-based price control for bike sharing systems
    Ruch, Claudio
    Warrington, Joseph
    Morari, Manfred
    2014 EUROPEAN CONTROL CONFERENCE (ECC), 2014, : 708 - 713
  • [36] Cooperative avoidance control for multiagent systems
    Stipanovic, Dusan M.
    Hokayem, Peter F.
    Spong, Mark W.
    Siljak, Dragoslav D.
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2007, 129 (05): : 699 - 707
  • [37] Multiobjective Rule-Based Cooperative Continuous Ant Colony Optimized Fuzzy Systems With a Robot Control Application
    Juang, Chia-Feng
    Lin, Chan-Hung
    Bui, Trong Bac
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (02) : 650 - 663
  • [38] Convex Temporal Convolutional Network-Based Distributed Cooperative Learning Control for Multiagent Systems
    Chen, Shaofeng
    Kang, Yu
    Di, Jian
    Li, Pengfei
    Cao, Yang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 5234 - 5243
  • [39] Effects of self-assessment on retention in rule-based learning
    Hassmen, P
    Hunt, DP
    Dybeck, C
    PERCEPTUAL AND MOTOR SKILLS, 2002, 94 (01) : 296 - 306