Decision Making for Multi-Robot Fixture Planning Using Multi-Agent Reinforcement Learning

被引:0
|
作者
Canzini, Ethan [1 ,2 ]
Auledas-Noguera, Marc [1 ]
Pope, Simon [1 ]
Tiwari, Ashutosh [1 ]
机构
[1] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield S10 2TN, England
[2] Airbus Robot, Broughton CH4 0DR, England
基金
英国科研创新办公室; 英国工程与自然科学研究理事会;
关键词
Fixtures; Planning; Task analysis; Manufacturing; Reinforcement learning; Robots; Layout; Multi-agent systems; reinforcement learning; aerospace manufacturing; fixture planning; robotic fixturing; DESIGN;
D O I
10.1109/TASE.2024.3424677
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Within the realm of flexible manufacturing, fixture layout planning allows manufacturers to rapidly deploy optimal fixturing plans that can reduce surface deformation that leads to crack propagation in components during manufacturing tasks. The role of fixture layout planning has evolved from being performed by experienced engineers to computational methods due to the number of possible configurations for components. Current optimisation methods commonly fall into sub-optimal positions due to the existence of local optima, with data-driven machine learning techniques relying on costly to collect labelled training data. In this paper, we present a framework for multi-agent reinforcement learning with team decision theory to find optimal fixturing plans for manufacturing tasks. We demonstrate our approach on two representative aerospace components with complex geometries across a set of drilling tasks, illustrating the capabilities of our method; we will compare this against state of the art methods to showcase our method's improvement at finding optimal fixturing plans with 3 times the improvement in deformation control within tolerance bounds.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Multi-Robot Motion Planning with Dynamics Guided by Multi-Agent Search
    Le, Duong
    Plaku, Erion
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5314 - 5318
  • [12] Expert Knowledge Based Multi-Agent Reinforcement Learning and Its Application in Multi-Robot Hunting Problem
    Wei, Zhanyang
    Zhang, Wanpeng
    Chen, Jing
    Yang, Zhen
    [J]. PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 2687 - 2692
  • [13] Multi-Agent Reinforcement Learning based on K-Means Clustering in Multi-Robot Cooperative Systems
    Liu Chang-an
    Liu Fei
    Liu Chun-yang
    Wu Hua
    [J]. OPTICAL, ELECTRONIC MATERIALS AND APPLICATIONS, PTS 1-2, 2011, 216 : 75 - 80
  • [14] A Multi-agent Architecture for Multi-robot Surveillance
    Vallejo, David
    Remagnino, Paolo
    Monekosso, Dorothy N.
    Jimenez, Luis
    Gonzalez, Carlos
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE: SEMANTIC WEB, SOCIAL NETWORKS AND MULTIAGENT SYSTEMS, 2009, 5796 : 266 - +
  • [15] Multi-Agent Decision Fusion and its application in Adversarial Multi-Robot System
    Fan, Bo
    Pu, Jiexin
    [J]. 2008 INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND TRAINING AND 2008 INTERNATIONAL WORKSHOP ON GEOSCIENCE AND REMOTE SENSING, VOL 1, PROCEEDINGS, 2009, : 672 - 675
  • [16] A multi-agent reinforcement learning approach to robot soccer
    Yong Duan
    Bao Xia Cui
    Xin He Xu
    [J]. Artificial Intelligence Review, 2012, 38 : 193 - 211
  • [17] A multi-agent reinforcement learning approach to robot soccer
    Duan, Yong
    Cui, Bao Xia
    Xu, Xin He
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2012, 38 (03) : 193 - 211
  • [18] Combining planning with reinforcement learning for multi-robot task allocation
    Strens, M
    Windelinckx, N
    [J]. ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS II: ADAPTATION AND MULTI-AGENT LEARNING, 2005, 3394 : 260 - 274
  • [19] Distributed safe reinforcement learning for multi-robot motion planning
    Lu, Yang
    Guo, Yaohua
    Zhao, Guoxiang
    Zhu, Minghui
    [J]. 2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 1209 - 1214
  • [20] Multi-agent reinforcement learning as a rehearsal for decentralized planning
    Kraemer, Landon
    Banerjee, Bikramjit
    [J]. NEUROCOMPUTING, 2016, 190 : 82 - 94