DESIGN OF SELF-ORGANIZING SYSTEMS USING MULTI-AGENT REINFORCEMENT LEARNING AND THE COMPROMISE DECISION SUPPORT PROBLEM CONSTRUCT

被引：0

作者：

Jiang, Mingfei ^{[1
]}

Ming, Zhenjun ^{[2
]}

Li, Chuanhao ^{[3
]}

Mistree, Farrokh ^{[4
]}

Allen, Janet K. ^{[4
]}

机构：

[1] Beijing Inst Technol, Elect Engn, Beijing, Peoples R China

[2] Beijing Inst Technol, Beijing, Peoples R China

[3] Beijing Inst Technol, Mech Engn, Beijing, Peoples R China

[4] Univ Oklahoma, Syst Realizat Lab OU, Norman, OK 73019 USA

来源：

PROCEEDINGS OF ASME 2023 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2023, VOL 3A | 2023年

基金：

中国国家自然科学基金;

关键词：

Self-organizing System; Compromise Decision Support Problem; Box-Pushing Problem;

D O I：

暂无

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

How can multi-robot self-organizing systems be designed so that they show the desired behaviors and are able to perform tasks specified by designers? Multi-robot self-organizing systems, e.g., swarm robots, have great potential for adapting when performing complex tasks in a changing environment. However, such systems are difficult to design due to the stochasticity of the system performance and the non-linearity between the local actions/interaction and the desired global behavior. In order to address this, in this paper we propose a framework for designing self-organizing systems using Multi-Agent Reinforcement Learning (MARL) and the compromise Decision-Support Problem (cDSP) construct. The design framework consists of two stages - preliminary design and design improvement. In the preliminary design stage, MARL is used to help designers train the robots so that they show stable group behavior for performing the task. In the design improvement stage, the cDSP construct is used to enable designers to explore the design space and identify satisfactory solutions considering several performance indicators based on the trained system established in the previous stage. Between the two stages, surrogate models are used to map the relationship between local parameters and global performance indicators utilizing the data generated in preliminary design. The surrogate models represent the goal functions in the cDSP. A multi-robot box-pushing problem is used as an example to test the efficacy of the proposed framework. The framework is general and can be extended to design other multi-robot self-organizing systems. Our focus in this paper is in describing the framework.

引用

页数：13

共 50 条

[41] SELF-ORGANIZING MULTI-AGENT SYSTEM FOR MANAGEMENT AND PLANNING SURVEILLANCE ROUTES
Rodriguez, Sara
Tapia, Dante I.
de Paz, Juan F.
Bajo, Javier
Corchado, Juan M.
Abraham, Ajith
COMPUTING AND INFORMATICS, 2012, 31 (05) : 1081 - 1100
[42] Self-organizing multi-agent system for management and planning surveillance routes
Rodríguez, S. (srg@usal.es), 1600, Slovak Academy of Sciences (31):
[43] Multi-Agent Reinforcement Learning is A Sequence Modeling Problem
Wen, Muning
Kuba, Jakub Grudzien
Lin, Runji
Zhang, Weinan
Wen, Ying
Wang, Jun
Yang, Yaodong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[44] A Stepwise Refinement Based Development of Self-Organizing Multi-Agent Systems: Application to the Foraging Ants
Graja, Zeineb
Migeon, Fredrric
Maurel, Christine
Gleizes, Marie-Pierre
Kacem, Ahmed Hadj
ENGINEERING MULTI-AGENT SYSTEMS, EMAS 2014, 2014, 8758 : 40 - 57
[45] Application of Multi-agent Reinforcement Learning to the Dynamic Scheduling Problem in Manufacturing Systems
Heik, David
Bahrpeyma, Fouad
Reichelt, Dirk
MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT II, 2024, 14506 : 237 - 254
[46] Learning coordination in multi-agent systems using influence value reinforcement learning
Barrios-Aranibar, Dennis
Garcia Goncalves, Luiz Marcos
PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, : 471 - 476
[47] Reward design for driver repositioning using multi-agent reinforcement learning
Shou, Zhenyu
Di, Xuan
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 119
[48] Multi-agent Environment for Decision-Support in Production Systems Using Machine Learning Methods
Koźlak, Jaroslaw
Sniezynski, Bartlomiej
Wilk-Kolodziejczyk, Dorota
Leśniak, Albert
Jaśkowiec, Krzysztof
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2019, 11537 LNCS : 517 - 529
[49] Multi-agent Environment for Decision-Support in Production Systems Using Machine Learning Methods
Kozlak, Jaroslaw
Sniezynski, Bartlomiej
Wilk-Kolodziejczyk, Dorota
Lesniak, Albert
Jaskowiec, Krzysztof
COMPUTATIONAL SCIENCE - ICCS 2019, PT II, 2019, 11537 : 517 - 529
[50] A teaching method using a self-organizing map for reinforcement learning
Takeshi Tateyama
Seiichi Kawata
Toshiki Oguchi
Artificial Life and Robotics, 2004, 7 (4) : 193 - 197

← 1 2 3 4 5 →