DESIGN OF SELF-ORGANIZING SYSTEMS USING MULTI-AGENT REINFORCEMENT LEARNING AND THE COMPROMISE DECISION SUPPORT PROBLEM CONSTRUCT

被引:0
|
作者
Jiang, Mingfei [1 ]
Ming, Zhenjun [2 ]
Li, Chuanhao [3 ]
Mistree, Farrokh [4 ]
Allen, Janet K. [4 ]
机构
[1] Beijing Inst Technol, Elect Engn, Beijing, Peoples R China
[2] Beijing Inst Technol, Beijing, Peoples R China
[3] Beijing Inst Technol, Mech Engn, Beijing, Peoples R China
[4] Univ Oklahoma, Syst Realizat Lab OU, Norman, OK 73019 USA
基金
中国国家自然科学基金;
关键词
Self-organizing System; Compromise Decision Support Problem; Box-Pushing Problem;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
How can multi-robot self-organizing systems be designed so that they show the desired behaviors and are able to perform tasks specified by designers? Multi-robot self-organizing systems, e.g., swarm robots, have great potential for adapting when performing complex tasks in a changing environment. However, such systems are difficult to design due to the stochasticity of the system performance and the non-linearity between the local actions/interaction and the desired global behavior. In order to address this, in this paper we propose a framework for designing self-organizing systems using Multi-Agent Reinforcement Learning (MARL) and the compromise Decision-Support Problem (cDSP) construct. The design framework consists of two stages - preliminary design and design improvement. In the preliminary design stage, MARL is used to help designers train the robots so that they show stable group behavior for performing the task. In the design improvement stage, the cDSP construct is used to enable designers to explore the design space and identify satisfactory solutions considering several performance indicators based on the trained system established in the previous stage. Between the two stages, surrogate models are used to map the relationship between local parameters and global performance indicators utilizing the data generated in preliminary design. The surrogate models represent the goal functions in the cDSP. A multi-robot box-pushing problem is used as an example to test the efficacy of the proposed framework. The framework is general and can be extended to design other multi-robot self-organizing systems. Our focus in this paper is in describing the framework.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] SELF-ORGANIZING MULTI-AGENT SYSTEM FOR MANAGEMENT AND PLANNING SURVEILLANCE ROUTES
    Rodriguez, Sara
    Tapia, Dante I.
    de Paz, Juan F.
    Bajo, Javier
    Corchado, Juan M.
    Abraham, Ajith
    COMPUTING AND INFORMATICS, 2012, 31 (05) : 1081 - 1100
  • [42] Self-organizing multi-agent system for management and planning surveillance routes
    Rodríguez, S. (srg@usal.es), 1600, Slovak Academy of Sciences (31):
  • [43] Multi-Agent Reinforcement Learning is A Sequence Modeling Problem
    Wen, Muning
    Kuba, Jakub Grudzien
    Lin, Runji
    Zhang, Weinan
    Wen, Ying
    Wang, Jun
    Yang, Yaodong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [44] A Stepwise Refinement Based Development of Self-Organizing Multi-Agent Systems: Application to the Foraging Ants
    Graja, Zeineb
    Migeon, Fredrric
    Maurel, Christine
    Gleizes, Marie-Pierre
    Kacem, Ahmed Hadj
    ENGINEERING MULTI-AGENT SYSTEMS, EMAS 2014, 2014, 8758 : 40 - 57
  • [45] Application of Multi-agent Reinforcement Learning to the Dynamic Scheduling Problem in Manufacturing Systems
    Heik, David
    Bahrpeyma, Fouad
    Reichelt, Dirk
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT II, 2024, 14506 : 237 - 254
  • [46] Learning coordination in multi-agent systems using influence value reinforcement learning
    Barrios-Aranibar, Dennis
    Garcia Goncalves, Luiz Marcos
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, : 471 - 476
  • [47] Reward design for driver repositioning using multi-agent reinforcement learning
    Shou, Zhenyu
    Di, Xuan
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 119
  • [48] Multi-agent Environment for Decision-Support in Production Systems Using Machine Learning Methods
    Koźlak, Jaroslaw
    Sniezynski, Bartlomiej
    Wilk-Kolodziejczyk, Dorota
    Leśniak, Albert
    Jaśkowiec, Krzysztof
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2019, 11537 LNCS : 517 - 529
  • [49] Multi-agent Environment for Decision-Support in Production Systems Using Machine Learning Methods
    Kozlak, Jaroslaw
    Sniezynski, Bartlomiej
    Wilk-Kolodziejczyk, Dorota
    Lesniak, Albert
    Jaskowiec, Krzysztof
    COMPUTATIONAL SCIENCE - ICCS 2019, PT II, 2019, 11537 : 517 - 529
  • [50] A teaching method using a self-organizing map for reinforcement learning
    Takeshi Tateyama
    Seiichi Kawata
    Toshiki Oguchi
    Artificial Life and Robotics, 2004, 7 (4) : 193 - 197