Development Of Deep Reinforcement Learning Multi-Agent Framework Design Using Self-Organizing Map

被引:0
|
作者
Setyawan, Gembong Edhi [1 ]
Cholissodin, Imam [1 ]
机构
[1] Univ Brawijaya, Fac Comp Sci, Malang, Indonesia
关键词
framework design; deep reinforcement learning; q-learning; multi-agent; artificial neural network; self-organizing map;
D O I
10.1109/siet48054.2019.8986121
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The developmental steps and paradigm changes in the use of automation technology using deep reinforcement learning (RL) are very rapid because they are also widely accompanied by the development of deep learning combination, which combines RL algorithms. One of the combinations is Q-learning algorithm with one of the deep learning algorithms family of artificial neural networks (ANN) and part of the artificial intelligence science. The combination also becomes a challenge for many researchers because so far it is very difficult to find the right combination in accordance with the case resolved although there are also those that combine with non-ANN. In addition, most RLs only use a single combination, which means that they have not found the ideal combination, whether it should be a single one of the algorithms of ANN or some of it. This study proposes a framework design using the Self-Organizing Map (SOM) algorithm that adaptively combines and plays as the actor to calculate the final Q-value value that is updated from a single or multiple Q-value values in a sustainable and dynamic manner. The result of the formed framework indicates that SOM is able to provide an adaptive combination for the algorithms that should be used in deep RL.
引用
收藏
页码:246 / 250
页数:5
相关论文
共 50 条
  • [1] DESIGNING SELF-ORGANIZING SYSTEMS WITH DEEP MULTI-AGENT REINFORCEMENT LEARNING
    Ji, Hao
    Jin, Yan
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 7, 2020,
  • [2] HiSOMA: A hierarchical multi-agent model integrating self-organizing neural networks with multi-agent deep reinforcement learning
    Geng, Minghong
    Pateria, Shubham
    Subagdja, Budhitama
    Tan, Ah-Hwee
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
  • [3] Self-organizing cognitive agents and reinforcement learning in multi-agent environment
    Tan, AH
    Xiao, D
    2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2005, : 351 - 357
  • [4] Manufacturing resource-based self-organizing scheduling using multi-agent system and deep reinforcement learning
    Li, Yuxin
    Liu, Qihao
    Li, Xinyu
    Gao, Liang
    JOURNAL OF MANUFACTURING SYSTEMS, 2025, 79 : 179 - 198
  • [5] Strategy Analysis of Multi-Agent Games Using Self-Organizing Map
    Tominaga, Moeko
    Takemura, Yasunori
    Ishii, Kazuo
    ICAROB 2018: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS, 2018, : 757 - 760
  • [6] Design of Self-Organizing Systems Using Multi-Agent Reinforcement Learning and the Compromise Decision Support Problem Construct
    Jiang, Mingfei
    Ming, Zhenjun
    Li, Chuanhao
    Allen, Janet K.
    Mistree, Farrokh
    JOURNAL OF MECHANICAL DESIGN, 2024, 146 (05)
  • [7] DESIGN OF SELF-ORGANIZING SYSTEMS USING MULTI-AGENT REINFORCEMENT LEARNING AND THE COMPROMISE DECISION SUPPORT PROBLEM CONSTRUCT
    Jiang, Mingfei
    Ming, Zhenjun
    Li, Chuanhao
    Mistree, Farrokh
    Allen, Janet K.
    PROCEEDINGS OF ASME 2023 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2023, VOL 3A, 2023,
  • [8] Collaborative dynamic scheduling in a self-organizing manufacturing system using multi-agent reinforcement learning
    Gui, Yong
    Zhang, Zequn
    Tang, Dunbing
    Zhu, Haihua
    Zhang, Yi
    ADVANCED ENGINEERING INFORMATICS, 2024, 62
  • [9] Multi-agent machine learning in self-organizing systems
    Hejazi, Ehsan
    INFORMATION SCIENCES, 2021, 581 : 194 - 214
  • [10] A teaching method using a self-organizing map for reinforcement learning
    Takeshi Tateyama
    Seiichi Kawata
    Toshiki Oguchi
    Artificial Life and Robotics, 2004, 7 (4) : 193 - 197