Development Of Deep Reinforcement Learning Multi-Agent Framework Design Using Self-Organizing Map

被引：0

作者：

Setyawan, Gembong Edhi ^{[1
]}

Cholissodin, Imam ^{[1
]}

机构：

[1] Univ Brawijaya, Fac Comp Sci, Malang, Indonesia

来源：

PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET 2019) | 2019年

关键词：

framework design; deep reinforcement learning; q-learning; multi-agent; artificial neural network; self-organizing map;

D O I：

10.1109/siet48054.2019.8986121

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The developmental steps and paradigm changes in the use of automation technology using deep reinforcement learning (RL) are very rapid because they are also widely accompanied by the development of deep learning combination, which combines RL algorithms. One of the combinations is Q-learning algorithm with one of the deep learning algorithms family of artificial neural networks (ANN) and part of the artificial intelligence science. The combination also becomes a challenge for many researchers because so far it is very difficult to find the right combination in accordance with the case resolved although there are also those that combine with non-ANN. In addition, most RLs only use a single combination, which means that they have not found the ideal combination, whether it should be a single one of the algorithms of ANN or some of it. This study proposes a framework design using the Self-Organizing Map (SOM) algorithm that adaptively combines and plays as the actor to calculate the final Q-value value that is updated from a single or multiple Q-value values in a sustainable and dynamic manner. The result of the formed framework indicates that SOM is able to provide an adaptive combination for the algorithms that should be used in deep RL.

引用

页码：246 / 250

页数：5

共 50 条

[1] DESIGNING SELF-ORGANIZING SYSTEMS WITH DEEP MULTI-AGENT REINFORCEMENT LEARNING
Ji, Hao
Jin, Yan
PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 7, 2020,
[2] HiSOMA: A hierarchical multi-agent model integrating self-organizing neural networks with multi-agent deep reinforcement learning
Geng, Minghong
Pateria, Shubham
Subagdja, Budhitama
Tan, Ah-Hwee
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
[3] Self-organizing cognitive agents and reinforcement learning in multi-agent environment
Tan, AH
Xiao, D
2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2005, : 351 - 357
[4] Manufacturing resource-based self-organizing scheduling using multi-agent system and deep reinforcement learning
Li, Yuxin
Liu, Qihao
Li, Xinyu
Gao, Liang
JOURNAL OF MANUFACTURING SYSTEMS, 2025, 79 : 179 - 198
[5] Strategy Analysis of Multi-Agent Games Using Self-Organizing Map
Tominaga, Moeko
Takemura, Yasunori
Ishii, Kazuo
ICAROB 2018: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS, 2018, : 757 - 760
[6] Design of Self-Organizing Systems Using Multi-Agent Reinforcement Learning and the Compromise Decision Support Problem Construct
Jiang, Mingfei
Ming, Zhenjun
Li, Chuanhao
Allen, Janet K.
Mistree, Farrokh
JOURNAL OF MECHANICAL DESIGN, 2024, 146 (05)
[7] DESIGN OF SELF-ORGANIZING SYSTEMS USING MULTI-AGENT REINFORCEMENT LEARNING AND THE COMPROMISE DECISION SUPPORT PROBLEM CONSTRUCT
Jiang, Mingfei
Ming, Zhenjun
Li, Chuanhao
Mistree, Farrokh
Allen, Janet K.
PROCEEDINGS OF ASME 2023 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2023, VOL 3A, 2023,
[8] Collaborative dynamic scheduling in a self-organizing manufacturing system using multi-agent reinforcement learning
Gui, Yong
Zhang, Zequn
Tang, Dunbing
Zhu, Haihua
Zhang, Yi
ADVANCED ENGINEERING INFORMATICS, 2024, 62
[9] Multi-agent machine learning in self-organizing systems
Hejazi, Ehsan
INFORMATION SCIENCES, 2021, 581 : 194 - 214
[10] A teaching method using a self-organizing map for reinforcement learning
Takeshi Tateyama
Seiichi Kawata
Toshiki Oguchi
Artificial Life and Robotics, 2004, 7 (4) : 193 - 197

← 1 2 3 4 5 →