Development Of Deep Reinforcement Learning Multi-Agent Framework Design Using Self-Organizing Map

被引:0
|
作者
Setyawan, Gembong Edhi [1 ]
Cholissodin, Imam [1 ]
机构
[1] Univ Brawijaya, Fac Comp Sci, Malang, Indonesia
关键词
framework design; deep reinforcement learning; q-learning; multi-agent; artificial neural network; self-organizing map;
D O I
10.1109/siet48054.2019.8986121
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The developmental steps and paradigm changes in the use of automation technology using deep reinforcement learning (RL) are very rapid because they are also widely accompanied by the development of deep learning combination, which combines RL algorithms. One of the combinations is Q-learning algorithm with one of the deep learning algorithms family of artificial neural networks (ANN) and part of the artificial intelligence science. The combination also becomes a challenge for many researchers because so far it is very difficult to find the right combination in accordance with the case resolved although there are also those that combine with non-ANN. In addition, most RLs only use a single combination, which means that they have not found the ideal combination, whether it should be a single one of the algorithms of ANN or some of it. This study proposes a framework design using the Self-Organizing Map (SOM) algorithm that adaptively combines and plays as the actor to calculate the final Q-value value that is updated from a single or multiple Q-value values in a sustainable and dynamic manner. The result of the formed framework indicates that SOM is able to provide an adaptive combination for the algorithms that should be used in deep RL.
引用
收藏
页码:246 / 250
页数:5
相关论文
共 50 条
  • [41] Toward requirements engineering for self-organizing multi-agent systems
    Sudeikat, Jan
    Renz, Wolfgang
    FIRST IEEE INTERNATIONAL CONFERENCE ON SELF-ADAPTIVE AND SELF-ORGANIZING SYSTEMS, 2007, : 299 - +
  • [42] Acquisition of the relation between vision and action using Self-Organizing Map and reinforcement learning
    Terada, Kazunori
    Takeda, Hideaki
    Nishida, Toyoaki
    International Conference on Knowledge-Based Intelligent Electronic Systems, Proceedings, KES, 1998, 1 : 429 - 434
  • [43] A Self-Organizing Multi-Agent System for Distributed Voltage Regulation
    Al Faiya, Badr
    Athanasiadis, Dimitrios
    Chen, Minjiang
    McArthur, Stephen
    Kockar, Ivana
    Lu, Haowei
    de Leon, Francisco
    IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (05) : 4102 - 4112
  • [44] Multi-agent Self-organizing Scheme for Chemical Patent Datamining
    Krishnamurthy, E. V.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2011), VOL 2, 2012, 131 : 41 - 51
  • [45] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [46] A fully value distributional deep reinforcement learning framework for multi-agent cooperation
    Fu, Mingsheng
    Huang, Liwei
    Li, Fan
    Qu, Hong
    Xu, Chengzhong
    NEURAL NETWORKS, 2025, 184
  • [47] SELF-ORGANIZING SYNCHRONICITY AND DESYNCHRONICITY USING REINFORCEMENT LEARNING
    Mihaylov, Mihail
    Le Borgne, Yann-Ael
    Nowe, Ann
    Tuyls, Karl
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2011, : 94 - 103
  • [48] Development of a Reference Signal Self-Organizing Control System Based on Deep Reinforcement Learning
    Iwasaki, Hiromichi
    Okuyama, Atsushi
    2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS (ICM), 2021,
  • [49] A multi-agent deep reinforcement learning framework for algorithmic trading in financial markets
    Shavandi, Ali
    Khedmati, Majid
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 208
  • [50] Information Design in Multi-Agent Reinforcement Learning
    Lin, Yue
    Li, Wenhao
    Zha, Hongyuan
    Wang, Baoxian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,