Multi-agent Double Deep Q-Networks

被引:2
|
作者
Simoes, David [1 ,2 ,3 ]
Lau, Nuno [1 ,3 ]
Reis, Luis Paulo [1 ,2 ,4 ]
机构
[1] Univ Aveiro, IEETA Inst Elect & Informat Engn Aveiro, Aveiro, Portugal
[2] LIACC Artificial Intelligence & Comp Sci Lab, Porto, Portugal
[3] Univ Aveiro, DETI UA Elect Telecommun & Informat Dept, Aveiro, Portugal
[4] Univ Minho, DSI EEUM Informat Syst Dept, Sch Engn, Braga, Portugal
关键词
D O I
10.1007/978-3-319-65340-2_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are many open issues and challenges in the multi-agent reward-based learning field. Theoretical convergence guarantees are lost, and the complexity of the action-space is also exponential to the amount of agents calculating their optimal joint-action. Function approximators, such as deep neural networks, have successfully been used in singleagent environments with high dimensional state-spaces. We propose the Multi-agent Double Deep Q-Networks algorithm, an extension of Deep Q-Networks to the multi-agent paradigm. Two common techniques of multi-agent Q-learning are used to formally describe our proposal, and are tested in a Foraging Task and a Pursuit Game. We also demonstrate how they can generalize to similar tasks and to larger teams, due to the strength of deep-learning techniques, and their viability for transfer learning approaches. With only a small fraction of the initial task's training, we adapt to longer tasks, and we accelerate the task completion by increasing the team size, thus empirically demonstrating a solution to the complexity issues of the multi-agent field.
引用
收藏
页码:123 / 134
页数:12
相关论文
共 50 条
  • [1] Enhancing medical image object detection with collaborative multi-agent deep Q-networks and multi-scale representation
    Wang, Qinghui
    Liu, Fenglin
    Zou, Ruirui
    Wang, Ying
    Zheng, Chenyang
    Tian, Zhiqiang
    Du, Shaoyi
    Zeng, Wei
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2023, 2023 (01)
  • [2] Enhancing medical image object detection with collaborative multi-agent deep Q-networks and multi-scale representation
    Qinghui Wang
    Fenglin Liu
    Ruirui Zou
    Ying Wang
    Chenyang Zheng
    Zhiqiang Tian
    Shaoyi Du
    Wei Zeng
    [J]. EURASIP Journal on Advances in Signal Processing, 2023
  • [3] Multi-Agent Deep Q-Networks for Efficient Edge Federated Learning Communications in Software-Defined IoT
    Tam, Prohim
    Math, Sa
    Lee, Ahyoung
    Kim, Seokhoon
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (02): : 3319 - 3335
  • [4] Multi-Agent Double Deep Q-Learning for Beamforming in mmWave MIMO Networks
    Wang, Xueyuan
    Gursoy, M. Cenk
    [J]. 2020 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC), 2020,
  • [5] Agent Decision Processes Using Double Deep Q-Networks plus Minimax Q-Learning
    Fitch, Natalie
    Clancy, Daniel
    [J]. 2021 IEEE AEROSPACE CONFERENCE (AEROCONF 2021), 2021,
  • [6] Deep Abstract Q-Networks
    Roderick, Melrose
    Grimm, Christopher
    Tellex, Stefanie
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 131 - 138
  • [7] Algorithmic Trading Using Double Deep Q-Networks and Sentiment Analysis
    Tabaro, Leon
    Kinani, Jean Marie Vianney
    Rosales-Silva, Alberto Jorge
    Salgado-Ramirez, Julio Cesar
    Mujica-Vargas, Dante
    Escamilla-Ambrosio, Ponciano Jorge
    Ramos-Diaz, Eduardo
    [J]. INFORMATION, 2024, 15 (08)
  • [8] Double Deep Q-Networks for Optimizing Electricity Cost of a Water Heater
    Amasyali, Kadir
    Kurte, Kuldeep
    Zandi, Helia
    Munk, Jeffrey
    Kotevska, Olivera
    Smith, Robert
    [J]. 2021 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE (ISGT), 2021,
  • [9] Wireless Lan Performance Enhancement Using Double Deep Q-Networks
    Asaf, Khizra
    Khan, Bilal
    Kim, Ga-Young
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [10] Autonomous UAV Navigation in Dynamic Environments with Double Deep Q-Networks
    Yang, Yupeng
    Zhang, Kai
    Liu, Dahai
    Song, Houbing
    [J]. 2020 AIAA/IEEE 39TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC) PROCEEDINGS, 2020,