Teaching on a Budget in Multi-Agent Deep Reinforcement Learning

被引：0

作者：

Ilhan, Ercument ^{[1
]}

Gow, Jeremy ^{[1
]}

Perez-Liebana, Diego ^{[1
]}

机构：

[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England

来源：

2019 IEEE CONFERENCE ON GAMES (COG) | 2019年

关键词：

multi-agent; reinforcement learning; deep q-networks; action advising; teacher-student;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep Reinforcement Learning (RL) algorithms can solve complex sequential decision tasks successfully. However, they have a major drawback of having poor sample efficiency which can often be tackled by knowledge reuse. In Multi-Agent Reinforcement Learning (MARL) this drawback becomes worse, but at the same time, a new set of opportunities to leverage knowledge are also presented through agent interactions. One promising approach among these is peer-to-peer action advising through a teacher-student framework. Despite being introduced for single-agent RL originally, recent studies show that it can also be applied to multi-agent scenarios with promising empirical results. However, studies in this line of research are currently very limited. In this paper, we propose heuristics-based action advising techniques in cooperative decentralised MARL, using a nonlinear function approximation based task-level policy. By adopting Random Network Distillation technique, we devise a measurement for agents to assess their knowledge in any given state and be able to initiate the teacher-student dynamics with no prior role assumptions. Experimental results in a gridworld environment show that such an approach may indeed be useful and needs to be further investigated.

引用

页数：8

共 50 条

[1] Multi-agent deep reinforcement learning: a survey
Sven Gronauer
Klaus Diepold
[J]. Artificial Intelligence Review, 2022, 55 : 895 - 943
[2] Deep reinforcement learning for multi-agent interaction
Ahmed, Ibrahim H.
Brewitt, Cillian
Carlucho, Ignacio
Christianos, Filippos
Dunion, Mhairi
Fosong, Elliot
Garcin, Samuel
Guo, Shangmin
Gyevnar, Balint
McInroe, Trevor
Papoudakis, Georgios
Rahman, Arrasy
Schafer, Lukas
Tamborski, Massimiliano
Vecchio, Giuseppe
Wang, Cheng
Albrecht, Stefano, V
[J]. AI COMMUNICATIONS, 2022, 35 (04) : 357 - 368
[3] HALFTONING WITH MULTI-AGENT DEEP REINFORCEMENT LEARNING
Jiang, Haitian
Xiong, Dongliang
Jiang, Xiaowen
Yin, Aiguo
Ding, Li
Huang, Kai
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 641 - 645
[4] Multi-agent deep reinforcement learning: a survey
Gronauer, Sven
Diepold, Klaus
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (02) : 895 - 943
[5] Lenient Multi-Agent Deep Reinforcement Learning
Palmer, Gregory
Tuyls, Karl
Bloembergen, Daan
Savani, Rahul
[J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 443 - 451
[6] Deep Multi-Agent Reinforcement Learning: A Survey
Liang, Xing-Xing
Feng, Yang-He
Ma, Yang
Cheng, Guang-Quan
Huang, Jin-Cai
Wang, Qi
Zhou, Yu-Zhen
Liu, Zhong
[J]. Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (12): : 2537 - 2557
[7] Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Foerster, Jakob N.
Assael, Yannis M.
de Freitas, Nando
Whiteson, Shimon
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[8] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
Malysheva, Aleksandra
Kudenko, Daniel
Shpilman, Aleksei
[J]. 2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
[9] A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning
Yi Liu
Xiang Wu
Yuming Bo
Jiacun Wang
Lifeng Ma
[J]. IEEE/CAA Journal of Automatica Sinica., 2024, 11 (11) - 2348
[10] Experience Selection in Multi-Agent Deep Reinforcement Learning
Wang, Yishen
Zhang, Zongzhang
[J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 864 - 870

← 1 2 3 4 5 →