Competitive Reinforcement Learning Agents with Adaptive Networks

Cited by: 0

Authors:

Nordaunet, Herman Pareli [1]

Bo, Trym [1]

Kassab, Evan Jasund [1]

Veenstra, Frank [1]

Cote-Allard, Ulysse [1]

Affiliations:

[1] University of Oslo, Department of Informatics, Oslo, Norway

Source:

2023 11th International Conference on Control, Mechatronics and Automation (ICCMA) | 2023

Keywords:

Deep Reinforcement Learning; Adaptive Agents; Early-Exit Neural Networks; Network Selection

DOI:

10.1109/ICCMA59762.2023.10374802

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

The depth of a neural network's architecture is a crucial decision that must balance network performance against the computational resources required during training and inference. In the context of Reinforcement Learning (RL), this architectural choice can profoundly impact the policy (i.e., the agent's behavior) learned by the network. Depending on the state of the agent, different policies learned by the network may improve the agent's performance, particularly for time-sensitive applications (e.g., real-time, low-latency scenarios) when considering the additional computational time needed to access the output of deeper networks. Therefore, this paper proposes Greater Use of Time (GUT), a method that involves training multiple networks of different lengths and allowing them to make decisions collaboratively. If the shorter network is not confident enough, the longer network is relied on. For each network, the policy is learned through deep Q-learning, and the method's performance is evaluated in a competitive multi-agent environment. The results demonstrate that using multiple networks with different lengths not only reduces computational cost at inference time, but also yields significantly better performance than either the short or long network alone (p < 0.05). Importantly, the proposed use of confidence-based decision-making also significantly outperforms random decision-making (p < 0.05).
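
The confidence-gated choice between a shorter and a longer network described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how confidence is measured, so everything here is an assumption made for illustration: the gap between the two largest Q-values stands in as a hypothetical confidence score, and the names `q_value_gap`, `select_action`, `threshold`, and the two stand-in networks are invented and do not come from the paper.

```python
from typing import Callable, List, Sequence, Tuple

# A "network" here is any callable mapping a state to one Q-value per action.
QFunction = Callable[[Sequence[float]], List[float]]


def q_value_gap(q_values: Sequence[float]) -> float:
    """Hypothetical confidence score: gap between the two largest Q-values."""
    if len(q_values) < 2:
        return float("inf")  # with a single action there is nothing to doubt
    top, runner_up = sorted(q_values, reverse=True)[:2]
    return top - runner_up


def greedy_action(q_values: Sequence[float]) -> int:
    """Index of the largest Q-value (first index wins on ties)."""
    return max(range(len(q_values)), key=lambda i: q_values[i])


def select_action(
    state: Sequence[float],
    short_net: QFunction,
    long_net: QFunction,
    threshold: float,
) -> Tuple[int, str]:
    """Use the short network if it is confident, otherwise fall back to the long one.

    The long network is only evaluated when the short network's confidence
    falls below `threshold`, which is where the inference-time saving comes from.
    """
    q_short = short_net(state)
    if q_value_gap(q_short) >= threshold:
        return greedy_action(q_short), "short"
    return greedy_action(long_net(state)), "long"


# Stand-in networks with fixed outputs (assumptions, not trained models).
def short_net(state: Sequence[float]) -> List[float]:
    # Clear preference for positive states, near-tie otherwise.
    return [0.1, 0.9, 0.2] if state[0] > 0 else [0.50, 0.52, 0.10]


def long_net(state: Sequence[float]) -> List[float]:
    return [0.3, 0.1, 0.8]


confident = select_action([1.0], short_net, long_net, threshold=0.2)
uncertain = select_action([-1.0], short_net, long_net, threshold=0.2)
print(confident, uncertain)
```

In this sketch the first state is resolved by the short network alone, while the second state produces a near-tie in the short network's Q-values and is deferred to the long network. The threshold value is arbitrary and would need tuning in any real setting.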


Pages: 314-319

Page count: 6

