Competitive Reinforcement Learning Agents with Adaptive Networks

被引:0
|
作者
Nordaunet, Herman Pareli [1 ]
Bo, Trym [1 ]
Kassab, Evan Jasund [1 ]
Veenstra, Frank [1 ]
Cote-Allard, Ulysse [1 ]
机构
[1] Univ Oslo, Dept Informat, Oslo, Norway
来源
2023 11TH INTERNATIONAL CONFERENCE ON CONTROL, MECHATRONICS AND AUTOMATION, ICCMA | 2023年
关键词
Deep Reinforcement Learning; Adaptive Agents; Early-Exit Neural Networks; Network Selection;
D O I
10.1109/ICCMA59762.2023.10374802
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The depth of a neural network's architecture is a crucial decision that must balance network performance and the computational resources required during training and inference. In the context of Reinforcement Learning (RL), this architectural choice can profoundly impact the policy (i.e. agent's behavior) learned by the network. Depending on the state of the agent, different policies learned by the network may improve the agent's performance, particularly for time-sensitive applications (e.g. real-time, low-latency scenarios) when considering the additional computational time needed to access the output of deeper networks. Therefore, this paper proposes Greater Use of Time (GUT), a method that involves training multiple networks of different lengths and allowing them to make decisions collaboratively. If the shorter network is not confident enough, the longer network is relied on. For each network, the policy is learned through deep Q-learning, and the method's performance is evaluated in a competitive multi-agent environment. The results demonstrate that using multiple networks with different lengths not only reduces computational cost at inference time, but also yields significantly better performance than either the short or long network alone (p < 0.05). Importantly, the proposed use of confidence-based decision-making also significantly outperforms random decision-making (p < 0.05).
引用
收藏
页码:314 / 319
页数:6
相关论文
共 50 条
  • [1] Reinforcement learning of competitive skills with soccer agents
    Leng, Jinsong
    Fyfe, Colin
    Jain, Lakhmi
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT I, PROCEEDINGS, 2007, 4692 : 572 - +
  • [2] Reinforcement Learning with Adaptive Networks
    Sasaki, Tomoki
    Yamada, Satoshi
    2017 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION SCIENCES (ICRAS), 2017, : 1 - 5
  • [3] Reinforcement learning of competitive and cooperative skills in soccer agents
    Leng, Jinsong
    Lim, Chee Peng
    APPLIED SOFT COMPUTING, 2011, 11 (01) : 1353 - 1362
  • [4] Adaptive competitive learning neural networks
    Abas, Ahmed R.
    EGYPTIAN INFORMATICS JOURNAL, 2013, 14 (03) : 183 - 194
  • [5] Adaptive information agents using competitive learning
    Khan, I
    Card, HC
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 1998, 21 (02) : 69 - 89
  • [6] A realization of socially adaptive robots by competitive reinforcement learning
    Nakayama, T
    Mikami, S
    Wada, M
    RO-MAN '96 - 5TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN COMMUNICATION, PROCEEDINGS, 1996, : 107 - 111
  • [7] Adaptive agents with reinforcement learning and internal memory
    Lanzi, PL
    FROM ANIMALS TO ANIMATS 6, 2000, : 333 - 342
  • [8] Competitive Algorithms and Reinforcement Learning for NOMA in IoT Networks
    Mlika, Zoubeir
    Cherkaoui, Soumaya
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [9] Coevolutionary networks of reinforcement-learning agents
    Kianercy, Ardeshir
    Galstyan, Aram
    PHYSICAL REVIEW E, 2013, 88 (01):
  • [10] reinforcement learning, autonomous agents, neural networks
    Parker-Holder, Jack
    Rajan, Raghu
    Song, Xingyou
    Biedenkapp, Andre
    Miao, Yingjie
    Eimer, Theresa
    Zhang, Baohe
    Nguyen, Vu
    Calandra, Roberto
    Faust, Aleksandra
    Hutter, Frank
    Lindauer, Marius
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 517 - 568