Algorithms or Actions? A Study in Large-Scale Reinforcement Learning

Cited by: 0
Authors
Tavares, Anderson Rocha [1]
Anbalagan, Sivasubramanian [2]
Marcolino, Leandro Soriano [2]
Chaimowicz, Luiz [1]
Affiliations
[1] Univ Fed Minas Gerais, Comp Sci Dept, Belo Horizonte, MG, Brazil
[2] Univ Lancaster, Sch Comp & Commun, Lancaster, England
Source
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Large state and action spaces are very challenging for reinforcement learning. However, in many domains there is a set of algorithms available, each of which estimates the best action given a state. Hence, agents can either directly learn a performance-maximizing mapping from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform learning over actions for a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.
Pages: 2717-2723
Number of pages: 7