Algorithms or Actions? A Study in Large-Scale Reinforcement Learning

Cited by: 0

Authors:

Tavares, Anderson Rocha [1]

Anbalagan, Sivasubramanian [2]

Marcolino, Leandro Soriano [2]

Chaimowicz, Luiz [1]

Affiliations:

[1] Univ Fed Minas Gerais, Comp Sci Dept, Belo Horizonte, MG, Brazil

[2] Univ Lancaster, Sch Comp & Commun, Lancaster, England

Source:

PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Large state and action spaces are very challenging for reinforcement learning. However, in many domains there is a set of algorithms available, each of which estimates the best action given a state. Hence, agents can either directly learn a performance-maximizing mapping from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform learning over actions for a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.


Pages: 2717-2723

Page count: 7
