Algorithms or Actions? A Study in Large-Scale Reinforcement Learning

被引：0

作者：

Tavares, Anderson Rocha ^{[1
]}

Anbalagan, Sivasubramanian ^{[2
]}

Marcolino, Leandro Soriano ^{[2
]}

Chaimowicz, Luiz ^{[1
]}

机构：

[1] Univ Fed Minas Gerais, Comp Sci Dept, Belo Horizonte, MG, Brazil

[2] Univ Lancaster, Sch Comp & Commun, Lancaster, England

来源：

PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Large state and action spaces are very challenging to reinforcement learning. However, in many domains there is a set of algorithms available, which estimate the best action given a state. Hence, agents can either directly learn a performance-maximizing mapping from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform over actions for a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.

引用

页码：2717 / 2723

页数：7

共 50 条

[31] Reinforcement learning for optimal tracking of large-scale systems with multitime scales
Li, Jinna
Nie, Hao
Chai, Tianyou
Lewis, Frank L.
SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (07)
[32] Adaptive and large-scale service composition based on deep reinforcement learning
Wang, Hongbing
Gu, Mingzhu
Yu, Qi
Tao, Yong
Li, Jiajie
Fei, Huanhuan
Yan, Jia
Zhao, Wei
Hong, Tianjing
KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 75 - 90
[33] Improved Powered Stochastic Optimization Algorithms for Large-Scale Machine Learning
Yang, Zhuang
JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[34] A Large-scale Study of the Effect of Training Set Characteristics over Learning-to-Rank Algorithms
Kanoulas, Evangelos
Savev, Stefan
Metrikov, Pavel
Pavlu, Virgil
Aslam, Javed
PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1243 - 1244
[35] Deep Learning Systems: Algorithms, Compilers, and Processors for Large-Scale Production
Rodriguez A.
Synthesis Lectures on Computer Architecture, 2021, 15 (04): : 1 - 265
[36] Large-scale parallel geophysical algorithms in Java: A feasibility study
Univ of Karlsruhe, Karlsruhe, Germany
Concurrency Pract Exper, 11-13 (1143-1153):
[37] Large-scale parallel geophysical algorithms in Java: A feasibility study
Jacob, Matthias
Philippsen, Michael
Karrenbach, Martin
Leading Edge (Tulsa, OK), 1998, 17 (12):
[38] PARAFAC algorithms for large-scale problems
Anh Huy Phan
Cichocki, Andrzej
NEUROCOMPUTING, 2011, 74 (11) : 1970 - 1984
[39] Algorithms for large-scale flat placement
Vygen, J
DESIGN AUTOMATION CONFERENCE - PROCEEDINGS 1997, 1997, : 746 - 751
[40] Algorithms for large-scale genotyping microarrays
Liu, WM
Di, XJ
Yang, G
Matsuzaki, H
Huang, J
Mei, R
Ryder, TB
Webster, TA
Dong, SL
Liu, GY
Jones, KW
Kennedy, GC
Kulp, D
BIOINFORMATICS, 2003, 19 (18) : 2397 - 2403

← 1 2 3 4 5 →