Algorithms or Actions? A Study in Large-Scale Reinforcement Learning

Cited by: 0
Authors
Tavares, Anderson Rocha [1]
Anbalagan, Sivasubramanian [2]
Marcolino, Leandro Soriano [2]
Chaimowicz, Luiz [1]
Affiliations
[1] Univ Fed Minas Gerais, Comp Sci Dept, Belo Horizonte, MG, Brazil
[2] Univ Lancaster, Sch Comp & Commun, Lancaster, England
Source
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Large state and action spaces are very challenging for reinforcement learning. However, in many domains a set of algorithms is available, each of which estimates the best action given a state. Hence, agents can learn a performance-maximizing mapping either directly from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform learning over actions for a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.
Pages: 2717-2723
Page count: 7