Algorithms or Actions? A Study in Large-Scale Reinforcement Learning

被引:0
|
作者
Tavares, Anderson Rocha [1 ]
Anbalagan, Sivasubramanian [2 ]
Marcolino, Leandro Soriano [2 ]
Chaimowicz, Luiz [1 ]
机构
[1] Univ Fed Minas Gerais, Comp Sci Dept, Belo Horizonte, MG, Brazil
[2] Univ Lancaster, Sch Comp & Commun, Lancaster, England
来源
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large state and action spaces are very challenging to reinforcement learning. However, in many domains there is a set of algorithms available, which estimate the best action given a state. Hence, agents can either directly learn a performance-maximizing mapping from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform over actions for a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.
引用
收藏
页码:2717 / 2723
页数:7
相关论文
共 50 条
  • [21] Large-scale and adaptive service composition based on deep reinforcement learning
    Liu, Jiang-Wen
    Hu, Li-Qiang
    Cai, Zhao-Quan
    Xing, Li-Ning
    Tan, Xu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 65
  • [22] Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning
    Chen, Haokun
    Zhu, Chenxu
    Tang, Ruiming
    Zhang, Weinan
    He, Xiuqiang
    Yu, Yong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4018 - 4032
  • [23] Deep Reinforcement Learning for Network Service Recovery in Large-scale Failures
    Akashi, Kazuaki
    Fukuda, Nobukazu
    Kanai, Shunsuke
    Tayama, Kenichi
    2023 19TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT, CNSM, 2023,
  • [24] NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching
    Zhang, Hongbo
    Wang, Guang
    Wang, Xu
    Zhou, Zhengyang
    Zhang, Chen
    Dong, Zheng
    Wang, Yang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 401 - 409
  • [25] Automatic Hierarchical Reinforcement Learning for Efficient Large-scale Service Composition
    Wang, Hongbing
    Huang, Guicheng
    Yu, Qi
    2016 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS), 2016, : 57 - 64
  • [26] Large-Scale Traffic Grid Signal Control with Regional Reinforcement Learning
    Chu, Tianshu
    Qu, Shuhui
    Wang, Jie
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 815 - 820
  • [27] Reinforcement Learning for Sustainability: Adapting in large-scale heterogeneous dynamic environments
    Dusparic, Ivana
    2022 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS COMPANION (ACSOS-C 2022), 2022, : 49 - 50
  • [28] Reinforcement learning for optimal tracking of large-scale systems with multitime scales
    Jinna LI
    Hao NIE
    Tianyou CHAI
    Frank L.LEWIS
    Science China(Information Sciences), 2023, 66 (07) : 5 - 29
  • [29] Deep Reinforcement Learning-Based Large-Scale Robot Exploration
    Cao, Yuhong
    Zhao, Rui
    Wang, Yizhuo
    Xiang, Bairan
    Sartoretti, Guillaume
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4631 - 4638
  • [30] Large-Scale and Adaptive Service Composition Using Deep Reinforcement Learning
    Wang, Hongbing
    Gu, Mingzhu
    Yu, Qi
    Fei, Huanhuan
    Li, Jiajie
    Tao, Yong
    SERVICE-ORIENTED COMPUTING, ICSOC 2017, 2017, 10601 : 383 - 391