Algorithms or Actions? A Study in Large-Scale Reinforcement Learning

被引：0

作者：

Tavares, Anderson Rocha ^{[1
]}

Anbalagan, Sivasubramanian ^{[2
]}

Marcolino, Leandro Soriano ^{[2
]}

Chaimowicz, Luiz ^{[1
]}

机构：

[1] Univ Fed Minas Gerais, Comp Sci Dept, Belo Horizonte, MG, Brazil

[2] Univ Lancaster, Sch Comp & Commun, Lancaster, England

来源：

PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Large state and action spaces are very challenging to reinforcement learning. However, in many domains there is a set of algorithms available, which estimate the best action given a state. Hence, agents can either directly learn a performance-maximizing mapping from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform over actions for a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.

引用

页码：2717 / 2723

页数：7

共 50 条

[21] Large-scale and adaptive service composition based on deep reinforcement learning
Liu, Jiang-Wen
Hu, Li-Qiang
Cai, Zhao-Quan
Xing, Li-Ning
Tan, Xu
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 65
[22] Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning
Chen, Haokun
Zhu, Chenxu
Tang, Ruiming
Zhang, Weinan
He, Xiuqiang
Yu, Yong
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4018 - 4032
[23] Deep Reinforcement Learning for Network Service Recovery in Large-scale Failures
Akashi, Kazuaki
Fukuda, Nobukazu
Kanai, Shunsuke
Tayama, Kenichi
2023 19TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT, CNSM, 2023,
[24] NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching
Zhang, Hongbo
Wang, Guang
Wang, Xu
Zhou, Zhengyang
Zhang, Chen
Dong, Zheng
Wang, Yang
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 401 - 409
[25] Automatic Hierarchical Reinforcement Learning for Efficient Large-scale Service Composition
Wang, Hongbing
Huang, Guicheng
Yu, Qi
2016 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS), 2016, : 57 - 64
[26] Large-Scale Traffic Grid Signal Control with Regional Reinforcement Learning
Chu, Tianshu
Qu, Shuhui
Wang, Jie
2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 815 - 820
[27] Reinforcement Learning for Sustainability: Adapting in large-scale heterogeneous dynamic environments
Dusparic, Ivana
2022 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS COMPANION (ACSOS-C 2022), 2022, : 49 - 50
[28] Reinforcement learning for optimal tracking of large-scale systems with multitime scales
Jinna LI
Hao NIE
Tianyou CHAI
Frank L.LEWIS
Science China(Information Sciences), 2023, 66 (07) : 5 - 29
[29] Deep Reinforcement Learning-Based Large-Scale Robot Exploration
Cao, Yuhong
Zhao, Rui
Wang, Yizhuo
Xiang, Bairan
Sartoretti, Guillaume
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4631 - 4638
[30] Large-Scale and Adaptive Service Composition Using Deep Reinforcement Learning
Wang, Hongbing
Gu, Mingzhu
Yu, Qi
Fei, Huanhuan
Li, Jiajie
Tao, Yong
SERVICE-ORIENTED COMPUTING, ICSOC 2017, 2017, 10601 : 383 - 391

← 1 2 3 4 5 →