Bayesian Reinforcement Learning with Exploration

被引：0

作者：

Lattimore, Tor ^{[1
]}

Hutter, Marcus ^{[2
]}

机构：

[1] Univ Alberta, Edmonton, AB T6G 2M7, Canada

[2] Australian Natl Univ, Canberra, ACT 0200, Australia

来源：

ALGORITHMIC LEARNING THEORY (ALT 2014) | 2014年 / 8776卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider a general reinforcement learning problem and show that carefully combining the Bayesian optimal policy and an exploring policy leads to minimax sample-complexity bounds in a very general class of (history-based) environments. We also prove lower bounds and show that the new algorithm displays adaptive behaviour when the environment is easier than worst-case.

引用

页码：170 / 184

页数：15

共 50 条

[31] Distributional Reinforcement Learning for Efficient Exploration
Mavrin, Borislav
Yao, Hengshuai
Kong, Linglong
Wu, Kaiwen
Yu, Yaoliang
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[32] Adaptive Exploration Strategies for Reinforcement Learning
Hwang, Kao-Shing
Li, Chih-Wen
Jiang, Wei-Cheng
2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 16 - 19
[33] Uncertainty Quantification and Exploration for Reinforcement Learning
Zhu, Yi
Dong, Jing
Lam, Henry
OPERATIONS RESEARCH, 2024, 72 (04) : 1689 - 1709
[34] Coordinated Exploration in Concurrent Reinforcement Learning
Dimakopoulou, Maria
Van Roy, Benjamin
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[35] Overcoming Exploration in Reinforcement Learning with Demonstrations
Nair, Ashvin
McGrew, Bob
Andrychowicz, Marcin
Zaremba, Wojciech
Abbeel, Pieter
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 6292 - 6299
[36] Improving Reinforcement Learning Exploration by Autoencoders
Paczolay, Gabor
Harmati, Istvan
Periodica Polytechnica Electrical Engineering and Computer Science, 2024, 68 (04): : 335 - 343
[37] Exploration Conscious Reinforcement Learning Revisited
Shani, Lior
Efroni, Yonathan
Mannor, Shie
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[38] Active Exploration for Inverse Reinforcement Learning
Lindner, David
Krause, Andreas
Ramponi, Giorgia
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[39] Adaptive Exploration for Continual Reinforcement Learning
Stulp, Freek
2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 1631 - 1636
[40] Bayesian Reinforcement Learning and Bayesian Deep Learning for Blockchains With Mobile Edge Computing
Asheralieva, Alia
Niyato, Dusit
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 319 - 335

← 1 2 3 4 5 →