Bayesian Reinforcement Learning with Exploration

被引:0
|
作者
Lattimore, Tor [1 ]
Hutter, Marcus [2 ]
机构
[1] Univ Alberta, Edmonton, AB T6G 2M7, Canada
[2] Australian Natl Univ, Canberra, ACT 0200, Australia
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider a general reinforcement learning problem and show that carefully combining the Bayesian optimal policy and an exploring policy leads to minimax sample-complexity bounds in a very general class of (history-based) environments. We also prove lower bounds and show that the new algorithm displays adaptive behaviour when the environment is easier than worst-case.
引用
收藏
页码:170 / 184
页数:15
相关论文
共 50 条
  • [31] Distributional Reinforcement Learning for Efficient Exploration
    Mavrin, Borislav
    Yao, Hengshuai
    Kong, Linglong
    Wu, Kaiwen
    Yu, Yaoliang
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [32] Adaptive Exploration Strategies for Reinforcement Learning
    Hwang, Kao-Shing
    Li, Chih-Wen
    Jiang, Wei-Cheng
    2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 16 - 19
  • [33] Uncertainty Quantification and Exploration for Reinforcement Learning
    Zhu, Yi
    Dong, Jing
    Lam, Henry
    OPERATIONS RESEARCH, 2024, 72 (04) : 1689 - 1709
  • [34] Coordinated Exploration in Concurrent Reinforcement Learning
    Dimakopoulou, Maria
    Van Roy, Benjamin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [35] Overcoming Exploration in Reinforcement Learning with Demonstrations
    Nair, Ashvin
    McGrew, Bob
    Andrychowicz, Marcin
    Zaremba, Wojciech
    Abbeel, Pieter
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 6292 - 6299
  • [36] Improving Reinforcement Learning Exploration by Autoencoders
    Paczolay, Gabor
    Harmati, Istvan
    Periodica Polytechnica Electrical Engineering and Computer Science, 2024, 68 (04): : 335 - 343
  • [37] Exploration Conscious Reinforcement Learning Revisited
    Shani, Lior
    Efroni, Yonathan
    Mannor, Shie
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [38] Active Exploration for Inverse Reinforcement Learning
    Lindner, David
    Krause, Andreas
    Ramponi, Giorgia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [39] Adaptive Exploration for Continual Reinforcement Learning
    Stulp, Freek
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 1631 - 1636
  • [40] Bayesian Reinforcement Learning and Bayesian Deep Learning for Blockchains With Mobile Edge Computing
    Asheralieva, Alia
    Niyato, Dusit
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 319 - 335