Kernel-Based Reinforcement Learning

Cited by: 1
Authors
Hu, Guanghua [1]
Qiu, Yuqin [1]
Xiang, Liming
Affiliations
[1] Yunnan Univ, Sch Math & Stat, Kunming 650091, Yunnan, Peoples R China
DOI
10.1007/11816157_92
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We consider the problem of approximating the cost-to-go functions in reinforcement learning. By mapping the state implicitly into a feature space, we run a simple algorithm in the feature space that corresponds to a complex algorithm in the original state space. Two kernel-based reinforcement learning algorithms are proposed: ε-insensitive kernel-based reinforcement learning (ε-KRL) and least-squares kernel-based reinforcement learning (LS-KRL). An example shows that the proposed methods can deal effectively with the reinforcement learning problem without having to explore many states.
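As an illustration of the general idea in the abstract (fitting a cost-to-go function by least squares in a kernel-induced feature space, from only a few visited states), here is a minimal sketch. It is not the paper's LS-KRL or ε-KRL algorithm, which this record does not specify. The one-dimensional chain problem, the Gaussian kernel, the discount factor, and all names (`gaussian_kernel`, `fit_cost_to_go`, `true_cost_to_go`, `J_hat`) are assumptions chosen for illustration.

```python
import math

def gaussian_kernel(x, y, width=2.0):
    """k(x, y) = exp(-(x - y)^2 / (2 * width^2)); the feature map stays implicit."""
    return math.exp(-((x - y) ** 2) / (2.0 * width ** 2))

def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x

def fit_cost_to_go(states, targets, ridge=1e-6, width=2.0):
    """Regularized least squares in feature space: solve (K + ridge * I) alpha = targets."""
    n = len(states)
    gram = [[gaussian_kernel(states[i], states[j], width) + (ridge if i == j else 0.0)
             for j in range(n)] for i in range(n)]
    alpha = solve_linear_system(gram, targets)

    def predict(s):
        # The estimate is a kernel expansion over the sampled states only.
        return sum(a * gaussian_kernel(s, x, width) for a, x in zip(alpha, states))

    return predict

def true_cost_to_go(s, gamma=0.9):
    """Assumed toy problem: unit cost per step while walking from state s to goal 0."""
    return (1.0 - gamma ** s) / (1.0 - gamma)

# Only six states are "explored"; the fitted function is then queried elsewhere.
sampled_states = [0.0, 2.0, 4.0, 6.0, 8.0, 10.0]
sampled_costs = [true_cost_to_go(s) for s in sampled_states]
J_hat = fit_cost_to_go(sampled_states, sampled_costs)

for s in [1.0, 5.0, 9.0]:
    print(f"state {s}: estimate {J_hat(s):.3f}, true {true_cost_to_go(s):.3f}")
```

In this sketch the targets are exact cost-to-go values, so it shows only the function-approximation step. In a reinforcement learning setting the targets would come from sampled transitions, and the estimate tends to be least accurate near the edge of the sampled region.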
Pages: 757-766
Number of pages: 10
Related papers
  • [1] Kernel-Based Reinforcement Learning
    Dirk Ormoneit
    Śaunak Sen
    [J]. Machine Learning, 2002, 49 : 161 - 178
  • [2] Kernel-based reinforcement learning
    Ormoneit, D
    Sen, S
    [J]. MACHINE LEARNING, 2002, 49 (2-3) : 161 - 178
  • [3] Practical Kernel-Based Reinforcement Learning
    Barreto, Andre M. S.
    Precup, Doina
    Pineau, Joelle
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [4] KERNEL-BASED LIFELONG POLICY GRADIENT REINFORCEMENT LEARNING
    Mowakeaa, Rami
    Kim, Seung-Jun
    Emge, Darren K.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021 : 3500 - 3504
  • [5] Kernel-Based Decentralized Policy Evaluation for Reinforcement Learning
    Liu, Jiamin
    Lian, Heng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024
  • [6] Kernel-Based Reinforcement Learning: A Finite-Time Analysis
    Domingues, Omar D.
    Menard, Pierre
    Pirotta, Matteo
    Kaufmann, Emilie
    Valko, Michal
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [7] Kernel-Based Reinforcement Learning in Robust Markov Decision Processes
    Lim, Shiau Hong
    Autef, Arnaud
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [8] Kernel-based reinforcement learning in average-cost problems
    Ormoneit, D
    Glynn, P
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2002, 47 (10) : 1624 - 1636
  • [9] Kernel-based least squares policy iteration for reinforcement learning
    Xu, Xin
    Hu, Dewen
    Lu, Xicheng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (4) : 973 - 992
  • [10] The Characteristics of Kernel and Kernel-based Learning
    Tan, Fuxiao
    Han, Dezhi
    [J]. 2019 3RD INTERNATIONAL SYMPOSIUM ON AUTONOMOUS SYSTEMS (ISAS 2019), 2019 : 406 - 411