KERNEL-BASED LIFELONG POLICY GRADIENT REINFORCEMENT LEARNING

被引:1
|
作者
Mowakeaa, Rami [1 ]
Kim, Seung-Jun [1 ]
Emge, Darren K. [2 ]
机构
[1] Univ Maryland Baltimore Cty, Dept Comp Sci & Elect Engn, Baltimore, MD 21250 USA
[2] Chem Biol Ctr RDCB DRC P, Combat Capabil Dev Command, Gunpowder, MD USA
基金
美国国家科学基金会;
关键词
Reinforcement learning; lifelong learning; kernel methods; policy gradients; dictionary learning;
D O I
10.1109/ICASSP39728.2021.9414511
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Policy gradient methods have been widely used in reinforcement learning (RL), especially thanks to their facility to handle continuous state spaces, strong convergence guarantees, and low-complexity updates. Training of the methods for individual tasks, however, can still be taxing in terms of the learning speed and the sample trajectory collection. Lifelong learning aims to exploit the intrinsic structure shared among a suite of RL tasks, akin to multitask learning, but in an efficient online fashion. In this work, we propose a lifelong RL algorithm based on the kernel method to leverage nonlinear features of the data based on a popular union-of-subspace model. Experimental results on a set of simple related tasks verify the advantage of the proposed strategy, compared to the single-task and the parametric counterparts.
引用
收藏
页码:3500 / 3504
页数:5
相关论文
共 50 条
  • [1] Kernel-Based Decentralized Policy Evaluation for Reinforcement Learning
    Liu, Jiamin
    Lian, Heng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [2] Kernel-Based Reinforcement Learning
    Hu, Guanghua
    Qiu, Yuqin
    Xiang, Liming
    [J]. INTELLIGENT COMPUTING, PART I: INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, ICIC 2006, PART I, 2006, 4113 : 757 - 766
  • [3] Kernel-Based Reinforcement Learning
    Dirk Ormoneit
    Śaunak Sen
    [J]. Machine Learning, 2002, 49 : 161 - 178
  • [4] Kernel-based reinforcement learning
    Ormoneit, D
    Sen, S
    [J]. MACHINE LEARNING, 2002, 49 (2-3) : 161 - 178
  • [5] Kernel-based least squares policy iteration for reinforcement learning
    Xu, Xin
    Hu, Dewen
    Lu, Xicheng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (04): : 973 - 992
  • [6] KERNEL-BASED EFFICIENT LIFELONG LEARNING ALGORITHM
    Kim, Seung-Jun
    Mowakeaa, Rami
    [J]. 2019 IEEE DATA SCIENCE WORKSHOP (DSW), 2019, : 175 - 179
  • [7] Practical Kernel-Based Reinforcement Learning
    Barreto, Andre M. S.
    Precup, Doina
    Pineau, Joelle
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [8] Kernel-based direct policy search reinforcement learning based on variational Bayesian inference
    Yamaguchi, Nobuhiko
    Fukuda, Osamu
    Okumura, Hiroshi
    [J]. 2019 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING WORKSHOPS (CANDARW 2019), 2019, : 184 - 187
  • [9] Kernel-Based Reinforcement Learning: A Finite-Time Analysis
    Domingues, Omar D.
    Menard, Pierre
    Pirotta, Matteo
    Kaufmann, Emilie
    Valko, Michal
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [10] Kernel-Based Reinforcement Learning in Robust Markov Decision Processes
    Lim, Shiau Hong
    Autef, Arnaud
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97