Low-Rank Representation of Reinforcement Learning Policies

被引:0
|
作者
Mazoure, Bogdan [1 ]
Doan, Thang [1 ]
Li, Tianyu [1 ]
Makarenkov, Vladimir [2 ]
Pineau, Joelle [3 ]
Precup, Doina [3 ]
Rabuseau, Guillaume [4 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] Univ Quebec Montreal, Dept Informat, Montreal, PQ, Canada
[3] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[4] Univ Montreal, Dept Comp Sci & Operat Res Mila, CIFAR AI Chair, Montreal, PQ, Canada
关键词
HILBERT-SPACE; ALGORITHMS; SAFE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a general framework for policy representation for reinforcement learning tasks. This framework involves finding a low-dimensional embedding of the policy on a reproducing kernel Hilbert space (RKHS). The usage of RKHS based methods allows us to derive strong theoretical guarantees on the expected return of the reconstructed policy. Such guarantees are typically lacking in black-box models, but are very desirable in tasks requiring stability and convergence guarantees. We conduct several experiments on classic RL domains. The results confirm that the policies can be robustly represented in a low-dimensional space while the embedded policy incurs almost no decrease in returns.
引用
收藏
页码:597 / 636
页数:40
相关论文
共 50 条
  • [1] Learning Low-Rank Representation for Matrix Completion
    Kwon, Minsu
    Choi, Ho-Jin
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 161 - 164
  • [2] Subspace Learning Based Low-Rank Representation
    Tang, Kewei
    Liu, Xiaodong
    Su, Zhixun
    Jiang, Wei
    Dong, Jiangxin
    [J]. COMPUTER VISION - ACCV 2016, PT I, 2017, 10111 : 416 - 431
  • [3] Learning Discriminative Low-rank Representation for Image Classification
    Li, Jun
    Chang, Heyou
    Yang, Jian
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 504 - 509
  • [4] Integrating feature and graph learning with low-rank representation
    Peng, Chong
    Kang, Zhao
    Cheng, Qiang
    [J]. NEUROCOMPUTING, 2017, 249 : 106 - 116
  • [5] Low-rank and sparse matrices fitting algorithm for low-rank representation
    Zhao, Jianxi
    Zhao, Lina
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2020, 79 (02) : 407 - 425
  • [6] Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations
    Dann, Christoph
    Mansour, Yishay
    Mohri, Mehryar
    Sekhari, Ayush
    Sridharan, Karthik
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Canonical Correlation Analysis With Low-Rank Learning for Image Representation
    Lu, Yuwu
    Wang, Wenjing
    Zeng, Biqing
    Lai, Zhihui
    Shen, Linlin
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 7048 - 7062
  • [8] Laplacian regularized low-rank sparse representation transfer learning
    Lin Guo
    Qun Dai
    [J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 807 - 821
  • [9] Learning Structured Low-Rank Representation via Matrix Factorization
    Shen, Jie
    Li, Ping
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 500 - 509
  • [10] Structured low-rank representation learning for hyperspectral sparse unmixing
    Zhang, Jian
    Dong, Hongsong
    Gao, Wenlian
    Zhang, Li
    Xue, Zhiwen
    Shen, Xiangfei
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (02) : 351 - 375