Variational Skill Embeddings for Meta Reinforcement Learning

被引:2
|
作者
Chien, Jen-Tzung [1 ]
Lai, Weiwei [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Inst Elect & Comp Engn, Hsinchu, Taiwan
关键词
D O I
10.1109/IJCNN54540.2023.10191425
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta reinforcement learning (meta-RL) aims to learn useful prior knowledge across tasks which can be generalized to unseen but similar tasks with only a small number of adaptation steps. Traditionally, the gradient-based metal RL was proposed to use the gradients to learn the parameters of an adaptive policy from different tasks which likely lacked sample efficiency. Recently, the context-based meta-RL improved the efficiency by learning the embeddings of the trajectories based on context representation. The learned policy can be adapted to new tasks, but the performance is bounded due to a simple context encoder. To deal with this insufficiency, this paper presents a novel regularized meta-RL where the generalization of policy is enhanced through a context-based meta-RL where the conditional variational autoencoder consisting of a context-skill encoder and a soft-actor-critic decoder is implemented. The proposed method pursues the model regularization by discovering the shared skill patterns across tasks in implementation of context-based meta-RL. The experiments on a number of benchmark tasks show the merit of variational skill embeddings for regularized meta-RL.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Variational meta reinforcement learning for social robotics
    Ballou, Anand
    Alameda-Pineda, Xavier
    Reinke, Chris
    [J]. APPLIED INTELLIGENCE, 2023, 53 (22) : 27249 - 27268
  • [2] Variational meta reinforcement learning for social robotics
    Anand Ballou
    Xavier Alameda-Pineda
    Chris Reinke
    [J]. Applied Intelligence, 2023, 53 : 27249 - 27268
  • [3] Variational Information Bottleneck Regularized Deep Reinforcement Learning for Efficient Robotic Skill Adaptation
    Xiang, Guofei
    Dian, Songyi
    Du, Shaofeng
    Lv, Zhonghui
    [J]. SENSORS, 2023, 23 (02)
  • [4] Skill Learning with Empowerment in Reinforcement Learning
    Moscow Institute of Physics and Technology , Moscow oblast, Dolgoprudnyi
    141701, Russia
    不详
    105064, Russia
    不详
    119333, Russia
    [J]. Pattern Recogn. Image Anal., 2024, 3 (535-542):
  • [5] Skill combination for reinforcement learning
    Luo, Zhihui
    Bell, David
    McCollum, Barry
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2007, 2007, 4881 : 87 - 96
  • [6] A framework for the robot skill learning using reinforcement learning
    Wei, YZ
    Zhao, MY
    [J]. FIFTH INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION AND CONTROL TECHNOLOGY, 2003, 5253 : 910 - 914
  • [7] Improving Textual Network Learning with Variational Homophilic Embeddings
    Wang, Wenlin
    Tao, Chenyang
    Gan, Zhe
    Wang, Guoyin
    Chen, Liqun
    Zhang, Xinyuan
    Zhang, Ruiyi
    Yang, Qian
    Henao, Ricardo
    Carin, Lawrence
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [8] Variational Regret Bounds for Reinforcement Learning
    Ortner, Ronald
    Gajane, Pratik
    Auer, Peter
    [J]. 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 81 - 90
  • [9] Deep Variational Reinforcement Learning for POMDPs
    Igl, Maximilian
    Zintgraf, Luisa
    Le, Tuan Anh
    Wood, Frank
    Whiteson, Shimon
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [10] Learning Word Meta-Embeddings
    Yin, Wenpeng
    Schuetze, Hinrich
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 1351 - 1360