Mix-up Consistent Cross Representations for Data-Efficient Reinforcement Learning

被引:0
|
作者
Liu, Shiyu [1 ]
Cao, Guitao [1 ]
Liu, Yong [1 ]
Li, Yan [1 ]
Wu, Chunwei [1 ]
Xi, Xidong [1 ]
机构
[1] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, MoE Engn Res Ctr SW HW Codesign Technol & Applica, Shanghai 200062, Peoples R China
基金
中国国家自然科学基金;
关键词
mutual information; smoothness; self-supervised learning; reinforcement learning; LEVEL;
D O I
10.1109/IJCNN55064.2022.9892416
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning (RL) has achieved remarkable performance in sequential decision-making problems. However, it is a challenge for deep RL methods to extract task-relevant semantic information when interacting with limited data from the environment. In this paper, we propose Mixup Consistent Cross Representations (MCCR), a novel selfsupervised auxiliary task, which aims to improve data efficiency and encourage representation prediction. Specifically, we calculate the contrastive loss between low-dimensional and high-dimensional representations of different state observations to boost the mutual information between states, thus improving data efficiency. Furthermore, we employ a mixed strategy to generate intermediate samples, increasing data diversity and the smoothness of representations prediction in nearby timesteps. Experimental results show that MCCR achieves competitive results over the state-of-the-art approaches for complex control tasks in DeepMind Control Suite, notably improving the ability of pretrained encoders to generalize to unseen tasks.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Shielded Planning Guided Data-Efficient and Safe Reinforcement Learning
    Wang, Hao
    Qin, Jiahu
    Kan, Zhen
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12
  • [22] Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning
    Jiang, Zhaohui
    Weng, Paul
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 556 - 572
  • [23] Data Based Optimal Control with Neural Networks and Data-Efficient Reinforcement Learning
    Runkler, Thomas A.
    Udluft, Steffen
    Duell, Siegmund
    [J]. AT-AUTOMATISIERUNGSTECHNIK, 2012, 60 (10) : 641 - 647
  • [24] A Data-Efficient Reinforcement Learning Method Based on Local Koopman Operators
    Song, Lixing
    Wang, Junheng
    Xu, Junhong
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 515 - 520
  • [25] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
    Thomas, Philip S.
    Brunskill, Emma
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [26] Data-Efficient Hierarchical Reinforcement Learning for Robotic Assembly Control Applications
    Hou, Zhimin
    Fei, Jiajun
    Deng, Yuelin
    Xu, Jing
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (11) : 11565 - 11575
  • [27] DATA-EFFICIENT MODEL-BASED REINFORCEMENT LEARNING FOR ROBOT CONTROL
    Sun, Ming
    Gao, Yue
    Liu, Wei
    Li, Shaoyuan
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2021, 36 (04): : 211 - 218
  • [28] Data-efficient model-based reinforcement learning with trajectory discrimination
    Tuo Qu
    Fuqing Duan
    Junge Zhang
    Bo Zhao
    Wenzhen Huang
    [J]. Complex & Intelligent Systems, 2024, 10 : 1927 - 1936
  • [29] Data-efficient model-based reinforcement learning with trajectory discrimination
    Qu, Tuo
    Duan, Fuqing
    Zhang, Junge
    Zhao, Bo
    Huang, Wenzhen
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 1927 - 1936