Mix-up Consistent Cross Representations for Data-Efficient Reinforcement Learning

被引:0
|
作者
Liu, Shiyu [1 ]
Cao, Guitao [1 ]
Liu, Yong [1 ]
Li, Yan [1 ]
Wu, Chunwei [1 ]
Xi, Xidong [1 ]
机构
[1] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, MoE Engn Res Ctr SW HW Codesign Technol & Applica, Shanghai 200062, Peoples R China
基金
中国国家自然科学基金;
关键词
mutual information; smoothness; self-supervised learning; reinforcement learning; LEVEL;
D O I
10.1109/IJCNN55064.2022.9892416
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning (RL) has achieved remarkable performance in sequential decision-making problems. However, it is a challenge for deep RL methods to extract task-relevant semantic information when interacting with limited data from the environment. In this paper, we propose Mixup Consistent Cross Representations (MCCR), a novel selfsupervised auxiliary task, which aims to improve data efficiency and encourage representation prediction. Specifically, we calculate the contrastive loss between low-dimensional and high-dimensional representations of different state observations to boost the mutual information between states, thus improving data efficiency. Furthermore, we employ a mixed strategy to generate intermediate samples, increasing data diversity and the smoothness of representations prediction in nearby timesteps. Experimental results show that MCCR achieves competitive results over the state-of-the-art approaches for complex control tasks in DeepMind Control Suite, notably improving the ability of pretrained encoders to generalize to unseen tasks.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Load Balancing for Communication Networks via Data-Efficient Deep Reinforcement Learning
    Wu, Di
    Kang, Jikun
    Xu, Yi Tian
    Li, Hang
    Li, Jimmy
    Chen, Xi
    Rivkin, Dmitriy
    Jenkin, Michael
    Lee, Taeseop
    Park, Intaik
    Liu, Xue
    Dudek, Gregory
    [J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [42] A Mix-up Strategy to Enhance Adversarial Training with Imbalanced Data
    Wang, Wentao
    Shomer, Harry
    Wan, Yuxuan
    Li, Yaxin
    Huang, Jiangtao
    Liu, Hui
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2637 - 2645
  • [43] Dynamic video mix-up for cross-domain action recognition
    Wu, Han
    Song, Chunfeng
    Yue, Shaolong
    Wang, Zhenyu
    Xiao, Jun
    Liu, Yanyang
    [J]. NEUROCOMPUTING, 2022, 471 : 358 - 368
  • [44] Uniform Priors for Data-Efficient Learning
    Sinha, Samarth
    Roth, Karsten
    Goyal, Anirudh
    Ghassemi, Marzyeh
    Akata, Zeynep
    Larochelle, Hugo
    Garg, Animesh
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4026 - 4037
  • [45] PerSim: Data-efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
    Agarwal, Anish
    Alomar, Abdullah
    Alumootil, Varkey
    Shah, Devavrat
    Shen, Dennis
    Xu, Zhi
    Yang, Cindy
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [46] Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs
    McAllister, Rowan Thomas
    Rasmussen, Carl Edward
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [47] DATA-EFFICIENT DEEP REINFORCEMENT LEARNING WITH CONVOLUTION-BASED STATE ENCODER NETWORKS
    Fang, Qiang
    Xu, Xin
    Lan, Yixin
    Zhang, Yichuan
    Zeng, Yujun
    Tang, Tao
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2021, 36
  • [48] Deep reinforcement learning for data-efficient weakly supervised business process anomaly detection
    Elaziz, Eman Abd
    Fathalla, Radwa
    Shaheen, Mohamed
    [J]. JOURNAL OF BIG DATA, 2023, 10 (01)
  • [49] Deep reinforcement learning for data-efficient weakly supervised business process anomaly detection
    Eman Abd Elaziz
    Radwa Fathalla
    Mohamed Shaheen
    [J]. Journal of Big Data, 10
  • [50] Explainability-Based Mix-Up Approach for Text Data Augmentation
    Kwon, Soonki
    Lee, Younghoon
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (01)