Temporal prediction model with context-aware data augmentation for robust visual reinforcement learning

Authors
Yue, Xinkai [1]
Ge, Hongwei [1]
He, Xin [1]
Hou, Yaqing [1]
Affiliation
[1] College of Computer Science and Technology, Dalian University of Technology, Dalian, China
Funding
National Natural Science Foundation of China
Keywords
Benchmarking - Forecasting - Learning systems - Pixels - Robotics
DOI
10.1007/s00521-024-10251-w
Abstract
While reinforcement learning has shown promising abilities to solve continuous control tasks from visual inputs, it remains a challenge to learn robust representations from high-dimensional observations and to generalize to unseen environments with distracting elements. Recently, strong data augmentation has been applied to increase the diversity of the training data, but it may damage the task-relevant pixels and thus hinder the optimization of reinforcement learning. To this end, this paper proposes the temporal prediction model with context-aware data augmentation (TPMC), a framework that incorporates context-aware strong augmentation into the dynamic model for learning robust policies. Specifically, TPMC uses a gradient-based saliency map to identify and preserve task-relevant pixels during strong augmentation, generating reliable augmented images for stable training. Moreover, temporal prediction consistency between the strongly and weakly augmented views is enforced to construct a contrastive objective for learning shared task-relevant representations. Extensive experiments evaluate performance on the DMControl-GB benchmarks and several robotic manipulation tasks. The results show that TPMC achieves better data efficiency and generalization than other state-of-the-art methods.
© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
Pages: 19337-19352
Page count: 15
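
Illustrative sketch
The two mechanisms named in the abstract (saliency-guided preservation of task-relevant pixels during strong augmentation, and a contrastive objective between strong and weak views) can be sketched roughly as follows. This is not the authors' implementation: the toy linear critic, the random-overlay augmentation, the InfoNCE form of the loss, and the names `keep_ratio` and `temperature` are all assumptions made for illustration, and the abstract does not specify any of them.

```python
import numpy as np

def saliency_from_gradient(grad):
    """Collapse a per-pixel gradient of shape (C, H, W) into an (H, W) saliency map."""
    return np.abs(grad).max(axis=0)

def context_aware_augment(obs, strong_aug, saliency, keep_ratio=0.1):
    """Keep the most salient pixels of `obs`; take all other pixels from `strong_aug`.

    `keep_ratio` is a hypothetical hyperparameter: the fraction of pixels
    treated as task-relevant (ties at the threshold are also kept).
    """
    k = max(1, int(round(keep_ratio * saliency.size)))
    threshold = np.partition(saliency.ravel(), -k)[-k]  # k-th largest value
    mask = (saliency >= threshold).astype(obs.dtype)    # (H, W), 1 = preserved
    return mask * obs + (1.0 - mask) * strong_aug, mask

def info_nce(pred_strong, pred_weak, temperature=0.1):
    """InfoNCE loss (an assumed contrastive form): row i of each batch is a positive pair."""
    a = pred_strong / np.linalg.norm(pred_strong, axis=1, keepdims=True)
    b = pred_weak / np.linalg.norm(pred_weak, axis=1, keepdims=True)
    logits = a @ b.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))

rng = np.random.default_rng(0)
obs = rng.random((3, 8, 8))

# Toy linear critic q(obs) = sum(w * obs), so dq/dobs = w (assumption:
# a real agent would backpropagate through its learned critic instead).
w = np.zeros((3, 8, 8))
w[:, 2:4, 2:4] = 1.0
saliency = saliency_from_gradient(w)

# Stand-in for a strong augmentation: blend with a random image.
strong = 0.5 * obs + 0.5 * rng.random((3, 8, 8))
aug, mask = context_aware_augment(obs, strong, saliency, keep_ratio=4 / 64)

# Stand-in latent predictions for a batch of 4 transitions.
z = rng.normal(size=(4, 16))
loss_matched = info_nce(z, z)
loss_shuffled = info_nce(z, np.roll(z, 1, axis=0))
```

In this toy setup the four pixels with nonzero critic gradient are copied unchanged from the original observation while every other pixel comes from the overlay, and the loss is lower when the strong and weak predictions are correctly paired than when the pairing is shuffled.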