Temporal prediction model with context-aware data augmentation for robust visual reinforcement learning

被引:0
|
作者
Yue, Xinkai [1 ]
Ge, Hongwei [1 ]
He, Xin [1 ]
Hou, Yaqing [1 ]
机构
[1] College of Computer Science and Technology, Dalian University of Technology, Dalian, China
基金
中国国家自然科学基金;
关键词
Benchmarking - Forecasting - Learning systems - Pixels - Robotics;
D O I
10.1007/s00521-024-10251-w
中图分类号
学科分类号
摘要
While reinforcement learning has shown promising abilities to solve continuous control tasks from visual inputs, it remains a challenge to learn robust representations from high-dimensional observations and generalize to unseen environments with distracting elements. Recently, strong data augmentation has been applied to increase the diversity of the training data, but it may damage the task-relevant pixels and thus hinder the optimization of reinforcement learning. To this end, this paper proposes temporal prediction model with context-aware data augmentation (TPMC), a framework which incorporates context-aware strong augmentation into the dynamic model for learning robust policies. Specifically, TPMC utilizes the gradient-based saliency map to identify and preserve task-relevant pixels during strong augmentation, generating reliable augmented images for stable training. Moreover, the temporal prediction consistency between strong and weak augmented views is enforced to construct a contrastive objective for learning shared task-relevant representations. Extensive experiments are conducted to evaluate the performance on DMControl-GB benchmarks and several robotic manipulation tasks. Experimental results demonstrate that TPMC achieves superior data-efficiency and generalization to other state-of-the-art methods. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:19337 / 19352
页数:15
相关论文
共 50 条
  • [21] A context-aware hybrid deep learning model for the prediction of tropical cyclone trajectories
    Farmanifard, Sahar
    Alesheikh, Ali Asghar
    Sharif, Mohammad
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231
  • [22] Context-Aware Neural Model for Temporal Information Extraction
    Meng, Yuanliang
    Rumshisky, Anna
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 527 - 536
  • [23] Context-Aware Visual Tracking
    Yang, Ming
    Wu, Ying
    Hua, Gang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (07) : 1195 - 1209
  • [24] Context-aware driver risk prediction with telematics data
    Moosavi, Sobhan
    Ramnath, Rajiv
    ACCIDENT ANALYSIS AND PREVENTION, 2023, 192
  • [25] A graphical model for context-aware visual content recommendation
    Boutemedjet, Sabri
    Ziou, Djemel
    IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (01) : 52 - 62
  • [26] Context-Aware Feature Learning for Noise Robust Person Search
    Zhao, Cairong
    Chen, Zhicheng
    Dou, Shuguang
    Qu, Zefan
    Yao, Jiawei
    Wu, Jun
    Miao, Duoqian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7047 - 7060
  • [27] Context-aware reinforcement learning for cooling operation of data centers with an Aquifer Thermal Energy Storage
    Leindals, Lukas
    Gronning, Peter
    Dominkovic, Dominik Franjo
    Junker, Rune Gronborg
    ENERGY AND AI, 2024, 17
  • [28] Context-Aware Data Augmentation for Efficient Object Detection by UAV Surveillance
    Gordienko, Yuri
    Rokovyi, Oleksandr
    Alienin, Oleg
    Stirenko, Sergii
    2022 10TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS), 2022,
  • [29] Context-aware Attention-based Data Augmentation for POI Recommendation
    Li, Yang
    Luo, Yadan
    Zhang, Zheng
    Sadiq, Shazia
    Cui, Peng
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, : 177 - 184
  • [30] Context-Aware Trajectory Prediction
    Bartoli, Federico
    Lisanti, Giuseppe
    Ballan, Lamberto
    Del Bimbo, Alberto
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1941 - 1946