CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning

被引:0
|
作者
Ho, Chi-Kai [1 ]
King, Chung-Ta [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 300044, Taiwan
关键词
Spatiotemporal phenomena; Task analysis; Correlation; Three-dimensional displays; Representation learning; Reinforcement learning; Feature extraction; 3D CNNs; contrastive learning; spatio-temporal representation learning; sample efficiency;
D O I
10.1109/ACCESS.2023.3258540
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning representations from high-dimensional observations is critical for training of pixel-based continuous control tasks with reinforcement learning (RL). Without proper representations, the training will be very inefficient, requiring long training time and huge training data to learn directly from low-level pixel observations. Yet, a lot of information in such observations may be redundant or irrelevant. A common approach to solving this problem is to train auxiliary objectives alongside the main RL objective. The additional objectives provide more signals to the model and reduce the training time, resulting in better sample efficiency. A representative work is Contrastive Unsupervised Representations for Reinforcement Learning (CURL), which leverages contrastive learning to assist RL to learn useful representations. Although CURL performs very well in extracting spatial information from pixel inputs, it is found to overlook potential temporal signals. In this paper, a contrastive spatio-temporal representation learning framework for RL, called CST-RL, is introduced, which leverages 3D Convolutional Neural Network (3D CNN) alongside contrastive learning for sample-efficient RL. It pays attention to both spatial and temporal signals in pixel observations. Experiments based on DMControl show that CST-RL outperforms CURL in all six environments after 500K environment steps and only needs half of the steps to achieve the standard score in the majority of cases.
引用
收藏
页码:26820 / 26831
页数:12
相关论文
共 50 条
  • [1] STACoRe: Spatio-temporal and action-based contrastive representations for reinforcement learning in Atari
    Lee, Young Jae
    Kim, Jaehoon
    Kwak, Mingu
    Park, Young Joon
    Kim, Seoung Bum
    [J]. NEURAL NETWORKS, 2023, 160 : 1 - 11
  • [2] Spatio-Temporal Meta Contrastive Learning
    Tang, Jiabin
    Xia, Lianghao
    Hu, Jie
    Huang, Chao
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2412 - 2421
  • [3] Dual Contrastive Learning for Spatio-temporal Representation
    Ding, Shuangrui
    Qian, Rui
    Xiong, Hongkai
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5649 - 5658
  • [4] Learning Representations by Contrastive Spatio-Temporal Clustering for Skeleton-Based Action Recognition
    Wang, Mingdao
    Li, Xueming
    Chen, Siqi
    Zhang, Xianlin
    Ma, Lei
    Zhang, Yue
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3207 - 3220
  • [5] LEARNING SPATIO-TEMPORAL REPRESENTATIONS WITH TEMPORAL SQUEEZE POOLING
    Huang, Guoxi
    Bors, Adrian G.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2103 - 2107
  • [6] Spatio-temporal fusion and contrastive learning for urban flow prediction
    Zhang, Xu
    Gong, Yongshun
    Zhang, Chengqi
    Wu, Xiaoming
    Guo, Ying
    Lu, Wenpeng
    Zhao, Long
    Dong, Xiangjun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 282
  • [7] Spatio-Temporal Enhanced Contrastive and Contextual Learning for Weather Forecasting
    Gong, Yongshun
    He, Tiantian
    Chen, Meng
    Wang, Bin
    Nie, Liqiang
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (08) : 4260 - 4274
  • [8] Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision
    Yuan, Liangzhe
    Qian, Rui
    Cui, Yin
    Gong, Boqing
    Schroff, Florian
    Yang, Ming-Hsuan
    Adam, Hartwig
    Liu, Ting
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13957 - 13966
  • [9] Quantum Reinforcement Learning for Spatio-Temporal Prioritization in Metaverse
    Park, Soohyun
    Baek, Hankyul
    Kim, Joongheon
    [J]. IEEE ACCESS, 2024, 12 : 54732 - 54744
  • [10] Estimating spatio-temporal fields through reinforcement learning
    Padrao, Paulo
    Fuentes, Jose
    Bobadilla, Leonardo
    Smith, Ryan N.
    [J]. FRONTIERS IN ROBOTICS AND AI, 2022, 9