A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning

Cited by: 138
Authors
Feichtenhofer, Christoph [1]
Fan, Haoqi [1]
Xiong, Bo [1]
Girshick, Ross [1]
He, Kaiming [1]
Affiliations
[1] Facebook AI Res FAIR, Paris, France
Source
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021
DOI
10.1109/CVPR46437.2021.00331
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present a large-scale study on unsupervised spatiotemporal representation learning from videos. With a unified perspective on four recent image-based frameworks, we study a simple objective that can easily generalize all these methods to space-time. Our objective encourages temporally-persistent features in the same video, and in spite of its simplicity, it works surprisingly well across: (i) different unsupervised frameworks, (ii) pre-training datasets, (iii) downstream datasets, and (iv) backbone architectures. We draw a series of intriguing observations from this study, e.g., we discover that encouraging long-spanned persistency can be effective even if the timespan is 60 seconds. In addition to state-of-the-art results in multiple benchmarks, we report a few promising cases in which unsupervised pre-training can outperform its supervised counterpart.
Pages: 3298-3308
Page count: 11