Is an Object-Centric Video Representation Beneficial for Transfer?

Authors
Zhang, Chuhan [1]
Gupta, Ankush [2]
Zisserman, Andrew [1]
Affiliations
[1] Univ Oxford, Dept Engn Sci, Visual Geometry Grp, Oxford, England
[2] DeepMind, London, England
Source
COMPUTER VISION - ACCV 2022, PT IV | 2023 / Vol. 13844
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
Video action recognition; Object centric representations; Transfer learning;
DOI
10.1007/978-3-031-26316-3_23
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The objective of this work is to learn an object-centric video representation, with the aim of improving transferability to novel tasks, i.e., tasks different from the pre-training task of action classification. To this end, we introduce a new object-centric video recognition model based on a transformer architecture. The model learns a set of object-centric summary vectors for the video, and uses these vectors to fuse the visual and spatio-temporal trajectory 'modalities' of the video clip. We also introduce a novel trajectory contrast loss to further enhance objectness in these summary vectors. With experiments on four datasets (SomethingSomething-V2, Something-Else, Action Genome and EpicKitchens), we show that the object-centric model outperforms prior video representations (both object-agnostic and object-aware) when: (1) classifying actions on unseen objects and unseen environments; (2) low-shot learning of novel classes; (3) linear probing on other downstream tasks; as well as (4) for standard action classification.
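The abstract does not give the form of the trajectory contrast loss. Purely as an illustration of what a contrastive objective between per-object summary vectors and per-object trajectory embeddings might look like, the sketch below uses a generic InfoNCE-style loss in NumPy. The function name `info_nce`, the `temperature` value, and the assumption that row i of each array describes the same object are all hypothetical choices made for this example, not details taken from the paper.

```python
import numpy as np


def info_nce(summary, trajectory, temperature=0.1):
    """Generic InfoNCE-style loss between two aligned sets of vectors.

    summary, trajectory: arrays of shape (n_objects, dim). Row i of each is
    ASSUMED to describe the same object (the positive pair); all other rows
    act as negatives. This is an illustrative stand-in, not the paper's loss.
    """
    # L2-normalise so that dot products are cosine similarities.
    s = summary / np.linalg.norm(summary, axis=1, keepdims=True)
    t = trajectory / np.linalg.norm(trajectory, axis=1, keepdims=True)

    # (n, n) matrix of scaled similarities between every summary/trajectory pair.
    logits = s @ t.T / temperature

    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))

    # Positive pairs sit on the diagonal; average their negative log-probability.
    return float(-np.mean(np.diag(log_prob)))


# Toy data: four mutually orthogonal "object" vectors in 8 dimensions.
summary_vectors = np.eye(4, 8)
aligned_loss = info_nce(summary_vectors, summary_vectors.copy())
# Shifting the rows pairs each summary vector with the wrong trajectory.
shuffled_loss = info_nce(summary_vectors, np.roll(summary_vectors, 1, axis=0))
```

In this toy setup the loss is small when each summary vector is paired with its own trajectory embedding and large when the pairing is shifted, which is the behaviour a contrastive term of this kind is meant to encourage.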
Pages: 379 - 397
Page count: 19