Is an Object-Centric Video Representation Beneficial for Transfer?

Authors
Zhang, Chuhan [1]
Gupta, Ankush [2]
Zisserman, Andrew [1]
Affiliations
[1] Univ Oxford, Dept Engn Sci, Visual Geometry Grp, Oxford, England
[2] DeepMind, London, England
Source
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Video action recognition; Object centric representations; Transfer learning
DOI
10.1007/978-3-031-26316-3_23
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The objective of this work is to learn an object-centric video representation, with the aim of improving transferability to novel tasks, i.e., tasks different from the pre-training task of action classification. To this end, we introduce a new object-centric video recognition model based on a transformer architecture. The model learns a set of object-centric summary vectors for the video, and uses these vectors to fuse the visual and spatio-temporal trajectory 'modalities' of the video clip. We also introduce a novel trajectory contrast loss to further enhance objectness in these summary vectors. With experiments on four datasets (SomethingSomething-V2, Something-Else, Action Genome and EpicKitchens), we show that the object-centric model outperforms prior video representations (both object-agnostic and object-aware) when: (1) classifying actions on unseen objects and unseen environments; (2) low-shot learning of novel classes; (3) linear probe to other downstream tasks; as well as (4) for standard action classification.
Pages: 379-397
Page count: 19