Object-Centric Multiple Object Tracking

Citations: 0
Authors
Zhao, Zixu [1 ]
Wang, Jiaze [2 ]
Horn, Max [1 ]
Ding, Yizhuo [3 ]
He, Tong [1 ]
Bai, Zechen [1 ]
Zietlow, Dominik [1 ]
Simon-Gabriel, Carl-Johann [1 ]
Shuai, Bing [1 ]
Tu, Zhuowen [1 ]
Brox, Thomas [1 ]
Schiele, Bernt [1 ]
Fu, Yanwei [3 ]
Locatello, Francesco [1 ]
Zhang, Zheng [1 ]
Xiao, Tianjun [1 ]
Affiliations
[1] Amazon Web Serv, Beijing, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[3] Fudan Univ, Shanghai, Peoples R China
DOI
10.1109/ICCV51070.2023.01522
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Unsupervised object-centric learning methods allow the partitioning of scenes into entities without additional localization information and are excellent candidates for reducing the annotation burden of multiple-object tracking (MOT) pipelines. Unfortunately, they lack two key properties: objects are often split into parts and are not consistently tracked over time. In fact, state-of-the-art models achieve pixel-level accuracy and temporal consistency by relying on supervised object detection with additional ID labels for the association through time. This paper proposes a video object-centric model for MOT. It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module that builds complete object prototypes to handle occlusions. Benefiting from object-centric learning, we only require sparse detection labels (0%-6.25%) for object localization and feature binding. Relying on our self-supervised Expectation-Maximization-inspired loss for object association, our approach requires no ID labels. Our experiments significantly narrow the gap between the existing object-centric model and the fully supervised state-of-the-art and outperform several unsupervised trackers. Code is available at https://github.com/amazon-science/object-centric-multiple-object-tracking.
Pages: 16555-16565
Page count: 11
Related papers
50 in total
  • [21] Time-traveling object-centric breakpoints
    Bourcier, Valentin
    Costiou, Steven
    Santander, Maximilian Ignacio Willembrinck
    Vanegue, Adrien
    Etien, Anne
    [J]. JOURNAL OF COMPUTER LANGUAGES, 2024, 80
  • [22] An Object-Centric Paradigm for Robot Programming by Demonstration
    Huang, Di-Wei
    Katz, Garrett E.
    Langsfeld, Joshua D.
    Oh, Hyuk
    Gentili, Rodolphe J.
    Reggia, James A.
    [J]. FOUNDATIONS OF AUGMENTED COGNITION, AC 2015, 2015, 9183 : 745 - 756
  • [23] Is an Object-Centric Video Representation Beneficial for Transfer?
    Zhang, Chuhan
    Gupta, Ankush
    Zisserman, Andrew
    [J]. COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 : 379 - 397
  • [24] Precision and Fitness in Object-Centric Process Mining
    Adams, Jan Niklas
    van der Aalst, Wil M. P.
    [J]. 2021 3RD INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2021), 2021, : 128 - 135
  • [25] SSVEP stimuli design for object-centric BCI
    Gergondet, Pierre
    Kheddar, Abderrahmane
    [J]. BRAIN-COMPUTER INTERFACES, 2015, 2 (01) : 11 - 28
  • [26] Object-Centric Spatial Pooling for Image Classification
    Russakovsky, Olga
    Lin, Yuanqing
    Yu, Kai
    Li, Fei-Fei
    [J]. COMPUTER VISION - ECCV 2012, PT II, 2012, 7573 : 1 - 15
  • [27] Manifold geometric invariants and object-centric approach
    Jannson, TP
    [J]. APPLICATIONS AND SCIENCE OF NEURAL NETWORKS, FUZZY SYSTEMS, AND EVOLUTIONARY COMPUTATION V, 2002, 4787 : 158 - 173
  • [28] Object-Centric Stereo Matching for 3D Object Detection
    Pon, Alex D.
    Ku, Jason
    Li, Chengyao
    Waslander, Steven L.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 8383 - 8389
  • [29] Deep convolution neural network with scene-centric and object-centric information for object detection
    Shen, Zong-Ying
    Han, Shiang-Yu
    Fu, Li-Chen
    Hsiao, Pei-Yung
    Lau, Yo-Chung
    Chang, Sheng-Jen
    [J]. IMAGE AND VISION COMPUTING, 2019, 85 : 14 - 25
  • [30] Object-centric Learning with Capsule Networks: A Survey
    Ribeiro, Fabio De Sousa
    Duarte, Kevin
    Everett, Miles
    Leontidis, Georgios
    Shah, Mubarak
    [J]. ACM COMPUTING SURVEYS, 2024, 56 (11)