Object-centric Learning with Capsule Networks: A Survey

被引:0
|
作者
Ribeiro, Fabio De Sousa [1 ]
Duarte, Kevin [2 ]
Everett, Miles [3 ]
Leontidis, Georgios [3 ]
Shah, Mubarak [2 ]
机构
[1] Imperial Coll London, Dept Comp, London, England
[2] Univ Cent Florida, Elect Engn & Comp Sci, Orlando, FL 32816 USA
[3] Univ Aberdeen, Dept Comp Sci, Aberdeen, Scotland
关键词
Deep learning; capsule networks; deep neural networks; convolutional neural networks; transformers; routing-by-agreement; self-attention; representation learning; object-centric learning; generative models; computer vision; ATTENTION; IMAGES;
D O I
10.1145/3674500
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Capsule networks emerged as a promising alternative to convolutional neural networks for learning object-centric representations. The idea is to explicitly model part-whole hierarchies by using groups of neurons called capsules to encode visual entities, then learn the relationships between these entities dynamically from data. However, a major hurdle for capsule network research has been the lack of a reliable point of reference for understanding their foundational ideas and motivations. This survey provides a comprehensive and critical overview of capsule networks, which aims to serve as a main point of reference going forward. To that end, we introduce the fundamental concepts and motivations behind capsule networks, such as equivariant inference. We then cover various technical advances in capsule routing algorithms as well as alternative geometric and generative formulations. We provide a detailed explanation of how capsule networks relate to the attention mechanism in Transformers and uncover non-trivial conceptual similarities between them in the context of object-centric representation learning. We also review the extensive applications of capsule networks in computer vision, video and motion, graph representation learning, natural language processing, medical imaging, and many others. To conclude, we provide an in-depth discussion highlighting promising directions for future work.
引用
收藏
页数:43
相关论文
共 50 条
  • [31] Representation learning from videos in-the-wild: An object-centric approach
    Romijnders, Rob
    Mahendran, Aravindh
    Tschannen, Michael
    Djolonga, Josip
    Ritter, Marvin
    Houlsby, Neil
    Lucic, Mario
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 177 - 187
  • [32] Permission Analysis for Object-Centric Processes
    Breitmayer, Marius
    Arnold, Lisa
    Reichert, Manfred
    [J]. INTELLIGENT INFORMATION SYSTEMS, CAISE FORUM 2024, 2024, 520 : 11 - 19
  • [33] Data Dreaming for Object Detection: Learning Object-Centric State Representations for Visual Imitation
    Sieb, Maximilian
    Fragkiadaki, Katerina
    [J]. 2018 IEEE-RAS 18TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2018, : 806 - 813
  • [34] Multi-Object Representation Learning via Feature Connectivity and Object-Centric Regularization
    Foo, Alex
    Hsu, Wynne
    Lee, Mong Li
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [35] Shepherding Slots to Objects: Towards Stable and Robust Object-Centric Learning
    Kim, Jinwoo
    Choi, Janghyuk
    Choi, Ho-Jin
    Kim, Seon Joo
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19198 - 19207
  • [36] Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views
    Nanbo, Li
    Eastwood, Cian
    Fisher, Robert B.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [37] Object-Centric Conformance Alignments with Synchronization
    Gianola, Alessandro
    Montali, Marco
    Winkler, Sarah
    [J]. ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2024, 2024, 14663 : 3 - 19
  • [38] Object-centric process predictive analytics
    Galanti, Riccardo
    De Leoni, Massimiliano
    Navarin, Nicola
    Marazzi, Alan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [39] Adapting Object-Centric Probabilistic Movement Primitives with Residual Reinforcement Learning
    Carvalho, Joao
    Koert, Dorothea
    Daniv, Marek
    Peters, Jan
    [J]. 2022 IEEE-RAS 21ST INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2022, : 405 - 412
  • [40] Multimodal embodied attribute learning by robots for object-centric action policies
    Zhang, Xiaohan
    Amiri, Saeid
    Sinapov, Jivko
    Thomason, Jesse
    Stone, Peter
    Zhang, Shiqi
    [J]. AUTONOMOUS ROBOTS, 2023, 47 (05) : 505 - 528