Capsule Networks as Generative Models

被引:0
|
作者
Kiefer, Alex B. [1 ,2 ]
Millidge, Beren [1 ,3 ]
Tschantz, Alexander [1 ]
Buckley, Christopher L. [1 ,4 ]
机构
[1] VERSES Res Lab, Culver City, CA 90230 USA
[2] Monash Univ, Melbourne, Australia
[3] Univ Oxford, MRC Brain Network Dynam Unit, Oxford, England
[4] Univ Sussex, Sussex AI Grp, Dept Informat, Brighton, England
来源
ACTIVE INFERENCE, IWAI 2022 | 2023年 / 1721卷
关键词
D O I
10.1007/978-3-031-28719-0_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Capsule networks are a neural network architecture specialized for visual scene recognition. Features and pose information are extracted from a scene and then dynamically routed through a hierarchy of vector-valued nodes called 'capsules' to create an implicit scene graph, with the ultimate aim of learning vision directly as inverse graphics. Despite these intuitions, however, capsule networks are not formulated as explicit probabilistic generative models; moreover, the routing algorithms typically used are ad-hoc and primarily motivated by algorithmic intuition. In this paper, we derive an alternative capsule routing algorithm utilizing iterative inference under sparsity constraints. We then introduce an explicit probabilistic generative model for capsule networks based on the self-attention operation in transformer networks and show how it is related to a variant of predictive coding networks using Von-Mises-Fisher (VMF) circular Gaussian distributions.
引用
收藏
页码:192 / 209
页数:18
相关论文
共 50 条
  • [1] Capsule Generative Models
    Li, Yifeng
    Zhu, Xiaodan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 281 - 295
  • [2] Inference and Learning for Generative Capsule Models
    Nazabal, Alfredo
    Tsagkas, Nikolaos
    Williams, Christopher K. I.
    [J]. NEURAL COMPUTATION, 2023, 35 (04) : 727 - 761
  • [3] Refined Image Colorization Using Capsule Generative Adversarial Networks
    Hosni, Raggi
    Hussein, Walid
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [4] Generative Models for Cellular Social Networks
    Deka, Deepjyoti
    Vishwanath, Sriram
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2014, : 1108 - 1112
  • [5] Microscopic Generative Models for Complex Networks
    Zhang, Yueli
    Marbach, Peter
    [J]. 2014 48TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2014,
  • [6] Deep Generative Models for Spatial Networks
    Guo, Xiaojie
    Du, Yuanqi
    Zhao, Liang
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 505 - 515
  • [7] On the Evaluation of Generative Adversarial Networks By Discriminative Models
    Torfi, Amirsina
    Beyki, Mohammadreza
    Fox, Edward A.
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 991 - 998
  • [8] Neural networks generative models for time series
    Gatta, Federico
    Giampaolo, Fabio
    Prezioso, Edoardo
    Mei, Gang
    Cuomo, Salvatore
    Piccialli, Francesco
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 7920 - 7939
  • [9] OPENING DEEP NEURAL NETWORKS WITH GENERATIVE MODELS
    Vendramini, Marcos
    Oliveira, Hugo
    Machado, Alexei
    dos Santos, Jefersson A.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1314 - 1318
  • [10] Extending generative models of large scale networks
    Lofdahl, Corey
    Stickgold, Eli
    Skarin, Bruce
    Stewart, Ian
    [J]. 6TH INTERNATIONAL CONFERENCE ON APPLIED HUMAN FACTORS AND ERGONOMICS (AHFE 2015) AND THE AFFILIATED CONFERENCES, AHFE 2015, 2015, 3 : 3868 - 3875