Subitizing with Variational Autoencoders

被引:0
|
作者
Wever, Rijnder [1 ]
Runia, Tom F. H. [1 ]
机构
[1] Univ Amsterdam, Intelligent Sensory Informat Syst, Amsterdam, Netherlands
关键词
Object counting; Numerosity; Variational autoencoders; VISUAL SENSE; NUMBER; PARIETAL; REPRESENTATION; NUMEROSITY;
D O I
10.1007/978-3-030-11015-4_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerosity, the number of objects in a set, is a basic property of a given visual scene. Many animals develop the perceptual ability to subitize: the near-instantaneous identification of the numerosity in small sets of visual items. In computer vision, it has been shown that numerosity emerges as a statistical property in neural networks during unsupervised learning from simple synthetic images. In this work, we focus on more complex natural images using unsupervised hierarchical neural networks. Specifically, we show that variational autoencoders are able to spontaneously perform subitizing after training without supervision on a large amount of images from the Salient Object Subitizing dataset. While our method is unable to outperform supervised convolutional networks for subitizing, we observe that the networks learn to encode numerosity as a basic visual property. Moreover, we find that the learned representations are likely invariant to object area; an observation in alignment with studies on biological neural networks in cognitive neuroscience.
引用
收藏
页码:617 / 627
页数:11
相关论文
共 50 条
  • [1] Mixture variational autoencoders
    Jiang, Shuoran
    Chen, Yarui
    Yang, Jucheng
    Zhang, Chuanlei
    Zhao, Tingting
    [J]. PATTERN RECOGNITION LETTERS, 2019, 128 : 263 - 269
  • [2] An Introduction to Variational Autoencoders
    Kingma, Diederik P.
    Welling, Max
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2019, 12 (04): : 4 - 89
  • [3] Mixtures of Variational Autoencoders
    Ye, Fei
    Bors, Adrian G.
    [J]. 2020 TENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2020,
  • [4] Variational Laplace Autoencoders
    Park, Yookoon
    Kim, Chris Dongjoo
    Kim, Gunhee
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [5] Diffusion Variational Autoencoders
    Rey, Luis A. Perez
    Menkovski, Vlado
    Portegies, Jim
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2704 - 2710
  • [6] Ladder Variational Autoencoders
    Sonderby, Casper Kaae
    Raiko, Tapani
    Maaloe, Lars
    Sonderby, Soren Kaae
    Winther, Ole
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [7] Overdispersed Variational Autoencoders
    Shah, Harshil
    Barber, David
    Botev, Aleksandar
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1109 - 1116
  • [8] Tree Variational Autoencoders
    Manduchi, Laura
    Vandenhirtz, Moritz
    Ryser, Alain
    Vogt, Julia E.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] Affine Variational Autoencoders
    Bidart, Rene
    Wong, Alexander
    [J]. IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 461 - 472
  • [10] Clockwork Variational Autoencoders
    Saxena, Vaibhav
    Ba, Jimmy
    Hafner, Danijar
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34