Subitizing with Variational Autoencoders

被引：0

作者：

Wever, Rijnder ^{[1
]}

Runia, Tom F. H. ^{[1
]}

机构：

[1] Univ Amsterdam, Intelligent Sensory Informat Syst, Amsterdam, Netherlands

来源：

COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III | 2019年 / 11131卷

关键词：

Object counting; Numerosity; Variational autoencoders; VISUAL SENSE; NUMBER; PARIETAL; REPRESENTATION; NUMEROSITY;

D O I：

10.1007/978-3-030-11015-4_47

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Numerosity, the number of objects in a set, is a basic property of a given visual scene. Many animals develop the perceptual ability to subitize: the near-instantaneous identification of the numerosity in small sets of visual items. In computer vision, it has been shown that numerosity emerges as a statistical property in neural networks during unsupervised learning from simple synthetic images. In this work, we focus on more complex natural images using unsupervised hierarchical neural networks. Specifically, we show that variational autoencoders are able to spontaneously perform subitizing after training without supervision on a large amount of images from the Salient Object Subitizing dataset. While our method is unable to outperform supervised convolutional networks for subitizing, we observe that the networks learn to encode numerosity as a basic visual property. Moreover, we find that the learned representations are likely invariant to object area; an observation in alignment with studies on biological neural networks in cognitive neuroscience.

引用

页码：617 / 627

页数：11

共 50 条

[1] Mixture variational autoencoders
Jiang, Shuoran
Chen, Yarui
Yang, Jucheng
Zhang, Chuanlei
Zhao, Tingting
[J]. PATTERN RECOGNITION LETTERS, 2019, 128 : 263 - 269
[2] An Introduction to Variational Autoencoders
Kingma, Diederik P.
Welling, Max
[J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2019, 12 (04): : 4 - 89
[3] Mixtures of Variational Autoencoders
Ye, Fei
Bors, Adrian G.
[J]. 2020 TENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2020,
[4] Variational Laplace Autoencoders
Park, Yookoon
Kim, Chris Dongjoo
Kim, Gunhee
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[5] Diffusion Variational Autoencoders
Rey, Luis A. Perez
Menkovski, Vlado
Portegies, Jim
[J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2704 - 2710
[6] Ladder Variational Autoencoders
Sonderby, Casper Kaae
Raiko, Tapani
Maaloe, Lars
Sonderby, Soren Kaae
Winther, Ole
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[7] Overdispersed Variational Autoencoders
Shah, Harshil
Barber, David
Botev, Aleksandar
[J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1109 - 1116
[8] Tree Variational Autoencoders
Manduchi, Laura
Vandenhirtz, Moritz
Ryser, Alain
Vogt, Julia E.
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[9] Affine Variational Autoencoders
Bidart, Rene
Wong, Alexander
[J]. IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 461 - 472
[10] Clockwork Variational Autoencoders
Saxena, Vaibhav
Ba, Jimmy
Hafner, Danijar
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34

← 1 2 3 4 5 →