A brain-inspired object-based attention network for multiobject recognition and visual reasoning

被引:4
|
作者
Adeli, Hossein [1 ]
Ahn, Seoyoung [1 ]
Zelinsky, Gregory J. [1 ,2 ]
机构
[1] SUNY Stony Brook, Dept Psychol, Stony Brook, NY USA
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA
来源
JOURNAL OF VISION | 2023年 / 23卷 / 05期
关键词
CONVOLUTIONAL NEURAL-NETWORKS; ZOOM LENS; PERCEPTION; MODEL; MECHANISMS; GRADIENT; TASK;
D O I
10.1167/jov.23.5.16
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
The visual system uses sequences of selective glimpses to objects to support goal-directed behavior, but how is this attention control learned? Here we present an encoder-decoder model inspired by the interacting bottom-up and top-down visual pathways making up the recognition-attention system in the brain. At every iteration, a new glimpse is taken from the image and is processed through the "what" encoder, a hierarchy of feedforward, recurrent, and capsule layers, to obtain an object-centric (object-file) representation. This representation feeds to the "where" decoder, where the evolving recurrent representation provides top-down attentional modulation to plan subsequent glimpses and impact routing in the encoder. We demonstrate how the attention mechanism significantly improves the accuracy of classifying highly overlapping digits. In a visual reasoning task requiring comparison of two objects, our model achieves near-perfect accuracy and significantly outperforms larger models in generalizing to unseen stimuli. Our work demonstrates the benefits of object-based attention mechanisms taking sequential glimpses of objects.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] An Object-Based Visual Attention Model for Robotic Applications
    Yu, Yuanlong
    Mann, George K. I.
    Gosine, Raymond G.
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2010, 40 (05): : 1398 - 1412
  • [32] Role of feature space in object-based visual attention
    Reilly, CE
    JOURNAL OF NEUROLOGY, 2000, 247 (12) : 987 - 988
  • [33] Object-based visual selective attention and perceptual organization
    Stephen E. Watson
    Arthur F. Kramer
    Perception & Psychophysics, 1999, 61 : 31 - 49
  • [34] The effects of visual search efficiency on object-based attention
    Greenberg, Adam S.
    Rosen, Maya
    Cutrone, Elizabeth
    Behrmann, Marlene
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2015, 77 (05) : 1544 - 1557
  • [35] Random visual noise impairs object-based attention
    Richard A. Abrams
    Mark B. Law
    Experimental Brain Research, 2002, 142 : 349 - 353
  • [36] Visual parsing and object-based attention: A developmental perspective
    Baylis, GC
    COGNITIVE NEUROSCIENCE OF ATTENTION: A DEVELOPMENTAL PERSPECTIVE, 1998, : 251 - 286
  • [37] The effects of visual search efficiency on object-based attention
    Adam S. Greenberg
    Maya Rosen
    Elizabeth Cutrone
    Marlene Behrmann
    Attention, Perception, & Psychophysics, 2015, 77 : 1544 - 1557
  • [38] Random visual noise impairs object-based attention
    Abrams, RA
    Law, MB
    EXPERIMENTAL BRAIN RESEARCH, 2002, 142 (03) : 349 - 353
  • [39] BrainCog: A spiking neural network based, brain-inspired cognitive intelligence engine for brain-inspired AI and brain simulation
    Zeng, Yi
    Zhao, Dongcheng
    Zhao, Feifei
    Shen, Guobin
    Dong, Yiting
    Lu, Enmeng
    Zhang, Qian
    Sun, Yinqian
    Liang, Qian
    Zhao, Yuxuan
    Zhao, Zhuoya
    Fang, Hongjian
    Wang, Yuwei
    Li, Yang
    Liu, Xin
    Du, Chengcheng
    Kong, Qingqun
    Ruan, Zizhe
    Bi, Weida
    PATTERNS, 2023, 4 (08):
  • [40] Space-based and object-based functions of visual attention
    Mueller-Plath, G
    PERCEPTION, 2002, 31 : 172 - 172