A brain-inspired object-based attention network for multiobject recognition and visual reasoning

被引:4
|
作者
Adeli, Hossein [1 ]
Ahn, Seoyoung [1 ]
Zelinsky, Gregory J. [1 ,2 ]
机构
[1] SUNY Stony Brook, Dept Psychol, Stony Brook, NY USA
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA
来源
JOURNAL OF VISION | 2023年 / 23卷 / 05期
关键词
CONVOLUTIONAL NEURAL-NETWORKS; ZOOM LENS; PERCEPTION; MODEL; MECHANISMS; GRADIENT; TASK;
D O I
10.1167/jov.23.5.16
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
The visual system uses sequences of selective glimpses to objects to support goal-directed behavior, but how is this attention control learned? Here we present an encoder-decoder model inspired by the interacting bottom-up and top-down visual pathways making up the recognition-attention system in the brain. At every iteration, a new glimpse is taken from the image and is processed through the "what" encoder, a hierarchy of feedforward, recurrent, and capsule layers, to obtain an object-centric (object-file) representation. This representation feeds to the "where" decoder, where the evolving recurrent representation provides top-down attentional modulation to plan subsequent glimpses and impact routing in the encoder. We demonstrate how the attention mechanism significantly improves the accuracy of classifying highly overlapping digits. In a visual reasoning task requiring comparison of two objects, our model achieves near-perfect accuracy and significantly outperforms larger models in generalizing to unseen stimuli. Our work demonstrates the benefits of object-based attention mechanisms taking sequential glimpses of objects.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Efficient Visual Recognition: A Survey on Recent Advances and Brain-inspired Methodologies
    Wu, Yang
    Wang, Ding-Heng
    Lu, Xiao-Tong
    Yang, Fan
    Yao, Man
    Dong, Wei-Sheng
    Shi, Jian-Bo
    Li, Guo-Qi
    MACHINE INTELLIGENCE RESEARCH, 2022, 19 (05) : 366 - 411
  • [22] Efficient Visual Recognition: A Survey on Recent Advances and Brain-inspired Methodologies
    Yang Wu
    Ding-Heng Wang
    Xiao-Tong Lu
    Fan Yang
    Man Yao
    Wei-Sheng Dong
    Jian-Bo Shi
    Guo-Qi Li
    Machine Intelligence Research, 2022, 19 (05) : 366 - 411
  • [23] Efficient Visual Recognition: A Survey on Recent Advances and Brain-inspired Methodologies
    Yang Wu
    Ding-Heng Wang
    Xiao-Tong Lu
    Fan Yang
    Man Yao
    Wei-Sheng Dong
    Jian-Bo Shi
    Guo-Qi Li
    Machine Intelligence Research, 2022, 19 : 366 - 411
  • [24] Brain-inspired modular echo state network for EEG-based emotion recognition
    Yang, Liuyi
    Wang, Zhaoze
    Wang, Guoyu
    Liang, Lixin
    Liu, Meng
    Wang, Junsong
    FRONTIERS IN NEUROSCIENCE, 2024, 18
  • [25] Object-based attention requires monocular visual pathways
    Strommer, N.
    Al-Janabi, S.
    Greenberg, A. S.
    Gabay, S.
    PSYCHONOMIC BULLETIN & REVIEW, 2024, 31 (04) : 1880 - 1890
  • [26] Object-based visual selective attention and perceptual organization
    Watson, SE
    Kramer, AF
    PERCEPTION & PSYCHOPHYSICS, 1999, 61 (01): : 31 - 49
  • [27] On the spatial extent of attention in object-based visual selection
    Lavie, N
    Driver, J
    PERCEPTION & PSYCHOPHYSICS, 1996, 58 (08): : 1238 - 1251
  • [28] A Novel Hierarchical Framework for Object-Based Visual Attention
    Marfil, Rebecca
    Bandera, Antonio
    Antonio Rodriguez, Juan
    Sandoval, Francisco
    ATTENTION IN COGNITIVE SYSTEMS, 2009, 5395 : 27 - 40
  • [29] A model of space and object-based attention for visual saliency
    Zhong, Jingjing
    Luo, Siwei
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 237 - +
  • [30] Object-based visual attention in luminance increment detection?
    Stuart, GW
    Maruff, P
    Currie, J
    NEUROPSYCHOLOGIA, 1997, 35 (06) : 843 - 853