A brain-inspired object-based attention network for multiobject recognition and visual reasoning

被引:4
|
作者
Adeli, Hossein [1 ]
Ahn, Seoyoung [1 ]
Zelinsky, Gregory J. [1 ,2 ]
机构
[1] SUNY Stony Brook, Dept Psychol, Stony Brook, NY USA
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA
来源
JOURNAL OF VISION | 2023年 / 23卷 / 05期
关键词
CONVOLUTIONAL NEURAL-NETWORKS; ZOOM LENS; PERCEPTION; MODEL; MECHANISMS; GRADIENT; TASK;
D O I
10.1167/jov.23.5.16
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
The visual system uses sequences of selective glimpses to objects to support goal-directed behavior, but how is this attention control learned? Here we present an encoder-decoder model inspired by the interacting bottom-up and top-down visual pathways making up the recognition-attention system in the brain. At every iteration, a new glimpse is taken from the image and is processed through the "what" encoder, a hierarchy of feedforward, recurrent, and capsule layers, to obtain an object-centric (object-file) representation. This representation feeds to the "where" decoder, where the evolving recurrent representation provides top-down attentional modulation to plan subsequent glimpses and impact routing in the encoder. We demonstrate how the attention mechanism significantly improves the accuracy of classifying highly overlapping digits. In a visual reasoning task requiring comparison of two objects, our model achieves near-perfect accuracy and significantly outperforms larger models in generalizing to unseen stimuli. Our work demonstrates the benefits of object-based attention mechanisms taking sequential glimpses of objects.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Brain-inspired models for visual object recognition: an overview
    Yang, Xi
    Yan, Jie
    Wang, Wen
    Li, Shaoyi
    Hu, Bo
    Lin, Jian
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (07) : 5263 - 5311
  • [2] Brain-inspired models for visual object recognition: an overview
    Xi Yang
    Jie Yan
    Wen Wang
    Shaoyi Li
    Bo Hu
    Jian Lin
    Artificial Intelligence Review, 2022, 55 (7) : 5263 - 5311
  • [3] A biologically inspired object-based visual attention model
    Longsheng Wei
    Nong Sang
    Yuehuan Wang
    Artificial Intelligence Review, 2010, 34 : 109 - 119
  • [4] A biologically inspired object-based visual attention model
    Wei, Longsheng
    Sang, Nong
    Wang, Yuehuan
    ARTIFICIAL INTELLIGENCE REVIEW, 2010, 34 (02) : 109 - 119
  • [5] Brain-Inspired Visual Attention Modeling Based on EEG for Intelligent Robotics
    Hu, Shuzhan
    Duan, Yiping
    Tao, Xiaoming
    Chu, Jian
    Lu, Jianhua
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2024, 18 (03) : 431 - 443
  • [6] Brain-inspired automated visual object discovery and detection
    Chen, Lichao
    Singh, Sudhir
    Kailath, Thomas
    Roychowdhury, Vwani
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (01) : 96 - 105
  • [7] COSFIRE: A Brain-Inspired Approach to Visual Pattern Recognition
    Azzopardi, George
    Petkov, Nicolai
    BRAIN-INSPIRED COMPUTING, 2014, 8603 : 76 - 87
  • [8] A bio-inspired method and system for visual object-based attention and segmentation
    Huber, David J.
    Khosla, Deepak
    AUTOMATIC TARGET RECOGNITION XX; ACQUISITION, TRACKING, POINTING, AND LASER SYSTEMS TECHNOLOGIES XXIV; AND OPTICAL PATTERN RECOGNITION XXI, 2010, 7696
  • [9] Object-based auditory and visual attention
    Shinn-Cunningham, Barbara G.
    TRENDS IN COGNITIVE SCIENCES, 2008, 12 (05) : 182 - 186
  • [10] BI-AVAN: A Brain-Inspired Adversarial Visual Attention Network for Characterizing Human Visual Attention From Neural Activity
    Huang, Heng
    Zhao, Lin
    Dai, Haixing
    Zhang, Lu
    Hu, Xintao
    Zhu, Dajiang
    Liu, Tianming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 11191 - 11203