Deep Networks for Human Visual Attention: A Hybrid Model Using Foveal Vision

被引:2
|
作者
Almeida, Ana Filipa [1 ]
Figueiredo, Rui [1 ]
Bernardino, Alexandre [1 ]
Santos-Victor, Jose [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, Inst Syst & Robot, Lisbon, Portugal
关键词
Computer vision; Deep neural networks; Object classification and localization; Space-variant vision; Visual attention;
D O I
10.1007/978-3-319-70836-2_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual attention plays a central role in natural and artificial systems to control perceptual resources. The classic artificial visual attention systems uses salient features of the image obtained from the information given by predefined filters. Recently, deep neural networks have been developed for recognizing thousands of objects and autonomously generate visual characteristics optimized by training with large data sets. Besides being used for object recognition, these features have been very successful in other visual problems such as object segmentation, tracking and recently, visual attention. In this work we propose a biologically inspired object classification and localization framework that combines Deep Convolutional Neural Networks with foveal vision. First, a feed-forward pass is performed to obtain the predicted class labels. Next, we get the object location proposals by applying a segmentation mask on the saliency map calculated through a top-down backward pass. The main contribution of our work lies in the evaluation of the performances obtained with different non-uniform resolutions. We were able to establish a relationship between performance and the different levels of information preserved by each of the sensing configurations. The results demonstrate that we do not need to store and transmit all the information present on high-resolution images since, beyond a certain amount of preserved information, the performance in the classification and localization task saturates.
引用
收藏
页码:117 / 128
页数:12
相关论文
共 50 条
  • [31] Attention based Deep Hybrid Networks for Traffic Flow Prediction using Google Maps Data
    Rahman, Md. Moshiur
    Nower, Naushin
    [J]. PROCEEDINGS OF 2023 8TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2023, 2023, : 74 - 81
  • [32] Deep Modular Co-Attention Networks for Visual Question Answering
    Yu, Zhou
    Yu, Jun
    Cui, Yuhao
    Tao, Dacheng
    Tian, Qi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6274 - 6283
  • [33] Attention Map-Guided Visual Explanations for Deep Neural Networks
    An, Junkang
    Joe, Inwhee
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (08):
  • [34] Hybrid Integration of Visual Attention Model into Image Quality Metric
    Jung, Chanho
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (11): : 2971 - 2973
  • [35] A hybrid human recognition framework using machine learning and deep neural networks
    Sheneamer, Abdullah M.
    Halawi, Malik H.
    Al-Qahtani, Meshari H.
    [J]. PLOS ONE, 2024, 19 (06):
  • [36] Motion detection using a model of visual attention
    Zhang, Shijie
    Stentiford, Fred
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 1641 - 1644
  • [37] Identification of Visual Attention Regions in Machine Vision Using Saliency Map
    Kounte, Manjunath R.
    Sujatha, B. K.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2015, : 639 - 643
  • [38] Deep Prototypical Networks With Hybrid Residual Attention for Hyperspectral Image Classification
    Xi, Bobo
    Li, Jiaojiao
    Li, Yunsong
    Song, Rui
    Shi, Yanzi
    Liu, Songlin
    Du, Qian
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 3683 - 3700
  • [39] Posture Detection of Heart Disease Using Multi-Head Attention Vision Hybrid (MHAVH) Model
    Naz, Hina
    Zhang, Zuping
    Al-Habib, Mohammed
    Awwad, Fuad A.
    Ismail, Emad A. A.
    Khan, Zaid Ali
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (02): : 2673 - 2696
  • [40] Image quality enhancement using hybrid attention networks
    Wang, Jiachen
    Yang, Yingyun
    Hua, Yan
    [J]. IET IMAGE PROCESSING, 2022, 16 (02) : 521 - 534