Predicting Eye Fixations using Convolutional Neural Networks

Authors
Liu, Nian [1]
Han, Junwei [1]
Zhang, Dingwen [1]
Wen, Shifeng [1]
Liu, Tianming [2]
Affiliations
[1] Northwestern Polytech Univ, Xian Shi, Shaanxi Sheng, Peoples R China
[2] Univ Georgia, Athens, GA 30602 USA
Keywords
SALIENCY; FRAMEWORK; SELECTION; OBJECTS; IMAGE; MODEL;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
It is believed that eye movements in free-viewing of natural scenes are directed by both bottom-up visual saliency and top-down visual factors. In this paper, we propose a novel computational framework to simultaneously learn these two types of visual features from raw image data using a multiresolution convolutional neural network (Mr-CNN) for predicting eye fixations. The Mr-CNN is trained directly on image regions centered on fixation and non-fixation locations at multiple resolutions, using raw image pixels as inputs and eye fixation attributes as labels. Diverse top-down visual features can be learned in higher layers. Meanwhile, bottom-up visual saliency can also be inferred by combining information over multiple resolutions. Finally, optimal integration of bottom-up and top-down cues can be learned in the last logistic regression layer to predict eye fixations. The proposed approach achieves state-of-the-art results on four publicly available benchmark datasets, demonstrating the superiority of our work.
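The input and output stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the patch size, the stride-based subsampling used to emulate coarser resolutions, and the use of NumPy are all assumptions made for illustration, and the convolutional layers that would sit between the two steps are omitted entirely.

```python
import numpy as np


def extract_multires_patches(image, center, patch_size=8, strides=(1, 2, 4)):
    """Crop one patch per stride, all centered on the same pixel.

    Hypothetical helper: a larger stride covers a wider field of view at a
    coarser resolution, so every patch has the same shape. `image` is a 2-D
    grayscale array and `patch_size` is assumed to be even.
    """
    row, col = center
    half = patch_size // 2
    patches = []
    for stride in strides:
        reach = half * stride
        # Reflect-pad so that crops near the image border stay well defined.
        padded = np.pad(image, reach, mode="reflect")
        # In padded coordinates the center sits at (row + reach, col + reach),
        # so the top-left corner of the crop is simply (row, col).
        crop = padded[row:row + 2 * reach:stride, col:col + 2 * reach:stride]
        patches.append(crop)
    return np.stack(patches)


def fixation_probability(features, weights, bias):
    """Logistic regression output: probability that a location is fixated.

    In the paper the features would come from the learned network layers;
    here `features` is any 1-D vector, which is an assumption of this sketch.
    """
    return 1.0 / (1.0 + np.exp(-(features @ weights + bias)))
```

Under these assumptions, a training example would pair the stacked patches for one location with a binary label (fixation or non-fixation), and the logistic output would be read as the predicted saliency at that location.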
Pages: 362-370
Number of pages: 9