Two-Stage Learning to Predict Human Eye Fixations via SDAEs

Cited by: 114
Authors
Han, Junwei [1]
Zhang, Dingwen [1]
Wen, Shifeng [1]
Guo, Lei [1]
Liu, Tianming [2]
Li, Xuelong [3]
Affiliations
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Univ Georgia, Dept Comp Sci, Athens, GA 30602 USA
[3] Chinese Acad Sci, Xian Inst Opt & Precis Mech, State Key Lab Transient Opt & Photon, Ctr OPT IMagery Anal & Learning, Xian 710119, Peoples R China
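Funding
US National Science Foundation; Specialized Research Fund for the Doctoral Program of Higher Education of China (Ministry of Education)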
Keywords
Deep networks; eye fixation prediction; saliency detection; stacked denoising autoencoders (SDAEs); VISUAL SALIENCY; OBJECT DETECTION; RETRIEVAL; ATTENTION; AUTOENCODERS; FRAMEWORK; MODEL
DOI
10.1109/TCYB.2015.2404432
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Saliency detection models, which aim to quantitatively predict the locations in the visual field that attract human eye fixations, have received increasing research interest in recent years. Unlike traditional methods that rely on hand-designed features and contrast inference mechanisms, this paper proposes a novel framework to learn saliency detection models from raw image data using deep networks. The proposed framework consists mainly of two learning stages. In the first learning stage, we develop a stacked denoising autoencoder (SDAE) model to learn robust, representative features from raw image data in an unsupervised manner. The second learning stage aims to jointly learn optimal mechanisms to capture the intrinsic mutual patterns as the feature contrast and to integrate them for final saliency prediction. Given as input pairs of a center patch and its surrounding patches, each represented by the features learned in the first stage, an SDAE network is trained under the supervision of eye fixation labels, which achieves contrast inference and contrast integration simultaneously. Experiments on three publicly available eye tracking benchmarks and comparisons with 16 state-of-the-art approaches demonstrate the effectiveness of the proposed framework.
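The first learning stage described in the abstract (unsupervised feature learning with stacked denoising autoencoders) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the toy binary "patches", the layer sizes, the masking-noise level, the learning rate, the tied weights, and the squared-error loss are all assumptions chosen only to keep the example short. The function names (`train_dae_layer`, `encode`, `make_toy_patches`) are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_dae_layer(X, n_hidden, noise=0.3, lr=0.5, epochs=200, seed=0):
    """Train one denoising autoencoder layer with tied weights (assumed setup).

    X has shape (n_samples, n_features) with values in [0, 1].
    Returns (W, b_hidden, b_visible, losses).
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W = rng.normal(0.0, 0.1, size=(d, n_hidden))
    b_h = np.zeros(n_hidden)
    b_v = np.zeros(d)
    losses = []
    for _ in range(epochs):
        # Masking noise: zero out roughly a `noise` fraction of the inputs.
        mask = rng.random(X.shape) >= noise
        X_noisy = X * mask
        H = sigmoid(X_noisy @ W + b_h)   # encode the corrupted input
        R = sigmoid(H @ W.T + b_v)       # decode with the transposed weights
        err = R - X                      # target is the clean input
        losses.append(float(np.mean(err ** 2)))
        # Gradients of 0.5 * squared error, averaged over samples.
        dR = err * R * (1.0 - R) / n
        dH = (dR @ W) * H * (1.0 - H)
        grad_W = X_noisy.T @ dH + (H.T @ dR).T  # encoder + decoder contributions
        W -= lr * grad_W
        b_h -= lr * dH.sum(axis=0)
        b_v -= lr * dR.sum(axis=0)
    return W, b_h, b_v, losses

def encode(X, W, b_h):
    """Map clean inputs to the learned hidden representation."""
    return sigmoid(X @ W + b_h)

def make_toy_patches(n=200, dim=16, n_protos=4, seed=1):
    """Hypothetical stand-in for raw image patches: noisy-free binary prototypes."""
    rng = np.random.default_rng(seed)
    protos = (rng.random((n_protos, dim)) > 0.5).astype(float)
    return protos[rng.integers(0, n_protos, size=n)]

# Greedy layer-wise stacking: train layer 1 on the data, layer 2 on its codes.
X = make_toy_patches()
W1, bh1, _, losses1 = train_dae_layer(X, n_hidden=8)
H1 = encode(X, W1, bh1)
W2, bh2, _, losses2 = train_dae_layer(H1, n_hidden=4)
features = encode(H1, W2, bh2)
```

In the paper's second stage, features of this kind for a center patch and its surrounding patches would be paired and fed to a further network trained against eye fixation labels; that supervised step is not shown here.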
Pages: 487-498
Number of pages: 12