Top-Down Visual Saliency via Joint CRF and Dictionary Learning

被引:96
|
作者
Yang, Jimei [1 ]
Yang, Ming-Hsuan [2 ]
机构
[1] Adobe Res, San Jose, CA 95110 USA
[2] Univ Calif Merced, Sch Engn, Merced, CA USA
基金
美国国家科学基金会;
关键词
Visual saliency; top-down visual saliency; fixation prediction; dictionary learning and conditional random fields; FEATURES; ATTENTION;
D O I
10.1109/TPAMI.2016.2547384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.
引用
收藏
页码:576 / 588
页数:13
相关论文
共 50 条
  • [1] Top-Down Visual Saliency via Joint CRF and Dictionary Learning
    Yang, Jimei
    Yang, Ming-Hsuan
    [J]. 2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2296 - 2303
  • [2] Top-down Visual Saliency Guided by Captions
    Ramanishka, Vasili
    Das, Abir
    Zhang, Jianming
    Saenko, Kate
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3135 - 3144
  • [3] Top-down saliency detection driven by visual classification
    Murabito, Francesca
    Spampinato, Concetto
    Palazzo, Simone
    Giordano, Daniela
    Pogorelov, Konstantin
    Riegler, Michael
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 172 : 67 - 76
  • [4] Top-Down Saliency Detection via Contextual Pooling
    Zhu, Jun
    Qiu, Yuanyuan
    Zhang, Rui
    Huang, Jun
    Zhang, Wenjun
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 74 (01): : 33 - 46
  • [5] Top-Down Saliency Detection via Contextual Pooling
    Jun Zhu
    Yuanyuan Qiu
    Rui Zhang
    Jun Huang
    Wenjun Zhang
    [J]. Journal of Signal Processing Systems, 2014, 74 : 33 - 46
  • [6] Visual saliency detection via integrating bottom-up and top-down information
    Shariatmadar, Zahra Sadat
    Faez, Karim
    [J]. OPTIK, 2019, 178 : 1195 - 1207
  • [7] Bottom-up saliency and top-down learning in the primary visual cortex of monkeys
    Yan, Yin
    Zhaoping, Li
    Li, Wu
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (41) : 10499 - 10504
  • [8] Top-down Saliency Detection via Hidden Semantic Information
    Yuan, Lingfeng
    Du, Yuliang
    Wan, Weibing
    [J]. ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 116 - +
  • [9] Exploring Duality in Visual Question-Driven Top-Down Saliency
    He, Shengfeng
    Han, Chu
    Han, Guoqiang
    Qin, Jing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2672 - 2679
  • [10] Top-down Gamma Saliency - Learning to Search for Objects in Complex Scenes
    Burt, Ryan
    Principe, Jose C.
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,