Top-Down Visual Saliency via Joint CRF and Dictionary Learning

被引:98
|
作者
Yang, Jimei [1 ]
Yang, Ming-Hsuan [2 ]
机构
[1] Adobe Res, San Jose, CA 95110 USA
[2] Univ Calif Merced, Sch Engn, Merced, CA USA
基金
美国国家科学基金会;
关键词
Visual saliency; top-down visual saliency; fixation prediction; dictionary learning and conditional random fields; FEATURES; ATTENTION;
D O I
10.1109/TPAMI.2016.2547384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.
引用
收藏
页码:576 / 588
页数:13
相关论文
共 50 条
  • [1] Top-Down Visual Saliency via Joint CRF and Dictionary Learning
    Yang, Jimei
    Yang, Ming-Hsuan
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2296 - 2303
  • [2] Top-down Visual Saliency Guided by Captions
    Ramanishka, Vasili
    Das, Abir
    Zhang, Jianming
    Saenko, Kate
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3135 - 3144
  • [3] Top-down saliency detection driven by visual classification
    Murabito, Francesca
    Spampinato, Concetto
    Palazzo, Simone
    Giordano, Daniela
    Pogorelov, Konstantin
    Riegler, Michael
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 172 : 67 - 76
  • [4] Top-Down Saliency Detection via Contextual Pooling
    Jun Zhu
    Yuanyuan Qiu
    Rui Zhang
    Jun Huang
    Wenjun Zhang
    Journal of Signal Processing Systems, 2014, 74 : 33 - 46
  • [5] Top-Down Saliency Detection via Contextual Pooling
    Zhu, Jun
    Qiu, Yuanyuan
    Zhang, Rui
    Huang, Jun
    Zhang, Wenjun
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 74 (01): : 33 - 46
  • [6] Visual saliency detection via integrating bottom-up and top-down information
    Shariatmadar, Zahra Sadat
    Faez, Karim
    OPTIK, 2019, 178 : 1195 - 1207
  • [7] Bottom-up saliency and top-down learning in the primary visual cortex of monkeys
    Yan, Yin
    Zhaoping, Li
    Li, Wu
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (41) : 10499 - 10504
  • [8] Top-down Saliency Detection via Hidden Semantic Information
    Yuan, Lingfeng
    Du, Yuliang
    Wan, Weibing
    ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 116 - +
  • [9] Exploring Duality in Visual Question-Driven Top-Down Saliency
    He, Shengfeng
    Han, Chu
    Han, Guoqiang
    Qin, Jing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2672 - 2679
  • [10] Top-down Gamma Saliency - Learning to Search for Objects in Complex Scenes
    Burt, Ryan
    Principe, Jose C.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,