Top-Down Visual Saliency via Joint CRF and Dictionary Learning

被引：98

作者：

Yang, Jimei ^{[1
]}

Yang, Ming-Hsuan ^{[2
]}

机构：

[1] Adobe Res, San Jose, CA 95110 USA

[2] Univ Calif Merced, Sch Engn, Merced, CA USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2017年 / 39卷 / 03期

基金：

美国国家科学基金会;

关键词：

Visual saliency; top-down visual saliency; fixation prediction; dictionary learning and conditional random fields; FEATURES; ATTENTION;

D O I：

10.1109/TPAMI.2016.2547384

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.

引用

页码：576 / 588

页数：13

共 50 条

[1] Top-Down Visual Saliency via Joint CRF and Dictionary Learning
Yang, Jimei
Yang, Ming-Hsuan
2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2296 - 2303
[2] Top-down Visual Saliency Guided by Captions
Ramanishka, Vasili
Das, Abir
Zhang, Jianming
Saenko, Kate
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3135 - 3144
[3] Top-down saliency detection driven by visual classification
Murabito, Francesca
Spampinato, Concetto
Palazzo, Simone
Giordano, Daniela
Pogorelov, Konstantin
Riegler, Michael
COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 172 : 67 - 76
[4] Top-Down Saliency Detection via Contextual Pooling
Jun Zhu
Yuanyuan Qiu
Rui Zhang
Jun Huang
Wenjun Zhang
Journal of Signal Processing Systems, 2014, 74 : 33 - 46
[5] Top-Down Saliency Detection via Contextual Pooling
Zhu, Jun
Qiu, Yuanyuan
Zhang, Rui
Huang, Jun
Zhang, Wenjun
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 74 (01): : 33 - 46
[6] Visual saliency detection via integrating bottom-up and top-down information
Shariatmadar, Zahra Sadat
Faez, Karim
OPTIK, 2019, 178 : 1195 - 1207
[7] Bottom-up saliency and top-down learning in the primary visual cortex of monkeys
Yan, Yin
Zhaoping, Li
Li, Wu
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (41) : 10499 - 10504
[8] Top-down Saliency Detection via Hidden Semantic Information
Yuan, Lingfeng
Du, Yuliang
Wan, Weibing
ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 116 - +
[9] Exploring Duality in Visual Question-Driven Top-Down Saliency
He, Shengfeng
Han, Chu
Han, Guoqiang
Qin, Jing
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2672 - 2679
[10] Top-down Gamma Saliency - Learning to Search for Objects in Complex Scenes
Burt, Ryan
Principe, Jose C.
2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,

← 1 2 3 4 5 →