Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video

被引:106
|
作者
Li, Jia [2 ,3 ]
Tian, Yonghong [1 ]
Huang, Tiejun [1 ]
Gao, Wen [1 ]
机构
[1] Peking Univ, Natl Engn Lab Video Technol, Beijing 100871, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
关键词
Visual saliency; Probabilistic framework; Visual search tasks; Multi-task learning; ATTENTION; MODEL;
D O I
10.1007/s11263-010-0354-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a probabilistic multi-task learning approach for visual saliency estimation in video. In our approach, the problem of visual saliency estimation is modeled by simultaneously considering the stimulus-driven and task-related factors in a probabilistic framework. In this framework, a stimulus-driven component simulates the low-level processes in human vision system using multi-scale wavelet decomposition and unbiased feature competition; while a task-related component simulates the high-level processes to bias the competition of the input features. Different from existing approaches, we propose a multi-task learning algorithm to learn the task-related "stimulus-saliency" mapping functions for each scene. The algorithm also learns various fusion strategies, which are used to integrate the stimulus-driven and task-related components to obtain the visual saliency. Extensive experiments were carried out on two public eye-fixation datasets and one regional saliency dataset. Experimental results show that our approach outperforms eight state-of-the-art approaches remarkably.
引用
收藏
页码:150 / 165
页数:16
相关论文
共 50 条
  • [1] Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video
    Jia Li
    Yonghong Tian
    Tiejun Huang
    Wen Gao
    [J]. International Journal of Computer Vision, 2010, 90 : 150 - 165
  • [2] Multi-Task Rank Learning for Visual Saliency Estimation
    Li, Jia
    Tian, Yonghong
    Huang, Tiejun
    Gao, Wen
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (05) : 623 - 636
  • [3] Cross-Domain Multi-task Learning for Object Detection and Saliency Estimation
    Khattar, Apoorv
    Hegde, Srinidhi
    Hebbalaguppe, Ramya
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3634 - 3643
  • [4] Saliency-Regularized Deep Multi-Task Learning
    Bai, Guangji
    Zhao, Liang
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 15 - 25
  • [5] Multi-Task Joint Learning of 3D Keypoint Saliency and Correspondence Estimation
    Wei, Guangshun
    Ma, Long
    Wang, Chen
    Desrosiers, Christian
    Zhou, Yuanfeng
    [J]. COMPUTER-AIDED DESIGN, 2021, 141
  • [6] Multi-task learning for video anomaly detection
    Chang, Xingya
    Zhang, Yuxin
    Xue, Dingyu
    Chen, Dongyue
    [J]. Journal of Visual Communication and Image Representation, 2022, 87
  • [7] Multi-task learning for video anomaly detection*
    Chang, Xingya
    Zhang, Yuxin
    Xue, Dingyu
    Chen, Dongyue
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [8] Generative Modeling for Multi-task Visual Learning
    Bao, Zhipeng
    Hebert, Martial
    Wang, Yu-Xiong
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [9] Probabilistic Joint Feature Selection for Multi-task Learning
    Xiong, Tao
    Bi, Jinbo
    Rao, Bharat
    Cherkassky, Vladimir
    [J]. PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 332 - +
  • [10] Multi-Task Probabilistic Regression With Overlap Maximization for Visual Tracking
    Feng, Zihang
    Yan, Liping
    Xia, Yuanqing
    Xiao, Bo
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7554 - 7564