A novel spatiotemporal attention enhanced discriminative network for video salient object detection

被引:0
|
作者
Bing Liu
Kezhou Mu
Mingzhu Xu
Fangyuan Wang
Lei Feng
机构
[1] Harbin Institute of Technology,School of Electronics and Information Engineering
来源
Applied Intelligence | 2022年 / 52卷
关键词
Video salient object detection; Attention mechanism; Multiscale; CSAtt-ConvLSTM;
D O I
暂无
中图分类号
学科分类号
摘要
In contrast to image salient object detection, on which many achievements have been made, video salient object detection remains a considerable challenge. Not all features are useful in salient object detection, and some even cause interferences. In this paper, we propose a novel multiscale spatiotemporal ConvLSTM model based on an attention mechanism, which introduces space-based and channel-based attention mechanisms and improves the network’s capability to extract high-level semantic information and low-level spatial structural features. First, to obtain more effective spatiotemporal information, a ConvLSTM module embedded with an attention mechanism (CSAtt-ConvLSTM) is designed at higher layers of the network to weight salient features of the extracted spatiotemporal consistency. Second, a multiscale attention (MSA) module for distinguishing features is designed, which introduces two attention mechanisms: channel-wise attention (CA) units and spatial-wise attention (SA) units. The CA and SA units are used after high-level feature mapping obtained by the CSAtt-ConvLSTM module and shallow feature mapping, respectively, and then their outputs are fused as final output feature maps. A large number of experiments on multiple datasets verified the effectiveness of our proposed model, which reached a real-time speed on a single GPU of 20 fps.
引用
收藏
页码:5922 / 5937
页数:15
相关论文
共 50 条
  • [1] A novel spatiotemporal attention enhanced discriminative network for video salient object detection
    Liu, Bing
    Mu, Kezhou
    Xu, Mingzhu
    Wang, Fangyuan
    Feng, Lei
    APPLIED INTELLIGENCE, 2022, 52 (06) : 5922 - 5937
  • [2] Video salient object detection via spatiotemporal attention neural networks
    Tang, Yi
    Zou, Wenbin
    Hua, Yang
    Jin, Zhi
    Li, Xia
    NEUROCOMPUTING, 2020, 377 (377) : 27 - 37
  • [3] Flow driven attention network for video salient object detection
    Zhou, Feng
    Shuai, Hui
    Liu, Qingshan
    Guo, Guodong
    IET IMAGE PROCESSING, 2020, 14 (06) : 997 - 1004
  • [4] Video salient object detection using dual-stream spatiotemporal attention
    Xu, Chenchu
    Gao, Zhifan
    Zhang, Heye
    Li, Shuo
    de Albuquerque, Victor Hugo C.
    APPLIED SOFT COMPUTING, 2021, 108
  • [5] Video Salient Object Detection Network with Bidirectional Memory and Spatiotemporal Constraints
    Wang, Hongyu
    Mu, Nan
    Zhang, Yu
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2781 - 2786
  • [6] Spatiotemporal context-aware network for video salient object detection
    Tianyou Chen
    Jin Xiao
    Xiaoguang Hu
    Guofeng Zhang
    Shaojie Wang
    Neural Computing and Applications, 2022, 34 : 16861 - 16877
  • [7] Spatiotemporal context-aware network for video salient object detection
    Chen, Tianyou
    Xiao, Jin
    Hu, Xiaoguang
    Zhang, Guofeng
    Wang, Shaojie
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (19): : 16861 - 16877
  • [8] DS-Net: Dynamic spatiotemporal network for video salient object detection
    Liu, Jing
    Wang, Jiaxiang
    Wang, Weikang
    Su, Yuting
    DIGITAL SIGNAL PROCESSING, 2022, 130
  • [9] Salient Object Detection based on Spatiotemporal Attention Models
    Tapu, Ruxandra
    Zaharia, Titus
    2013 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2013, : 39 - 42
  • [10] Salient Object Detection With Spatiotemporal Background Priors for Video
    Xi, Tao
    Zhao, Wei
    Wang, Han
    Lin, Weisi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (07) : 3425 - 3436