A novel spatiotemporal attention enhanced discriminative network for video salient object detection

被引:0
|
作者
Bing Liu
Kezhou Mu
Mingzhu Xu
Fangyuan Wang
Lei Feng
机构
[1] Harbin Institute of Technology,School of Electronics and Information Engineering
来源
Applied Intelligence | 2022年 / 52卷
关键词
Video salient object detection; Attention mechanism; Multiscale; CSAtt-ConvLSTM;
D O I
暂无
中图分类号
学科分类号
摘要
In contrast to image salient object detection, on which many achievements have been made, video salient object detection remains a considerable challenge. Not all features are useful in salient object detection, and some even cause interferences. In this paper, we propose a novel multiscale spatiotemporal ConvLSTM model based on an attention mechanism, which introduces space-based and channel-based attention mechanisms and improves the network’s capability to extract high-level semantic information and low-level spatial structural features. First, to obtain more effective spatiotemporal information, a ConvLSTM module embedded with an attention mechanism (CSAtt-ConvLSTM) is designed at higher layers of the network to weight salient features of the extracted spatiotemporal consistency. Second, a multiscale attention (MSA) module for distinguishing features is designed, which introduces two attention mechanisms: channel-wise attention (CA) units and spatial-wise attention (SA) units. The CA and SA units are used after high-level feature mapping obtained by the CSAtt-ConvLSTM module and shallow feature mapping, respectively, and then their outputs are fused as final output feature maps. A large number of experiments on multiple datasets verified the effectiveness of our proposed model, which reached a real-time speed on a single GPU of 20 fps.
引用
收藏
页码:5922 / 5937
页数:15
相关论文
共 50 条
  • [21] Multi-Stream Temporally Enhanced Network for Video Salient Object Detection
    Xu, Dan
    Ru, Jiale
    Shi, Jinlong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (01): : 85 - 104
  • [22] Pyramid Constrained Self-Attention Network for Fast Video Salient Object Detection
    Gu, Yuchao
    Wang, Lijuan
    Wang, Ziqin
    Liu, Yun
    Cheng, Ming-Ming
    Lu, Shao-Ping
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10869 - 10876
  • [23] GUIDANCE AND TEACHING NETWORK FOR VIDEO SALIENT OBJECT DETECTION
    Jiao, Yingxia
    Wang, Xiao
    Chou, Yu-Cheng
    Yang, Shouyuan
    Ji, Ge-Peng
    Zhu, Rong
    Gao, Ge
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2199 - 2203
  • [24] STA-Net: spatial-temporal attention network for video salient object detection
    Bi, Hong-Bo
    Lu, Di
    Zhu, Hui-Hui
    Yang, Li-Na
    Guan, Hua-Ping
    APPLIED INTELLIGENCE, 2021, 51 (06) : 3450 - 3459
  • [25] STA-Net: spatial-temporal attention network for video salient object detection
    Hong-Bo Bi
    Di Lu
    Hui-Hui Zhu
    Li-Na Yang
    Hua-Ping Guan
    Applied Intelligence, 2021, 51 : 3450 - 3459
  • [26] DFNet: Discriminative feature extraction and integration network for salient object detection
    Noori, Mehrdad
    Mohammadi, Sina
    Majelan, Sina Ghofrani
    Bahri, Ali
    Havaei, Mohammad
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 89
  • [27] Salient object detection based on backbone enhanced network
    Luo, Ronghua
    Huang, Huailin
    Wu, WeiZeng
    IMAGE AND VISION COMPUTING, 2020, 95
  • [28] Part-aware attention correctness for video salient object detection
    Liu, Ze-yu
    Liu, Jian-wei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [29] Multi-attention embedded network for salient object detection
    He, Wei
    Pan, Chen
    Xu, Wenlong
    Zhang, Ning
    SOFT COMPUTING, 2021, 25 (20) : 13053 - 13067
  • [30] Video Salient Object Detection via Contrastive Features and Attention Modules
    Chen, Yi-Wen
    Jin, Xiaojie
    Shen, Xiaohui
    Yang, Ming-Hsuan
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 536 - 545