A novel spatiotemporal attention enhanced discriminative network for video salient object detection

Cited by: 0
Authors
Bing Liu
Kezhou Mu
Mingzhu Xu
Fangyuan Wang
Lei Feng
Affiliation
[1] Harbin Institute of Technology, School of Electronics and Information Engineering
Source
Applied Intelligence | 2022 / Vol. 52
Keywords
Video salient object detection; Attention mechanism; Multiscale; CSAtt-ConvLSTM
DOI
Not available
Abstract
In contrast to image salient object detection, on which substantial progress has been made, video salient object detection remains a considerable challenge. Not all features are useful for salient object detection, and some even cause interference. In this paper, we propose a novel multiscale spatiotemporal ConvLSTM model based on an attention mechanism, which introduces spatial and channel-wise attention mechanisms and improves the network's capability to extract high-level semantic information and low-level spatial structural features. First, to obtain more effective spatiotemporal information, a ConvLSTM module embedded with an attention mechanism (CSAtt-ConvLSTM) is designed at the higher layers of the network to weight the salient features of the extracted spatiotemporal consistency. Second, a multiscale attention (MSA) module for distinguishing features is designed, which introduces two attention mechanisms: channel-wise attention (CA) units and spatial-wise attention (SA) units. The CA units are applied to the high-level feature maps obtained by the CSAtt-ConvLSTM module and the SA units to the shallow feature maps; their outputs are then fused into the final output feature maps. Extensive experiments on multiple datasets verified the effectiveness of the proposed model, which runs at a real-time speed of 20 fps on a single GPU.
Pages: 5922 - 5937
Page count: 15
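As a rough illustration of the two attention units named in the abstract, the sketch below gates a high-level feature map with channel-wise attention and a shallow feature map with spatial-wise attention, then fuses the two. This is not the authors' implementation: the squeeze-and-excitation style channel gate, the 1x1-projection spatial gate, the element-wise sum used for fusion, and all names (`channel_attention`, `spatial_attention`, `fuse`, the weight arrays) are assumptions chosen for brevity, and the ConvLSTM stage is omitted entirely.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat, w1, w2):
    """Channel-wise gate (assumed squeeze-and-excitation form).

    feat: (C, H, W); w1: (C // r, C); w2: (C, C // r).
    """
    squeezed = feat.mean(axis=(1, 2))          # global average pool -> (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)    # bottleneck + ReLU -> (C // r,)
    gate = sigmoid(w2 @ hidden)                # per-channel weight in (0, 1)
    return feat * gate[:, None, None]

def spatial_attention(feat, w):
    """Spatial gate (assumed 1x1 projection to a single-channel map).

    feat: (C, H, W); w: (C,).
    """
    score = np.tensordot(w, feat, axes=(0, 0)) # weighted channel sum -> (H, W)
    gate = sigmoid(score)                      # per-pixel weight in (0, 1)
    return feat * gate[None, :, :]

def fuse(high, low):
    """Fusion by element-wise sum (an assumption; shapes must match)."""
    return high + low

# Hypothetical toy inputs: random features and random (untrained) weights.
rng = np.random.default_rng(0)
C, H, W, r = 8, 4, 4, 2
high = rng.standard_normal((C, H, W))  # stands in for CSAtt-ConvLSTM output
low = rng.standard_normal((C, H, W))   # stands in for a shallow feature map
w1 = rng.standard_normal((C // r, C))
w2 = rng.standard_normal((C, C // r))
ws = rng.standard_normal(C)

out = fuse(channel_attention(high, w1, w2), spatial_attention(low, ws))
print(out.shape)
```

In a real network the high-level map would usually have a lower spatial resolution than the shallow one and would need upsampling before fusion; the sketch sidesteps this by giving both maps the same shape.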