IENet: inheritance enhancement network for video salient object detection

Authors
Jiang, Tao [1]
Wang, Yi [2]
Hou, Feng [1]
Wang, Ruili [1]
Affiliations
[1] Massey Univ, Sch Math & Computat Sci, Auckland 0632, New Zealand
[2] Dalian Univ Technol DUT, RU Int Sch Informat Sci & Engn, Dalian 116000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Video salient object detection; Feature fusion; Visual transformer; Frame-aware temporal relationships; OPTIMIZATION; CUES
DOI
10.1007/s11042-024-18408-4
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Effective utilization of spatiotemporal information is essential for improving the accuracy and robustness of Video Salient Object Detection (V-SOD). However, current methods do not fully utilize historical frame information, which results in insufficient integration of complementary semantic information. To address this issue, we propose a novel Transformer-based Inheritance Enhancement Network (IENet). The core of IENet is a Heritable Multi-Frame Attention (HMA) module, which exploits long-term context and frame-aware temporal modeling during feature extraction through unidirectional cross-frame enhancement. In contrast to existing methods, our heritable strategy is based on a unidirectional inheritance model that uses attention maps, which ensures that information propagates to each frame in a consistent and orderly manner and avoids additional interference. Furthermore, we propose an auxiliary attention loss that uses the inherited attention maps to direct the network to focus more on target regions. Experimental results on five popular benchmark datasets demonstrate the effectiveness of IENet in handling challenging scenes. For instance, on VOS and DAVSOD, our method achieves MAE values of 0.042% and 0.070%, respectively, in comparison with other competitive models. In particular, IENet excels at inheriting finer details from historical frames, even in complex environments. The module and predicted maps are publicly available at https://github.com/TOMMYWHY/IENet
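For context, MAE in salient object detection is conventionally the mean per-pixel absolute difference between a predicted saliency map and its ground-truth mask, with both scaled to [0, 1]. A minimal sketch of that metric follows; the function name `saliency_mae` and the toy maps are illustrative assumptions and are not taken from the IENet repository.

```python
def saliency_mae(pred, gt):
    """Mean absolute error between two same-sized 2D maps with values in [0, 1].

    `pred` and `gt` are lists of rows (hypothetical input format for this sketch).
    """
    # Both maps must have identical shapes so pixels can be compared one to one.
    if len(pred) != len(gt) or any(len(p) != len(g) for p, g in zip(pred, gt)):
        raise ValueError("pred and gt must have the same shape")

    total = 0.0
    count = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            total += abs(p - g)  # per-pixel absolute difference
            count += 1

    if count == 0:
        raise ValueError("maps must contain at least one pixel")
    return total / count


# Toy 2x2 example: a predicted saliency map and its binary ground-truth mask.
pred = [[0.9, 0.1], [0.8, 0.0]]
gt = [[1.0, 0.0], [1.0, 0.0]]
score = saliency_mae(pred, gt)
```

Lower values indicate closer agreement with the ground truth; on a benchmark, the per-frame scores are typically averaged over all frames of a dataset.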
Pages: 72007-72026
Page count: 20