IENet: inheritance enhancement network for video salient object detection

Cited by: 0
Authors
Jiang, Tao [1]
Wang, Yi [2]
Hou, Feng [1]
Wang, Ruili [1]
Affiliations
[1] Massey Univ, Sch Math & Computat Sci, Auckland 0632, New Zealand
[2] Dalian Univ Technol DUT, RU Int Sch Informat Sci & Engn, Dalian 116000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Video salient object detection; Feature fusion; Visual transformer; Frame-aware temporal relationships; OPTIMIZATION; CUES;
DOI
10.1007/s11042-024-18408-4
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Effective utilization of spatiotemporal information is essential for improving the accuracy and robustness of Video Salient Object Detection (V-SOD). However, current methods do not fully utilize historical frame information, which results in insufficient integration of complementary semantic information. To address this issue, we propose a novel Transformer-based Inheritance Enhancement Network (IENet). The core of IENet is a Heritable Multi-Frame Attention (HMA) module, which exploits long-term context and frame-aware temporal modeling during feature extraction through unidirectional cross-frame enhancement. In contrast to existing methods, our heritable strategy is based on a unidirectional inheritance model that uses attention maps to keep the information propagation for each frame consistent and orderly, avoiding additional interference. Furthermore, we propose an auxiliary attention loss that uses the inherited attention maps to direct the network to focus more on target regions. Experimental results on five popular benchmark datasets show the effectiveness of IENet in handling challenging scenes. For instance, on VOS and DAVSOD, our method achieves MAE values of 0.042% and 0.070%, respectively, compared with other competitive models. In particular, IENet excels at inheriting finer details from historical frames, even in complex environments. The module and predicted maps are publicly available at https://github.com/TOMMYWHY/IENet
Pages: 72007-72026
Page count: 20