Motion-Aware Memory Network for Fast Video Salient Object Detection

Cited by: 3
Authors
Zhao, Xing [1]
Liang, Haoran [1]
Li, Peipei [2]
Sun, Guodao [1]
Zhao, Dongdong [1]
Liang, Ronghua [1]
He, Xiaofei [1]
Affiliations
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
[2] Zhejiang Univ Technol, Coll Mech Engn, Hangzhou 310023, Peoples R China
Keywords
Video salient object detection; salient object detection; memory network; feature fusion; OPTIMIZATION;
DOI
10.1109/TIP.2023.3348659
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Previous methods based on 3DCNN, convLSTM, or optical flow have achieved great success in video salient object detection (VSOD). However, these methods still suffer from high computational costs or poor quality of the generated saliency maps. To address this, we design a space-time memory (STM)-based network that employs a standard encoder-decoder architecture. During the encoding stage, we extract high-level temporal features from the current frame and its adjacent frames, which is more efficient and practical than methods reliant on optical flow. During the decoding stage, we introduce an effective fusion strategy for both spatial and temporal branches. The semantic information of the high-level features is used to improve the object details in the low-level features. Subsequently, spatiotemporal features are methodically derived step by step to reconstruct the saliency maps. Moreover, inspired by the boundary supervision prevalent in image salient object detection (ISOD), we design a motion-aware loss that predicts object boundary motion, and simultaneously perform multitask learning for VSOD and object motion prediction. This can further enhance the model's capability to accurately extract spatiotemporal features while maintaining object integrity. Extensive experiments on several datasets demonstrate the effectiveness of our method, which achieves state-of-the-art metrics on some of them. Our proposed model does not require optical flow or additional preprocessing, and can reach an impressive inference speed of nearly 100 FPS.
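The abstract names a space-time memory (STM) design but gives no implementation detail. As a rough illustration of the general memory-read idea only, and not the authors' network, the sketch below lets one query location in the current frame attend over keys stored from adjacent frames and returns the attention-weighted sum of the stored values. The function names (`softmax`, `dot`, `memory_read`), the dot-product similarity, and the toy two-dimensional vectors are all assumptions made for this example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def memory_read(query_key, memory_keys, memory_values):
    """Hypothetical memory read for a single query location.

    query_key:     key vector from the current frame (assumed)
    memory_keys:   key vectors stored from adjacent frames (assumed)
    memory_values: value vectors paired with memory_keys (assumed)
    Returns the attention-weighted value vector and the weights.
    """
    weights = softmax([dot(query_key, k) for k in memory_keys])
    dim = len(memory_values[0])
    read = [
        sum(w * v[d] for w, v in zip(weights, memory_values))
        for d in range(dim)
    ]
    return read, weights


# Toy data: the query is much closer to the first memory key,
# so the read-out is dominated by the first memory value.
memory_keys = [[1.0, 0.0], [0.0, 1.0]]
memory_values = [[10.0, 0.0], [0.0, 10.0]]
read, weights = memory_read([5.0, 0.0], memory_keys, memory_values)
```

In a real network the keys and values would be feature maps produced by the encoder, and the read would be computed for every spatial location at once; this sketch only shows the weighting step for one location.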
Pages: 709 - 721
Number of pages: 13
Related papers
50 in total
  • [31] Complementarity-Aware Attention Network for Salient Object Detection
    Li, Junxia
    Pan, Zefeng
    Liu, Qingshan
    Cui, Ying
    Sun, Yubao
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (02) : 873 - 886
  • [32] Boosting Feature-Aware Network for Salient Object Detection
    Zheng, Jianwei
    Gu, Yubin
    Feng, Yuchao
    Xu, Jinshan
    Zhang, Meiyu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 14 - 26
  • [33] A motion-aware ConvLSTM network for action recognition
    Mahshid Majd
    Reza Safabakhsh
    Applied Intelligence, 2019, 49 : 2515 - 2521
  • [34] MAU: A Motion-Aware Unit for Video Prediction and Beyond
    Chang, Zheng
    Zhang, Xinfeng
    Wang, Shanshe
    Siwei
    Ye, Yan
    Xiang, Xinguang
    Gao, Wen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [35] A motion-aware ConvLSTM network for action recognition
    Majd, Mahshid
    Safabakhsh, Reza
    APPLIED INTELLIGENCE, 2019, 49 (07) : 2515 - 2521
  • [36] MoVideo: Motion-Aware Video Generation with Diffusion Model
    Liang, Jingyun
    Fang, Yuchen
    Zhang, Kai
    Timofte, Radu
    Van Gool, Luc
    Ranjan, Rakesh
    COMPUTER VISION-ECCV 2024, PT XLIV, 2025, 15102 : 56 - 74
  • [37] FASA: Fast, Accurate, and Size-Aware Salient Object Detection
    Yildirim, Goekhan
    Suesstrunk, Sabine
    COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 514 - 528
  • [38] Motion-aware future frame prediction for video anomaly detection based on saliency perception
    Xu, Haitao
    Liu, Weibin
    Xing, Weiwei
    Wei, Xiang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (08) : 2121 - 2129
  • [39] Motion-aware vehicle detection in driving videos
    Kilicarslan, Mehmet
    Temel, Tansu
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2022, 30 (01) : 63 - 78
  • [40] Multi-Stream Attention-Aware Graph Convolution Network for Video Salient Object Detection
    Xu, Mingzhu
    Fu, Ping
    Liu, Bing
    Li, Junbao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4183 - 4197