Coherence-aware context aggregator for fast video object segmentation

被引:23
|
作者
Lan, Meng [1 ]
Zhang, Jing [2 ]
Wang, Zengmao [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Univ Sydney, Sch Comp Sci, Camperdown, Australia
基金
中国国家自然科学基金;
关键词
Video object segmentation; Semi-supervised learning; Spatio-temporal representation; Context;
D O I
10.1016/j.patcog.2022.109214
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised video object segmentation (VOS) is a highly challenging problem that has attracted much research attention in recent years. Temporal context plays an important role in VOS by providing object clues from the past frames. However, most of the prevailing methods directly use the predicted temporal results to guide the segmentation of the current frame, while ignoring the coherence of tem-poral context, which may be misleading and degrade the performance. In this paper, we propose a novel model named Coherence-aware Context Aggregator (CCA) for VOS, which consists of three modules. First, a coherence-aware module (CAM) is proposed to evaluate the coherence of the predicted result of the current frame and then fuses the coherent features to update the temporal context. CAM can determine whether the prediction is accurate, thus guiding the update of the temporal context and avoiding the introduction of erroneous information. Second, we devise a spatio-temporal context aggregation (STCA) module to aggregate the temporal context with the spatial feature of the current frame to learn a robust and discriminative target representation in the decoder part. Third, we design a refinement module to refine the coarse feature generated from the STCA module for more precise segmentation. Additionally, CCA uses a cropping strategy and takes small-size images as input, thus making it computationally ef-ficient and achieving a real-time running speed. Extensive experiments on four challenging benchmarks show that CCA achieves a better trade-off between efficiency and accuracy compared to state-of-the-art methods. The code will be public. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Fast Video Object Segmentation Using Markov Random Field
    Mak, Chun-Man
    Cham, Wai-Kuen
    2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 347 - 352
  • [42] Fast Appearance Modeling for Automatic Primary Video Object Segmentation
    Yang, Jiong
    Price, Brian
    Shen, Xiaohui
    Lin, Zhe
    Yuan, Junsong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) : 503 - 515
  • [43] Unsupervised video object segmentation with distractor-aware online adaptation
    Wang, Ye
    Choi, Jongmoo
    Chen, Yueru
    Li, Siyang
    Huang, Qin
    Zhang, Kaitai
    Lee, Ming-Sui
    Kuo, C-C Jay
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [44] Fast Video Object Segmentation via Dynamic Targeting Network
    Zhang, Lu
    Lin, Zhe
    Zhang, Jianming
    Lu, Huchuan
    He, You
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5581 - 5590
  • [45] RANet: Ranking Attention Network for Fast Video Object Segmentation
    Wang, Ziqin
    Xu, Jun
    Liu, Li
    Zhu, Fan
    Shao, Ling
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3977 - 3986
  • [46] Fast Interactive Video Object Segmentation with Graph Neural Networks
    Varga, Viktor
    Lorincz, Andras
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [47] Fast texture segmentation for object-oriented video coding
    Lavagetto, Fabio
    Cocurullo, Fabio
    European transactions on telecommunications and related technologies, 1995, 6 (03): : 241 - 253
  • [48] Context-aware Method for Small Object Segmentation in Road Scenes
    Wang, Haitao
    Chen, Guang
    Li, Zhijun
    Peng, Jianyi
    Liu, Zhengfa
    Wu, Ya
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 238 - 243
  • [49] Service Ratio-Optimal, Content Coherence-Aware Data Push Systems
    Liaskos, Christos
    Tsioliaridou, Ageliki
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2016, 6 (04)
  • [50] Spatiotemporal context-aware network for video salient object detection
    Tianyou Chen
    Jin Xiao
    Xiaoguang Hu
    Guofeng Zhang
    Shaojie Wang
    Neural Computing and Applications, 2022, 34 : 16861 - 16877