Coherence-aware context aggregator for fast video object segmentation

被引：23

作者：

Lan, Meng ^{[1
]}

Zhang, Jing ^{[2
]}

Wang, Zengmao ^{[1
]}

机构：

[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China

[2] Univ Sydney, Sch Comp Sci, Camperdown, Australia

来源：

PATTERN RECOGNITION | 2023年 / 136卷

基金：

中国国家自然科学基金;

关键词：

Video object segmentation; Semi-supervised learning; Spatio-temporal representation; Context;

D O I：

10.1016/j.patcog.2022.109214

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semi-supervised video object segmentation (VOS) is a highly challenging problem that has attracted much research attention in recent years. Temporal context plays an important role in VOS by providing object clues from the past frames. However, most of the prevailing methods directly use the predicted temporal results to guide the segmentation of the current frame, while ignoring the coherence of tem-poral context, which may be misleading and degrade the performance. In this paper, we propose a novel model named Coherence-aware Context Aggregator (CCA) for VOS, which consists of three modules. First, a coherence-aware module (CAM) is proposed to evaluate the coherence of the predicted result of the current frame and then fuses the coherent features to update the temporal context. CAM can determine whether the prediction is accurate, thus guiding the update of the temporal context and avoiding the introduction of erroneous information. Second, we devise a spatio-temporal context aggregation (STCA) module to aggregate the temporal context with the spatial feature of the current frame to learn a robust and discriminative target representation in the decoder part. Third, we design a refinement module to refine the coarse feature generated from the STCA module for more precise segmentation. Additionally, CCA uses a cropping strategy and takes small-size images as input, thus making it computationally ef-ficient and achieving a real-time running speed. Extensive experiments on four challenging benchmarks show that CCA achieves a better trade-off between efficiency and accuracy compared to state-of-the-art methods. The code will be public. (c) 2022 Elsevier Ltd. All rights reserved.

引用

页数：12

共 50 条

[21] Design of coherence-aware channel indication and prediction for rate adaptation
Yongjiu Du
Pengda Huang
Yan Shi
Dinesh Rajan
Joseph Camp
EURASIP Journal on Wireless Communications and Networking, 2019
[22] Temporal Context Enhanced Referring Video Object Segmentation
Hu, Xiao
Hampiholi, Basavaraj
Neumann, Heiko
Lang, Jochen
2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, 2024, : 5562 - 5571
[23] Design of coherence-aware channel indication and prediction for rate adaptation
Du, Yongjiu
Huang, Pengda
Shi, Yan
Rajan, Dinesh
Camp, Joseph
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2019, 2019 (01)
[24] Online Meta Adaptation for Fast Video Object Segmentation
Xiao, Huaxin
Kang, Bingyi
Liu, Yu
Zhang, Maojun
Feng, Jiashi
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) : 1205 - 1217
[25] Region Aware Video Object Segmentation With Deep Motion Modeling
Miao, Bo
Bennamoun, Mohammed
Gao, Yongsheng
Mian, Ajmal
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2639 - 2651
[26] Guided Co-Segmentation Network for Fast Video Object Segmentation
Liu, Weide
Lin, Guosheng
Zhang, Tianyi
Liu, Zichuan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (04) : 1607 - 1617
[27] Fast Video Object Segmentation Based on Siamese Networks
Fu L.-H.
Zhao Y.
Sun X.-W.
Lu Z.-S.
Wang D.
Yang H.-X.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (04): : 625 - 630
[28] FAST VIDEO OBJECT SEGMENTATION VIA DYNAMIC YOLACT
Meng, Tianfang
Zhang, Wenqiang
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2400 - 2404
[29] Fast pixel-matching for video object segmentation
Yu, Siyue
Xiao, Jimin
Zhang, Bingfeng
Lim, Eng Gee
Zhao, Yao
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 98
[30] Quality-aware pattern diffusion for video object segmentation
Zhou, Chuanwei
Xu, Chunyan
Li, Jun
Cui, Zhen
Yang, Jian
NEUROCOMPUTING, 2023, 528 : 148 - 159

← 1 2 3 4 5 →