A temporal attention based appearance model for video object segmentation

被引：2

作者：

Wang, Hui ^{[1
]}

Liu, Weibin ^{[1
]}

Xing, Weiwei ^{[2
]}

机构：

[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China

[2] Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China

来源：

APPLIED INTELLIGENCE | 2022年 / 52卷 / 02期

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

Video object segmentation; Convolutional neural networks; Appearance model; Mixture loss;

D O I：

10.1007/s10489-021-02547-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

More and more researchers have recently paid attention to video object segmentation because it is an important building block for numerous computer vision applications. Although many algorithms promote its development, there are still some open challenges. Efficient and robust pipelines are needed to address appearance changes and the distraction from similar background objects in the video object segmentation. This paper proposes a novel neural network that integrates a temporal attention based appearance model and a boundary-aware loss. The appearance model fuses the appearance information of the first frame, the previous frame, and the current frame in the feature space, which assists the proposed method to learn a discriminative and robust target representation and avoid the drift problem of traditional propagation schemes. Moreover, the boundary-aware loss is employed for network training. Equipped with the boundary-aware loss, the proposed method achieves more accurate segmentation results with clear boundaries. The proposed method is compared with several recent state-of-the-art algorithms on popular benchmark datasets. Comprehensive experiments show that the proposed method achieves favorable performance with a high frame rate.

引用

页码：2290 / 2300

页数：11

共 50 条

[21] Feedback Learning Gaussian Appearance Network for Video Object Segmentation
Wang L.
Song H.-H.
Zhang K.-H.
Liu Q.-S.
Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (03): : 834 - 842
[22] Fast Appearance Modeling for Automatic Primary Video Object Segmentation
Yang, Jiong
Price, Brian
Shen, Xiaohui
Lin, Zhe
Yuan, Junsong
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) : 503 - 515
[23] Video object segmentation with a Potts model
Zhao, Jieyu
Wang, Xiaoquan
ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 742 - +
[24] Multilevel Model for Video Object Segmentation Based on Supervision Optimization
Chen, Yadang
Hao, Chuanyan
Liu, Alex X.
Wu, Enhua
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (08) : 1934 - 1945
[25] Temporal Collection and Distribution for Referring Video Object Segmentation
Tang, Jiajin
Zheng, Ge
Yang, Sibei
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15420 - 15430
[26] Coherency Based Spatio-Temporal Saliency Detection for Video Object Segmentation
Mahapatra, Dwarikanath
Gilani, Syed Omer
Saini, Mukesh Kumar
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (03) : 454 - 462
[27] A New Spatio-Temporal Saliency-Based Video Object Segmentation
Zhengzheng Tu
Andrew Abel
Lei Zhang
Bin Luo
Amir Hussain
Cognitive Computation, 2016, 8 : 629 - 647
[28] Temporal segmentation of video objects for hierarchical object-based motion description
Fu, Y
Ekin, A
Tekalp, AM
Mehrotra, R
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (02) : 135 - 145
[29] Automatic video object segmentation algorithm based on spatio-temporal information
Zhang, Xiao-Bo
Liu, Wen-Yao
Lu, Da-Wei
Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2008, 19 (03): : 384 - 387
[30] A New Spatio-Temporal Saliency-Based Video Object Segmentation
Tu, Zhengzheng
Abel, Andrew
Zhang, Lei
Luo, Bin
Hussain, Amir
COGNITIVE COMPUTATION, 2016, 8 (04) : 629 - 647

← 1 2 3 4 5 →