A temporal attention based appearance model for video object segmentation

被引:2
|
作者
Wang, Hui [1 ]
Liu, Weibin [1 ]
Xing, Weiwei [2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Video object segmentation; Convolutional neural networks; Appearance model; Mixture loss;
D O I
10.1007/s10489-021-02547-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
More and more researchers have recently paid attention to video object segmentation because it is an important building block for numerous computer vision applications. Although many algorithms promote its development, there are still some open challenges. Efficient and robust pipelines are needed to address appearance changes and the distraction from similar background objects in the video object segmentation. This paper proposes a novel neural network that integrates a temporal attention based appearance model and a boundary-aware loss. The appearance model fuses the appearance information of the first frame, the previous frame, and the current frame in the feature space, which assists the proposed method to learn a discriminative and robust target representation and avoid the drift problem of traditional propagation schemes. Moreover, the boundary-aware loss is employed for network training. Equipped with the boundary-aware loss, the proposed method achieves more accurate segmentation results with clear boundaries. The proposed method is compared with several recent state-of-the-art algorithms on popular benchmark datasets. Comprehensive experiments show that the proposed method achieves favorable performance with a high frame rate.
引用
收藏
页码:2290 / 2300
页数:11
相关论文
共 50 条
  • [21] Feedback Learning Gaussian Appearance Network for Video Object Segmentation
    Wang L.
    Song H.-H.
    Zhang K.-H.
    Liu Q.-S.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (03): : 834 - 842
  • [22] Fast Appearance Modeling for Automatic Primary Video Object Segmentation
    Yang, Jiong
    Price, Brian
    Shen, Xiaohui
    Lin, Zhe
    Yuan, Junsong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) : 503 - 515
  • [23] Video object segmentation with a Potts model
    Zhao, Jieyu
    Wang, Xiaoquan
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 742 - +
  • [24] Multilevel Model for Video Object Segmentation Based on Supervision Optimization
    Chen, Yadang
    Hao, Chuanyan
    Liu, Alex X.
    Wu, Enhua
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (08) : 1934 - 1945
  • [25] Temporal Collection and Distribution for Referring Video Object Segmentation
    Tang, Jiajin
    Zheng, Ge
    Yang, Sibei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15420 - 15430
  • [26] Coherency Based Spatio-Temporal Saliency Detection for Video Object Segmentation
    Mahapatra, Dwarikanath
    Gilani, Syed Omer
    Saini, Mukesh Kumar
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (03) : 454 - 462
  • [27] A New Spatio-Temporal Saliency-Based Video Object Segmentation
    Zhengzheng Tu
    Andrew Abel
    Lei Zhang
    Bin Luo
    Amir Hussain
    Cognitive Computation, 2016, 8 : 629 - 647
  • [28] Temporal segmentation of video objects for hierarchical object-based motion description
    Fu, Y
    Ekin, A
    Tekalp, AM
    Mehrotra, R
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (02) : 135 - 145
  • [29] Automatic video object segmentation algorithm based on spatio-temporal information
    Zhang, Xiao-Bo
    Liu, Wen-Yao
    Lu, Da-Wei
    Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2008, 19 (03): : 384 - 387
  • [30] A New Spatio-Temporal Saliency-Based Video Object Segmentation
    Tu, Zhengzheng
    Abel, Andrew
    Zhang, Lei
    Luo, Bin
    Hussain, Amir
    COGNITIVE COMPUTATION, 2016, 8 (04) : 629 - 647