Fast pixel-matching for video object segmentation

被引:8
|
作者
Yu, Siyue [1 ]
Xiao, Jimin [1 ]
Zhang, Bingfeng [1 ]
Lim, Eng Gee [1 ]
Zhao, Yao [2 ]
机构
[1] Xian Jiaotong Liverpool Univ, Suzhou, Jiangsu, Peoples R China
[2] Beijing Jiaotong Univ, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Non-local pixel matching; Mask-propagation; Encoder-decoder;
D O I
10.1016/j.image.2021.116373
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video object segmentation, aiming to segment the foreground objects given the annotation of the first frame, has been attracting increasing attentions. Many state-of-the-art approaches have achieved great performance by relying on online model updating or mask-propagation techniques. However, most online models require high computational cost due to model fine-tuning during inference. Most mask-propagation based models are faster but with relatively low performance due to failure to adapt to object appearance variation. In this paper, we are aiming to design a new model to make a good balance between speed and performance. We propose a model, called NPMCA-net, which directly localizes foreground objects based on mask-propagation and non-local technique by matching pixels in reference and target frames. Since we bring in information of both first and previous frames, our network is robust to large object appearance variation, and can better adapt to occlusions. Extensive experiments show that our approach can achieve a new state-of-the-art performance with a fast speed at the same time (86.5% IoU on DAVIS-2016 and 72.2% IoU on DAVIS-2017, with speed of 0.11s per frame) under the same level comparison. Source code is available at https://github.com/siyueyu/NPMCA-net.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Guided Co-Segmentation Network for Fast Video Object Segmentation
    Liu, Weide
    Lin, Guosheng
    Zhang, Tianyi
    Liu, Zichuan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (04) : 1607 - 1617
  • [22] Fast Video Object Segmentation Based on Siamese Networks
    Fu L.-H.
    Zhao Y.
    Sun X.-W.
    Lu Z.-S.
    Wang D.
    Yang H.-X.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (04): : 625 - 630
  • [23] FAST VIDEO OBJECT SEGMENTATION VIA DYNAMIC YOLACT
    Meng, Tianfang
    Zhang, Wenqiang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2400 - 2404
  • [24] Kernel based local matching network for video object segmentation
    Wang, Guoqiang
    Li, Lan
    Zhu, Min
    Zhao, Rui
    Zhang, Xiang
    MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [25] Video object segmentation through semantic visual words matching
    Hao, Chuanyan
    Chen, Yadang
    Wu, Weimin
    Yang, Zhi-Xin
    Wu, Enhua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 19591 - 19605
  • [26] Video object segmentation through semantic visual words matching
    Chuanyan Hao
    Yadang Chen
    Weimin Wu
    Zhi-Xin Yang
    Enhua Wu
    Multimedia Tools and Applications, 2023, 82 : 19591 - 19605
  • [27] Spectral Context Matching for Video Object Segmentation Under Occlusion
    Shi, Xiaoxue
    Lu, Yao
    Zhou, Tianfei
    Lei, Xiaoyu
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 337 - 346
  • [28] Complementary Coarse-to-Fine Matching for Video Object Segmentation
    Chen, Zhen
    Yang, Ming
    Zhang, Shiliang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [29] Kernel based local matching network for video object segmentation
    Guoqiang Wang
    Lan Li
    Min Zhu
    Rui Zhao
    Xiang Zhang
    Machine Vision and Applications, 2024, 35
  • [30] A New Approach to Video Steganography using Pixel Pattern Matching and Key Segmentation
    Wadekar, Himanshu
    Babu, Aishwarya
    Bharvadia, Vaishali
    Tatwadarshi, P. N.
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,