Real Time Video Object Segmentation in Compressed Domain

被引:27
|
作者
Tan, Zhentao [1 ,2 ]
Liu, Bin [1 ,2 ]
Chu, Qi [1 ,2 ]
Zhong, Hangshi [1 ,2 ]
Wu, Yue [3 ]
Li, Weihai [1 ,2 ]
Yu, Nenghai [1 ,2 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230026, Peoples R China
[2] Chinese Acad Sci, Key Lab Electromagnet Space Informat, Hefei 230026, Peoples R China
[3] Alibaba Grp, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Compressed domain; object segmentation; feature propagation; feature matching;
D O I
10.1109/TCSVT.2020.2971641
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Many of the recent methods for semi-supervised video object segmentation are still far from being applicable for real time applications due to their slow inference speed. Therefore, we explore a propagation based segmentation method in compressed domain to accelerate inference speed in this paper. In particular, we only extract the features of I-frames by traditional deep convolutional neural network and produce the features of P-frames through information flow propagation. In the process of feature propagation, we propose two effective components to enhance the representation ability of simply warped features in terms of appearance and location. Specifically, we propose a residual supplement module to supplement appearance information which is lost in direct warping and a spatial attention module that can mine extra spatial saliency to provide the location information of the specified object. Besides, we propose a metric based decoder module which consists of a feature match module and a multi-level refinement module to transform information from semantic representation to shape segmentation mask. Extensive experiments on several video datasets demonstrate that the proposed method can achieve comparable accuracy while much faster inference speed when compared to the state-of-the-art algorithms.
引用
收藏
页码:175 / 188
页数:14
相关论文
共 50 条
  • [41] Nonlinear Sampling Control Model for Real-time Video Sequences of Compressed Domain
    Cheng, Deqiang
    Jin, Yu
    Zhou, Ting
    Li, Wenjie
    [J]. CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009, : 4436 - +
  • [42] A transform domain approach to real-time foreground segmentation in video sequences
    Zhu, JH
    Schwartz, SC
    Liu, B
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 685 - 688
  • [43] Multi-Video-Object Segmentation based on SOFM Network for Compressed Video Sequences
    Fu Wenxiu
    Wang Lei
    Wang Xu
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2008, : 255 - 259
  • [44] Real-time steganography in compressed video
    Liu, Bin
    Liu, Fenlin
    Lu, Bin
    Luo, Xiangyang
    [J]. MULTIMEDIA CONTENT REPRESENTATION, CLASSIFICATION AND SECURITY, 2006, 4105 : 43 - 48
  • [45] Multi-Attention Network for Compressed Video Referring Object Segmentation
    Chen, Weidong
    Hong, Dexiang
    Qi, Yuankai
    Han, Zhenjun
    Wang, Shuhui
    Qing, Laiyun
    Huang, Qingming
    Li, Guorong
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4416 - 4425
  • [46] CNN Implementation of a Moving Object Segmentation Approach for Real-Time Video Surveillance
    Rodriguez-Fernandez, D.
    Vilarino, D. L.
    Pardo, X. M.
    [J]. 2008 11TH INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS, 2008, : 129 - 134
  • [47] Real-time and light-weighted unsupervised video object segmentation network
    Zhao, Zongji
    Zhao, Sanyuan
    Shen, Jianbing
    [J]. PATTERN RECOGNITION, 2021, 120
  • [48] Object Detection in Real Time Video
    Aote, Shailendra
    Tiwari, Pankhudi
    Butolia, Manjiri
    Suradkar, Aaishwarya
    Gupta, Shrasti
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 95 - 97
  • [49] Real-time video segmentation
    Dibos, F
    Pelletier, S
    Koep, G
    [J]. AVSS 2005: ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, 2005, : 382 - 387
  • [50] Real-time object segmentation and coding for selective-quality video communications
    Challapali, K
    Brodsky, T
    Lin, YT
    Yan, Y
    Chen, RY
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (06) : 813 - 824