Real Time Video Object Segmentation in Compressed Domain

被引:27
|
作者
Tan, Zhentao [1 ,2 ]
Liu, Bin [1 ,2 ]
Chu, Qi [1 ,2 ]
Zhong, Hangshi [1 ,2 ]
Wu, Yue [3 ]
Li, Weihai [1 ,2 ]
Yu, Nenghai [1 ,2 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230026, Peoples R China
[2] Chinese Acad Sci, Key Lab Electromagnet Space Informat, Hefei 230026, Peoples R China
[3] Alibaba Grp, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Compressed domain; object segmentation; feature propagation; feature matching;
D O I
10.1109/TCSVT.2020.2971641
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Many of the recent methods for semi-supervised video object segmentation are still far from being applicable for real time applications due to their slow inference speed. Therefore, we explore a propagation based segmentation method in compressed domain to accelerate inference speed in this paper. In particular, we only extract the features of I-frames by traditional deep convolutional neural network and produce the features of P-frames through information flow propagation. In the process of feature propagation, we propose two effective components to enhance the representation ability of simply warped features in terms of appearance and location. Specifically, we propose a residual supplement module to supplement appearance information which is lost in direct warping and a spatial attention module that can mine extra spatial saliency to provide the location information of the specified object. Besides, we propose a metric based decoder module which consists of a feature match module and a multi-level refinement module to transform information from semantic representation to shape segmentation mask. Extensive experiments on several video datasets demonstrate that the proposed method can achieve comparable accuracy while much faster inference speed when compared to the state-of-the-art algorithms.
引用
收藏
页码:175 / 188
页数:14
相关论文
共 50 条
  • [31] Adaptive Template and Transition Map for Real-Time Video Object Segmentation
    Park, Hyojin
    Yoo, Jayeon
    Venkatesh, Ganesh
    Kwak, Nojun
    [J]. IEEE Access, 2021, 9 : 116914 - 116926
  • [32] Fast Real-Time Video Object Segmentation with a Tangled Memory Network
    Mei, Jianbiao
    Wang, Mengmeng
    Yang, Yu
    Li, Yanjun
    Liu, Yong
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (03)
  • [33] Moving Object Segmentation in the H.264 Compressed Domain
    Niu, Changfeng
    Liu, Yushu
    [J]. COMPUTER VISION - ACCV 2009, PT II, 2010, 5995 : 645 - 654
  • [34] Compressed domain moving object segmentation in local motion scene
    [J]. 2005, Science Press, Beijing, China (27):
  • [35] Moving object segmentation in the H.264 compressed domain
    Liu, Zhi
    Zhang, Zhaoyang
    Shen, Liquan
    [J]. OPTICAL ENGINEERING, 2007, 46 (01)
  • [36] Compressed-domain video segmentation using wavelet transformation
    Yu, HH
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XXII, 1999, 3808 : 140 - 147
  • [37] A new approach to object based video compositing in the compressed domain
    Abdel-Maguid, M
    Moniri, M
    [J]. PROCEEDINGS OF THE FIFTH IASTED INTERNATIONAL CONFERENCE ON VISUALIZATION, IMAGING, AND IMAGE PROCESSING, 2005, : 274 - 278
  • [38] Video object summarization in the MPEG-4 compressed domain
    Erol, B
    Kossentini, F
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 2027 - 2030
  • [39] Motion-based video object tracking in the compressed domain
    Ritch, Mark
    Canagarajah, Nishan
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 3097 - 3100
  • [40] Real-Time Compressed-Domain Video Watermarking Resistance to Geometric Distortions
    Wang, Liyun
    Ling, Hefei
    Zou, Fuhao
    Lu, Zhengding
    [J]. IEEE MULTIMEDIA, 2012, 19 (01) : 70 - 79