Temporal-spatial information mining and aggregation for video matting

被引:0
|
作者
Zhiwei Ma
Guilin Yao
机构
[1] Harbin University of Commerce,
来源
关键词
Double decoder; Spatial continuity; Temporal coherence; Video matting;
D O I
暂无
中图分类号
学科分类号
摘要
In previous video matting methods, there are some problems that require additional auxiliary information and lack of temporal consistency. To solve these problems, we propose a novel video matting framework (STMI-Net) based on temporal-spatial information mining and aggregation. This framework doesn’t require any auxiliary information and adopts a double decoder network structure, specifically, one decoder is composed of the recurrent network, which can make full use of the temporal information in the video frames to ensure the temporal coherence in results; and the other decoder is composed of the convolution network, which deeply restores the frame-by-frame spatial features to achieve the spatial continuity in results. By aggregating these two parts of the information at the global level, our model achieves 0.0066 MSE on the VideoMatte240K dataset, which surpasses the RVM baseline by 13%; and achieves 0.0047 MSE on PPM-100 portrait matting dataset, which surpasses the MG baseline by 26.5%. We also implement an ablation study to demonstrate the specific functions of the temporal decoder and the spatial decoder in our model.
引用
收藏
页码:29221 / 29237
页数:16
相关论文
共 50 条
  • [41] VIDEO PREDICTION WITH TEMPORAL-SPATIAL ATTENTION MECHANISM AND DEEP PERCEPTUAL SIMILARITY BRANCH
    Wu, Qian
    Wang, Wenmin
    Chen, Xiongtao
    Li, Weimian
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1594 - 1599
  • [42] Temporal-Spatial Feature Extraction of DSA Video and Its Application in AVM Diagnosis
    Shi, Keke
    Xiao, Weiping
    Wu, Guoqing
    Xiao, Yang
    Lei, Yu
    Yu, Jinhua
    Gu, Yuxiang
    FRONTIERS IN NEUROLOGY, 2021, 12
  • [43] A Stereo Object Segmentation Algorithm Based on Disparity and Temporal-Spatial Information
    Chen, Jing
    Cai, Canhui
    Li, Cuihua
    IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2012), 2012,
  • [44] PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection
    Wang, Han
    Tang, Jun
    Liu, Xiaodong
    Guan, Shanyan
    Xie, Rong
    Song, Li
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 732 - 747
  • [45] Temporal-Spatial Dichotomous Noises
    LI Jing-Hui HAN Yin-Xia Center for Nonlinear Studies
    Communications in Theoretical Physics, 2004, 42 (07) : 55 - 58
  • [46] Temporal-spatial dichotomous noises
    Li, JH
    Han, YX
    COMMUNICATIONS IN THEORETICAL PHYSICS, 2004, 42 (01) : 55 - 58
  • [47] Consistent Panoramic Video Style Transfer via Temporal-Spatial Cross Perception
    Wang, Weiyu
    Qing, Chunmei
    Tan, Junpeng
    Xu, Xiangmin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 265 - 277
  • [48] HEVC-BASED MOTION COMPENSATED JOINT TEMPORAL-SPATIAL VIDEO DENOISING
    Tang, Minhao
    Han, Yuxing
    Wen, Jiangtao
    Yang, Shiqiang
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1797 - 1801
  • [49] Video Enhancement Using Temporal-Spatial Total Variation Retinex and Luminance Adaptation
    Wang, Liqian
    Shao, Wenze
    Ge, Qi
    Li, Haibo
    Xiao, Liang
    Wei, Zhihui
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC 2017), 2017, : 108 - 112
  • [50] Temporal-Spatial Symmetric Distributed Multi-View Video Coding Scheme
    Zhang, Guoyun
    Xiang, Canqun
    Ou, Xianfeng
    Yue, Hong
    Guo, Longyuan
    Wu, Jianhui
    Tu, Bing
    He, Wei
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (08)