Temporal-spatial information mining and aggregation for video matting

被引：0

作者：

Zhiwei Ma

Guilin Yao

机构：

[1] Harbin University of Commerce,

来源：

Multimedia Tools and Applications | 2024年 / 83卷

关键词：

Double decoder; Spatial continuity; Temporal coherence; Video matting;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In previous video matting methods, there are some problems that require additional auxiliary information and lack of temporal consistency. To solve these problems, we propose a novel video matting framework (STMI-Net) based on temporal-spatial information mining and aggregation. This framework doesn’t require any auxiliary information and adopts a double decoder network structure, specifically, one decoder is composed of the recurrent network, which can make full use of the temporal information in the video frames to ensure the temporal coherence in results; and the other decoder is composed of the convolution network, which deeply restores the frame-by-frame spatial features to achieve the spatial continuity in results. By aggregating these two parts of the information at the global level, our model achieves 0.0066 MSE on the VideoMatte240K dataset, which surpasses the RVM baseline by 13%; and achieves 0.0047 MSE on PPM-100 portrait matting dataset, which surpasses the MG baseline by 26.5%. We also implement an ablation study to demonstrate the specific functions of the temporal decoder and the spatial decoder in our model.

引用

页码：29221 / 29237

页数：16

共 50 条

[41] VIDEO PREDICTION WITH TEMPORAL-SPATIAL ATTENTION MECHANISM AND DEEP PERCEPTUAL SIMILARITY BRANCH
Wu, Qian
Wang, Wenmin
Chen, Xiongtao
Li, Weimian
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1594 - 1599
[42] Temporal-Spatial Feature Extraction of DSA Video and Its Application in AVM Diagnosis
Shi, Keke
Xiao, Weiping
Wu, Guoqing
Xiao, Yang
Lei, Yu
Yu, Jinhua
Gu, Yuxiang
FRONTIERS IN NEUROLOGY, 2021, 12
[43] A Stereo Object Segmentation Algorithm Based on Disparity and Temporal-Spatial Information
Chen, Jing
Cai, Canhui
Li, Cuihua
IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2012), 2012,
[44] PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection
Wang, Han
Tang, Jun
Liu, Xiaodong
Guan, Shanyan
Xie, Rong
Song, Li
COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 732 - 747
[45] Temporal-Spatial Dichotomous Noises
LI Jing-Hui HAN Yin-Xia Center for Nonlinear Studies
Communications in Theoretical Physics, 2004, 42 (07) : 55 - 58
[46] Temporal-spatial dichotomous noises
Li, JH
Han, YX
COMMUNICATIONS IN THEORETICAL PHYSICS, 2004, 42 (01) : 55 - 58
[47] Consistent Panoramic Video Style Transfer via Temporal-Spatial Cross Perception
Wang, Weiyu
Qing, Chunmei
Tan, Junpeng
Xu, Xiangmin
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 265 - 277
[48] HEVC-BASED MOTION COMPENSATED JOINT TEMPORAL-SPATIAL VIDEO DENOISING
Tang, Minhao
Han, Yuxing
Wen, Jiangtao
Yang, Shiqiang
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1797 - 1801
[49] Video Enhancement Using Temporal-Spatial Total Variation Retinex and Luminance Adaptation
Wang, Liqian
Shao, Wenze
Ge, Qi
Li, Haibo
Xiao, Liang
Wei, Zhihui
PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC 2017), 2017, : 108 - 112
[50] Temporal-Spatial Symmetric Distributed Multi-View Video Coding Scheme
Zhang, Guoyun
Xiang, Canqun
Ou, Xianfeng
Yue, Hong
Guo, Longyuan
Wu, Jianhui
Tu, Bing
He, Wei
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (08)

← 1 2 3 4 5 →