Previous video matting methods suffer from two problems: they require additional auxiliary information and they lack temporal consistency. To address these problems, we propose a novel video matting framework (STMI-Net) based on temporal-spatial information mining and aggregation. The framework requires no auxiliary information and adopts a dual-decoder structure: one decoder is a recurrent network that fully exploits the temporal information across video frames to ensure temporal coherence in the results, while the other is a convolutional network that deeply restores frame-by-frame spatial features to achieve spatial continuity. By aggregating these two streams of information at the global level, our model achieves an MSE of 0.0066 on the VideoMatte240K dataset, surpassing the RVM baseline by 13%, and an MSE of 0.0047 on the PPM-100 portrait matting dataset, surpassing the MG baseline by 26.5%. We also conduct an ablation study to demonstrate the specific contributions of the temporal decoder and the spatial decoder in our model.
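To make the dual-decoder idea concrete, the following is a minimal, illustrative PyTorch sketch of the structure described above: a shared frame encoder, a recurrent (ConvGRU-style) temporal decoder, a convolutional spatial decoder, and a fusion step that aggregates the two streams before predicting the alpha matte. All module names (`STMINetSketch`, `ConvGRUCell`), channel sizes, and layer choices are assumptions for illustration only, not the authors' actual STMI-Net architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ConvGRUCell(nn.Module):
    """A basic convolutional GRU cell used to carry temporal information across frames."""
    def __init__(self, channels):
        super().__init__()
        self.gates = nn.Conv2d(2 * channels, 2 * channels, 3, padding=1)
        self.cand = nn.Conv2d(2 * channels, channels, 3, padding=1)

    def forward(self, x, h):
        if h is None:
            h = torch.zeros_like(x)
        z, r = torch.sigmoid(self.gates(torch.cat([x, h], dim=1))).chunk(2, dim=1)
        n = torch.tanh(self.cand(torch.cat([x, r * h], dim=1)))
        return (1 - z) * h + z * n


class STMINetSketch(nn.Module):
    """Shared encoder + recurrent temporal decoder + convolutional spatial decoder,
    with the two streams aggregated before the final alpha prediction."""
    def __init__(self, feat=32):
        super().__init__()
        self.encoder = nn.Sequential(               # shared per-frame encoder
            nn.Conv2d(3, feat, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat, feat, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        self.temporal = ConvGRUCell(feat)            # recurrent temporal decoder
        self.spatial = nn.Sequential(                # frame-by-frame spatial decoder
            nn.Conv2d(feat, feat, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat, feat, 3, padding=1), nn.ReLU(inplace=True),
        )
        self.fuse = nn.Conv2d(2 * feat, 1, 3, padding=1)  # aggregate both streams -> alpha

    def forward(self, frames):                       # frames: (B, T, 3, H, W)
        h, alphas = None, []
        for t in range(frames.shape[1]):
            x = self.encoder(frames[:, t])
            h = self.temporal(x, h)                  # temporal stream (recurrent state)
            s = self.spatial(x)                      # spatial stream (per frame)
            a = torch.sigmoid(self.fuse(torch.cat([h, s], dim=1)))
            a = F.interpolate(a, size=frames.shape[-2:], mode="bilinear",
                              align_corners=False)
            alphas.append(a)
        return torch.stack(alphas, dim=1)            # alpha mattes: (B, T, 1, H, W)


if __name__ == "__main__":
    clip = torch.rand(2, 4, 3, 64, 64)               # dummy 4-frame clip
    print(STMINetSketch()(clip).shape)               # torch.Size([2, 4, 1, 64, 64])
```

The key design point mirrored here is that the recurrent decoder propagates a hidden state across frames (temporal coherence), while the convolutional decoder processes each frame independently (spatial detail), and only the fusion layer combines the two before the alpha output.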