Temporal-spatial information mining and aggregation for video matting

Cited by: 0
Authors
Zhiwei Ma
Guilin Yao
Affiliation
[1] Harbin University of Commerce
Source
Multimedia Tools and Applications | 2024 / Vol. 83
Keywords
Double decoder; Spatial continuity; Temporal coherence; Video matting;
DOI
Not available
Abstract
Previous video matting methods often require additional auxiliary information and lack temporal consistency. To address these problems, we propose a novel video matting framework (STMI-Net) based on temporal-spatial information mining and aggregation. The framework requires no auxiliary information and adopts a double-decoder network structure. One decoder is built from a recurrent network, which makes full use of the temporal information across video frames to ensure temporal coherence in the results. The other decoder is built from a convolutional network, which restores frame-by-frame spatial features in depth to achieve spatial continuity in the results. By aggregating these two kinds of information at the global level, our model achieves 0.0066 MSE on the VideoMatte240K dataset, surpassing the RVM baseline by 13%, and 0.0047 MSE on the PPM-100 portrait matting dataset, surpassing the MG baseline by 26.5%. We also conduct an ablation study to demonstrate the specific functions of the temporal decoder and the spatial decoder in our model.
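The MSE figures quoted in the abstract are per-pixel errors between a predicted alpha matte and its ground truth. The sketch below shows one common way to compute such a score. It is an illustration only, not the authors' evaluation code: the helper names `alpha_mse` and `relative_reduction`, the nested-list layout (frames, rows, columns), and all toy values are assumptions, and the abstract does not state how its percentage gains were computed.

```python
def flatten(values):
    """Yield every scalar from an arbitrarily nested list of alpha values."""
    for v in values:
        if isinstance(v, (list, tuple)):
            yield from flatten(v)
        else:
            yield float(v)


def alpha_mse(pred, target):
    """Mean squared error over all pixels of all frames (hypothetical helper)."""
    p = list(flatten(pred))
    t = list(flatten(target))
    if not p or len(p) != len(t):
        raise ValueError("mattes must be non-empty and have the same number of pixels")
    return sum((a - b) ** 2 for a, b in zip(p, t)) / len(p)


def relative_reduction(baseline_mse, model_mse):
    """Fractional error reduction relative to a baseline (assumed definition)."""
    return (baseline_mse - model_mse) / baseline_mse


# Toy clip: 2 frames of 2x2 alpha values in [0, 1]; the values are made up.
pred = [[[0.0, 1.0], [0.5, 1.0]], [[0.0, 1.0], [0.5, 0.5]]]
target = [[[0.0, 1.0], [1.0, 1.0]], [[0.0, 1.0], [0.5, 1.0]]]

clip_mse = alpha_mse(pred, target)
print(clip_mse)
```

Averaging over every pixel of every frame gives a single score per clip; a benchmark protocol may instead average per frame or rescale the result, so scores are comparable only under the same protocol.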
Pages: 29221-29237
Page count: 16