Attention-guided Temporally Coherent Video Object Matting

Citations: 13
Authors
Zhang, Yunke [1]
Wang, Chi [1]
Cui, Miaomiao [2]
Ren, Peiran [2]
Xie, Xuansong [2]
Hua, Xian-Sheng [3]
Bao, Hujun [1]
Huang, Qixing [4]
Xu, Weiwei [1]
Affiliations
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Damo Acad, Alibaba Grp, Hangzhou, Peoples R China
[4] Univ Texas Austin, Austin, TX 78712 USA
Funding
National Key Research and Development Program of China;
Keywords
datasets; neural networks; video matting; attention mechanism; INTERACTIVE IMAGE; SEGMENTATION;
DOI
10.1145/3474085.3475623
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes a novel deep learning-based video object matting method that achieves temporally coherent matting results. Its key component is an attention-based temporal aggregation module that maximizes the strength of image matting networks for video matting. This module computes temporal correlations in feature space for pixels adjacent to each other along the time axis, which makes it robust against motion noise. We also design a novel loss term to train the attention weights, which drastically boosts video matting performance. In addition, we show how to solve the trimap generation problem effectively by fine-tuning a state-of-the-art video object segmentation network with a sparse set of user-annotated keyframes. To facilitate training of the video matting and trimap generation networks, we construct a large-scale video matting dataset with 80 training and 28 validation foreground video clips with ground-truth alpha mattes. Experimental results show that our method can generate high-quality alpha mattes for various videos featuring appearance change, occlusion, and fast motion. Our code and dataset can be found at: https://github.com/yunkezhang/TCVOM
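The abstract describes the temporal aggregation module only at a high level. As a rough illustration of the general idea (attention over feature-space neighbours in adjacent frames), the following is a minimal sketch and not the authors' implementation. The function names `softmax` and `temporal_aggregate`, the window `radius`, the scaled dot-product similarity, and the choice of one previous and one next frame are all assumptions made for this example; the actual design is in the paper and the linked repository.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def temporal_aggregate(prev, cur, nxt, radius=1):
    """Aggregate features of frame `cur` from its two temporal neighbours.

    Each frame is an H x W x C nested list of floats (a feature map).
    ASSUMPTION: similarity is a scaled dot product and the candidates are
    a (2*radius+1)^2 window in each adjacent frame plus the pixel itself.
    """
    h, w, c = len(cur), len(cur[0]), len(cur[0][0])
    scale = math.sqrt(c)
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            query = cur[y][x]
            candidates = [query]  # keep the current pixel's own feature
            for frame in (prev, nxt):
                for dy in range(-radius, radius + 1):
                    for dx in range(-radius, radius + 1):
                        yy, xx = y + dy, x + dx
                        if 0 <= yy < h and 0 <= xx < w:
                            candidates.append(frame[yy][xx])
            # Attention weights from feature-space correlation with the query.
            scores = [
                sum(q * k for q, k in zip(query, cand)) / scale
                for cand in candidates
            ]
            weights = softmax(scores)
            # Weighted sum of candidate features, channel by channel.
            row.append([
                sum(wt * cand[i] for wt, cand in zip(weights, candidates))
                for i in range(c)
            ])
        out.append(row)
    return out
```

Because the weights come from feature similarity rather than from fixed pixel positions, a candidate that resembles the query contributes more than one that does not, which is one plausible reading of why such a scheme could tolerate small motion between frames.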
Pages: 5128 - 5137
Page count: 10
Related papers
50 in total
  • [41] Temporally Coherent Completion of Dynamic Video
    Huang, Jia-Bin
    Kang, Sing Bing
    Ahuja, Narendra
    Kopf, Johannes
    ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06):
  • [42] Automatic Temporally Coherent Video Colorization
    Thasarathan, Harrish
    Nazeri, Kamyar
    Ebrahimi, Mehran
    2019 16TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2019), 2019, : 189 - 194
  • [43] Dense Attention-Guided Cascaded Network for Salient Object Detection of Strip Steel Surface Defects
    Zhou, Xiaofei
    Fang, Hao
    Liu, Zhi
    Zheng, Bolun
    Sun, Yaoqi
    Zhang, Jiyong
    Yan, Chenggang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [44] ADMNet: Attention-Guided Densely Multi-Scale Network for Lightweight Salient Object Detection
    Zhou, Xiaofei
    Shen, Kunye
    Liu, Zhi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10828 - 10841
  • [45] Attention-guided Unified Network for Panoptic Segmentation
    Li, Yanwei
    Chen, Xinze
    Zhu, Zheng
    Xie, Lingxi
    Huang, Guan
    Du, Dalong
    Wang, Xingang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7019 - 7028
  • [46] A model of attention-guided visual perception and recognition
    Rybak, IA
    Gusakova, VI
    Golovan, AV
    Podladchikova, LN
    Shevtsova, NA
    VISION RESEARCH, 1998, 38 (15-16) : 2387 - 2400
  • [47] Attention-guided generator with dual discriminator GAN for real-time video anomaly detection
    Singh, Rituraj
    Sethi, Anikeit
    Saini, Krishanu
    Saurav, Sumeet
    Tiwari, Aruna
    Singh, Sanjay
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
  • [48] DIFA-GAN: Differential Attention-Guided Generative Adversarial Network for Unsupervised Video Forecasting
    Jin, Beibei
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1795 - 1799
  • [49] Multiscale Attention-Guided Panoptic Segmentation Network
    Fu, Du
    Qu, Shaojun
    Fu, Ya
    Computer Engineering and Applications, 2023, 59 (22) : 223 - 232
  • [50] Attention-Guided Multispectral and Panchromatic Image Classification
    Shi, Cheng
    Dang, Yenan
    Fang, Li
    Lv, Zhiyong
    Shen, Huifang
    REMOTE SENSING, 2021, 13 (23)