Flow-Guided Transformer for Video Inpainting

被引:25
|
作者
Zhang, Kaidong [1 ]
Fu, Jingjing [2 ]
Liu, Dong [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
来源
关键词
Video inpainting; Optical flow; Transformer; OBJECT REMOVAL; IMAGE;
D O I
10.1007/978-3-031-19797-0_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a flow-guided transformer, which innovatively leverage the motion discrepancy exposed by optical flows to instruct the attention retrieval in transformer for high fidelity video inpainting. More specially, we design a novel flow completion network to complete the corrupted flows by exploiting the relevant flow features in a local temporal window. With the completed flows, we propagate the content across video frames, and adopt the flow-guided transformer to synthesize the rest corrupted regions. We decouple transformers along temporal and spatial dimension, so that we can easily integrate the locally relevant completed flows to instruct spatial attention only. Furthermore, we design a flow-reweight module to precisely control the impact of completed flows on each spatial transformer. For the sake of efficiency, we introduce window partition strategy to both spatial and temporal transformers. Especially in spatial transformer, we design a dual perspective spatial MHSA, which integrates the global tokens to the window-based attention. Extensive experiments demonstrate the effectiveness of the proposed method qualitatively and quantitatively. Codes are available at https://github.com/hitachinsk/FGT.
引用
收藏
页码:74 / 90
页数:17
相关论文
共 50 条
  • [1] FSTT: Flow-Guided Spatial Temporal Transformer for Deep Video Inpainting
    Liu, Ruixin
    Zhu, Yuesheng
    ELECTRONICS, 2023, 12 (21)
  • [2] Deep Flow-Guided Video Inpainting
    Xu, Rui
    Li, Xiaoxiao
    Zhou, Bolei
    Loy, Chen Change
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3718 - 3727
  • [3] Local and nonlocal flow-guided video inpainting
    Jing Wang
    Zongju Yang
    Zhanqiang Huo
    Wei Chen
    Multimedia Tools and Applications, 2024, 83 : 10321 - 10340
  • [4] Local and nonlocal flow-guided video inpainting
    Wang, Jing
    Yang, Zongju
    Huo, Zhanqiang
    Chen, Wei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10321 - 10340
  • [5] Flow-Guided Video Inpainting with Scene Templates
    Lao, Dong
    Zhu, Peihao
    Wonka, Peter
    Sundaramoorthi, Ganesh
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14579 - 14588
  • [6] Flow-Guided Transformer for Video Colorization
    Zhai, Yan
    Tao, Zhulin
    Dai, Longquan
    Wang, He
    Huang, Xianglin
    Yang, Lifang
    Proceedings - International Conference on Image Processing, ICIP, 2023, : 2485 - 2489
  • [7] FLOW-GUIDED TRANSFORMER FOR VIDEO COLORIZATION
    Zhai, Yan
    Tao, Zhulin
    Dai, Longquan
    Wang, He
    Huang, Xianglin
    Yang, Lifang
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2485 - 2489
  • [8] FVIFormer: Flow-Guided Global-Local Aggregation Transformer Network for Video Inpainting
    Yan, Weiqing
    Sun, Yiqiu
    Yue, Guanghui
    Zhou, Wei
    Liu, Hantao
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2024, 14 (02) : 235 - 244
  • [9] Error Compensation Framework for Flow-Guided Video Inpainting
    Kang, Jaeyeon
    Oh, Seoung Wug
    Kim, Seon Joo
    COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 375 - 390
  • [10] Flow-Guided Sparse Transformer for Video Deblurring
    Lin, Jing
    Cai, Yuanhao
    Hu, Xiaowan
    Wang, Haoqian
    Yan, Youliang
    Zou, Xueyi
    Ding, Henghui
    Zhang, Yulun
    Timofte, Radu
    Van Gool, Luc
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,