FVIFormer: Flow-Guided Global-Local Aggregation Transformer Network for Video Inpainting

被引:2
|
作者
Yan, Weiqing [1 ]
Sun, Yiqiu [1 ]
Yue, Guanghui [2 ]
Zhou, Wei [3 ]
Liu, Hantao [3 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 261400, Peoples R China
[2] Shenzhen Univ, Med Sch, Sch Biomed Engn, Shenzhen 518060, Peoples R China
[3] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 4AG, Wales
基金
中国国家自然科学基金;
关键词
Machine learning--deep learning; OBJECT REMOVAL; IMAGE;
D O I
10.1109/JETCAS.2024.3392972
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video inpainting has been extensively used in recent years. Established works usually utilise the similarity between the missing region and its surrounding features to inpaint in the visually damaged content in a multi-stage manner. However, due to the complexity of the video content, it may result in the destruction of structural information of objects within the video. In addition to this, the presence of moving objects in the damaged regions of the video can further increase the difficulty of this work. To address these issues, we propose a flow-guided global-Local aggregation Transformer network for video inpainting. First, we use a pre-trained optical flow complementation network to repair the defective optical flow of video frames. Then, we propose a content inpainting module, which use the complete optical flow as a guide, and propagate the global content across the video frames using efficient temporal and spacial Transformer to inpaint in the corrupted regions of the video. Finally, we propose a structural rectification module to enhance the coherence of content around the missing regions via combining the extracted local and global features. In addition, considering the efficiency of the overall framework, we also optimized the self-attention mechanism to improve the speed of training and testing via depth-wise separable encoding. We validate the effectiveness of our method on the YouTube-VOS and DAVIS video datasets. Extensive experiment results demonstrate the effectiveness of our approach in edge-complementing video content that has undergone stabilisation algorithms.
引用
收藏
页码:235 / 244
页数:10
相关论文
共 50 条
  • [1] Flow-Guided Transformer for Video Inpainting
    Zhang, Kaidong
    Fu, Jingjing
    Liu, Dong
    COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 74 - 90
  • [2] Local and nonlocal flow-guided video inpainting
    Jing Wang
    Zongju Yang
    Zhanqiang Huo
    Wei Chen
    Multimedia Tools and Applications, 2024, 83 : 10321 - 10340
  • [3] Local and nonlocal flow-guided video inpainting
    Wang, Jing
    Yang, Zongju
    Huo, Zhanqiang
    Chen, Wei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10321 - 10340
  • [4] FSTT: Flow-Guided Spatial Temporal Transformer for Deep Video Inpainting
    Liu, Ruixin
    Zhu, Yuesheng
    ELECTRONICS, 2023, 12 (21)
  • [5] Deep Flow-Guided Video Inpainting
    Xu, Rui
    Li, Xiaoxiao
    Zhou, Bolei
    Loy, Chen Change
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3718 - 3727
  • [6] Motion-Guided Global-Local Aggregation Transformer Network for Precipitation Nowcasting
    Dong, Xichao
    Zhao, Zewei
    Wang, Yupei
    Wang, Jianping
    Hu, Cheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] Flow-Guided Video Inpainting with Scene Templates
    Lao, Dong
    Zhu, Peihao
    Wonka, Peter
    Sundaramoorthi, Ganesh
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14579 - 14588
  • [8] Flow-Guided Transformer for Video Colorization
    Zhai, Yan
    Tao, Zhulin
    Dai, Longquan
    Wang, He
    Huang, Xianglin
    Yang, Lifang
    Proceedings - International Conference on Image Processing, ICIP, 2023, : 2485 - 2489
  • [9] FLOW-GUIDED TRANSFORMER FOR VIDEO COLORIZATION
    Zhai, Yan
    Tao, Zhulin
    Dai, Longquan
    Wang, He
    Huang, Xianglin
    Yang, Lifang
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2485 - 2489
  • [10] Error Compensation Framework for Flow-Guided Video Inpainting
    Kang, Jaeyeon
    Oh, Seoung Wug
    Kim, Seon Joo
    COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 375 - 390