Feature pre-inpainting enhanced transformer for video inpainting

Cited by: 6
Authors
Li, Guanxiao [1]
Zhang, Ke [1]
Su, Yu [1]
Wang, Jingyu [1,2]
Affiliations
[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Video inpainting; Feature pre-inpainting; Local-global interleaving transformer;
DOI
10.1016/j.engappai.2023.106323
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Transformer-based video inpainting methods aggregate coherent content into missing regions by learning spatial-temporal dependencies. However, existing methods suffer from inaccurate self-attention calculation and excessive quadratic computational complexity, caused by uninformative representations of missing regions and inefficient global self-attention mechanisms, respectively. To mitigate these problems, we propose a Feature pre-Inpainting enhanced Transformer (FITer) video inpainting method, in which a feature pre-inpainting network (FPNet) and a local-global interleaving Transformer are designed. The FPNet pre-inpaints missing features before the Transformer by exploiting spatial context, so the representations of missing regions are enhanced with more informative content. As a result, the interleaving Transformer can calculate more accurate self-attention weights and learn more effective dependencies between missing and valid regions. Because the interleaving Transformer combines global and window-based local self-attention mechanisms, the proposed FITer method can effectively aggregate spatial-temporal features into missing regions while improving efficiency. Experiments on the YouTube-VOS and DAVIS datasets demonstrate that the FITer method outperforms previous methods qualitatively and quantitatively.
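The efficiency argument in the abstract (window-based local attention alongside global attention) can be illustrated with a minimal NumPy sketch. This is not the authors' FITer implementation: the function names, the use of Q = K = V without learned projections, windows that span all frames, and the alternation order (local first, then global) are all assumptions made for illustration only.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(tokens):
    """Scaled dot-product self-attention over the second-to-last axis.
    Assumption: Q = K = V = tokens (no learned projections)."""
    d = tokens.shape[-1]
    scores = tokens @ np.swapaxes(tokens, -1, -2) / np.sqrt(d)
    return softmax(scores) @ tokens

def global_attention(feats):
    """feats: (T, H, W, C). Every token attends to every token in every frame."""
    T, H, W, C = feats.shape
    return attention(feats.reshape(T * H * W, C)).reshape(T, H, W, C)

def window_attention(feats, win):
    """Attention restricted to non-overlapping win x win spatial windows.
    Assumption: each window spans all T frames."""
    T, H, W, C = feats.shape
    assert H % win == 0 and W % win == 0, "H and W must be multiples of win"
    nh, nw = H // win, W // win
    x = feats.reshape(T, nh, win, nw, win, C)
    x = x.transpose(1, 3, 0, 2, 4, 5)           # (nh, nw, T, win, win, C)
    x = attention(x.reshape(nh, nw, T * win * win, C))
    x = x.reshape(nh, nw, T, win, win, C)
    x = x.transpose(2, 0, 3, 1, 4, 5)           # back to (T, nh, win, nw, win, C)
    return x.reshape(T, H, W, C)

def interleaved(feats, win, depth):
    """Alternate local and global blocks with residual connections (hypothetical order)."""
    for i in range(depth):
        block = window_attention(feats, win) if i % 2 == 0 else global_attention(feats)
        feats = feats + block
    return feats

def score_entries(T, H, W, win=None):
    """Number of attention-score entries computed by one layer."""
    if win is None:                              # global attention
        return (T * H * W) ** 2
    return (H // win) * (W // win) * (T * win * win) ** 2
```

Under these assumptions, a global layer computes (THW)^2 score entries, while a windowed layer computes T^2 * H * W * win^2, a reduction by a factor of HW / win^2. For T = 2, H = W = 8 and win = 4, `score_entries` gives 16384 entries for the global layer and 4096 for the windowed one.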
Pages: 12
Related papers
50 in total
  • [21] Pre-Inpainting Convolutional Skip Triple Attention Segmentation Network for AGV Lane Detection in Overexposure Environment
    Yang, Zongxin
    Yang, Xu
    Wu, Long
    Hu, Jiemin
    Zou, Bo
    Zhang, Yong
    Zhang, Jianlong
    APPLIED SCIENCES-BASEL, 2022, 12 (20)
  • [22] Video-rate Video Inpainting
    Murase, Rito
    Zhang, Yan
    Okatani, Takayuki
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019: 1553-1561
  • [23] Digital Inpainting and Video Falsifying
    Tang, Nick C.
    Shih, Timothy K.
    INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008: 18-23
  • [24] Video Inpainting of Complex Scenes
    Newson, Alasdair
    Almansa, Andres
    Fradet, Matthieu
    Gousseau, Yann
    Perez, Patrick
    SIAM JOURNAL ON IMAGING SCIENCES, 2014, 7 (04): 1993-2019
  • [25] Face Inpainting by Feature Guidance
    Tang, Nick C.
    Zhuang, Yueting
    Wang, Yushun
    Shih, Timothy K.
    Tsai, Joseph C.
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009: 2613-+
  • [26] Stereo-video inpainting
    Raimbault, Felix
    Kokaram, Anil
    JOURNAL OF ELECTRONIC IMAGING, 2012, 21 (01)
  • [27] Deep Stereo Video Inpainting
    Wu, Zhiliang
    Sun, Changchang
    Xuan, Hanyu
    Yan, Yan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023: 5693-5702
  • [28] Inpainting Algorithm for Video Processing
    Chede, Mayuri D.
    Metkar, Shilpa P.
    AMBIENT COMMUNICATIONS AND COMPUTER SYSTEMS, RACCCS 2017, 2018, 696: 717-728
  • [29] Method of Fast Video Inpainting
    Petrov, Eugeny
    Kharina, Natalia
    2019 INTERNATIONAL SIBERIAN CONFERENCE ON CONTROL AND COMMUNICATIONS (SIBCON), 2019
  • [30] Video Inpainting: A Complete Framework
    Zarif, Sameh
    Ibrahim, Mina
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2021, 21 (03)