Feature pre-inpainting enhanced transformer for video inpainting

Cited by: 6

Authors:

Li, Guanxiao [1]

Zhang, Ke [1]

Su, Yu [1]

Wang, Jingyu [1,2]

Affiliations:

[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Shaanxi, Peoples R China

[2] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect (iOPEN), Xian 710072, Shaanxi, Peoples R China

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023 / Volume 123

Funding:

National Natural Science Foundation of China;

Keywords:

Video inpainting; Feature pre-inpainting; Local-global interleaving transformer;

DOI:

10.1016/j.engappai.2023.106323

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Transformer-based video inpainting methods aggregate coherent content into missing regions by learning spatial-temporal dependencies. However, existing methods suffer from inaccurate self-attention calculation and excessive quadratic computational complexity, due to uninformative representations of missing regions and inefficient global self-attention mechanisms, respectively. To mitigate these problems, we propose a Feature pre-Inpainting enhanced Transformer (FITer) video inpainting method, in which a feature pre-inpainting network (FPNet) and a local-global interleaving Transformer are designed. The FPNet pre-inpaints missing features before the Transformer by exploiting spatial context, so the representations of missing regions are enhanced with more informative content. As a result, the interleaving Transformer can calculate more accurate self-attention weights and learn more effective dependencies between missing and valid regions. Since the interleaving Transformer involves both global and window-based local self-attention mechanisms, the proposed FITer method can effectively aggregate spatial-temporal features into missing regions while improving efficiency. Experiments on the YouTube-VOS and DAVIS datasets demonstrate that the FITer method outperforms previous methods qualitatively and quantitatively.
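
The efficiency argument in the abstract (global self-attention is quadratic in the number of tokens, while window-based local attention is not) can be illustrated with a rough count of query-key pairs. The sketch below is not taken from the paper: the token grid sizes, the window size, and the assumption that local windows are non-overlapping spatial windows within each frame are all hypothetical choices made only for illustration.

```python
def global_attention_pairs(frames: int, height: int, width: int) -> int:
    """Query-key pairs when every token attends to every token in the clip."""
    tokens = frames * height * width
    return tokens * tokens


def window_attention_pairs(frames: int, height: int, width: int, window: int) -> int:
    """Query-key pairs when tokens attend only inside their own window.

    Assumption (hypothetical, not the paper's design): windows are
    non-overlapping window x window patches of tokens within a single frame.
    """
    if height % window or width % window:
        raise ValueError("height and width must be divisible by window")
    windows_per_frame = (height // window) * (width // window)
    tokens_per_window = window * window
    return frames * windows_per_frame * tokens_per_window ** 2


# Hypothetical token grid: 5 frames of 20 x 36 tokens, 4 x 4 windows.
g = global_attention_pairs(5, 20, 36)
w = window_attention_pairs(5, 20, 36, 4)
print(g, w, g // w)
```

Under these assumed sizes the global variant scores 12,960,000 pairs and the windowed variant 57,600, a factor of 225. Interleaving the two kinds of layers, as the abstract describes, trades some of that saving for long-range context.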

Pages: 12

50 items in total

[21] Pre-Inpainting Convolutional Skip Triple Attention Segmentation Network for AGV Lane Detection in Overexposure Environment
Yang, Zongxin
Yang, Xu
Wu, Long
Hu, Jiemin
Zou, Bo
Zhang, Yong
Zhang, Jianlong
APPLIED SCIENCES-BASEL, 2022, 12 (20)
[22] Video-rate Video Inpainting
Murase, Rito
Zhang, Yan
Okatani, Takayuki
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019: 1553 - 1561
[23] Digital Inpainting and Video Falsifying
Tang, Nick C.
Shih, Timothy K.
INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008: 18 - 23
[24] Video Inpainting of Complex Scenes
Newson, Alasdair
Almansa, Andres
Fradet, Matthieu
Gousseau, Yann
Perez, Patrick
SIAM JOURNAL ON IMAGING SCIENCES, 2014, 7 (04): 1993 - 2019
[25] Face Inpainting by Feature Guidance
Tang, Nick C.
Zhuang, Yueting
Wang, Yushun
Shih, Timothy K.
Tsai, Joseph C.
ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 2613 - +
[26] Stereo-video inpainting
Raimbault, Felix
Kokaram, Anil
JOURNAL OF ELECTRONIC IMAGING, 2012, 21 (01)
[27] Deep Stereo Video Inpainting
Wu, Zhiliang
Sun, Changchang
Xuan, Hanyu
Yan, Yan
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023: 5693 - 5702
[28] Inpainting Algorithm for Video Processing
Chede, Mayuri D.
Metkar, Shilpa P.
AMBIENT COMMUNICATIONS AND COMPUTER SYSTEMS, RACCCS 2017, 2018, 696: 717 - 728
[29] Method of Fast Video Inpainting
Petrov, Eugeny
Kharina, Natalia
2019 INTERNATIONAL SIBERIAN CONFERENCE ON CONTROL AND COMMUNICATIONS (SIBCON), 2019
[30] Video Inpainting: A Complete Framework
Zarif, Sameh
Ibrahim, Mina
INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2021, 21 (03)
