ITrans: generative image inpainting with transformers

被引：2

作者：

Miao, Wei ^{[1
,4
]}

Wang, Lijun ^{[2
]}

Lu, Huchuan ^{[1
]}

Huang, Kaining ^{[3
]}

Shi, Xinchu ^{[3
]}

Liu, Bocong ^{[3
]}

机构：

[1] Dalian Univ Technol, Sch Informat & Commun Engn, 2 Linggong Rd, Dalian 116023, Peoples R China

[2] Dalian Univ Technol, Sch Artificial Intelligence, 2 Linggong Rd, Dalian 116023, Liaoning, Peoples R China

[3] Meituan Grp, 4 Wangjing East Rd, Beijing 100102, Peoples R China

[4] Univ Jyvaskyla, Fac Informat Technol, Seminaarinkatu 15, Jyvaskyla 40014, Finland

来源：

MULTIMEDIA SYSTEMS | 2024年 / 30卷 / 01期

关键词：

Convolutional neural network; Image inpainting; Global transformer; Local transformer; OBJECT REMOVAL;

D O I：

10.1007/s00530-023-01211-w

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Despite significant improvements, convolutional neural network (CNN) based methods are struggling with handling long-range global image dependencies due to their limited receptive fields, leading to an unsatisfactory inpainting performance under complicated scenarios. To address this issue, we propose the Inpainting Transformer (ITrans) network, which combines the power of both self-attention and convolution operations. The ITrans network augments convolutional encoder-decoder structure with two novel designs, i.e. , the global and local transformers. The global transformer aggregates high-level image context from the encoder in a global perspective, and propagates the encoded global representation to the decoder in a multi-scale manner. Meanwhile, the local transformer is intended to extract low-level image details inside the local neighborhood at a reduced computational overhead. By incorporating the above two transformers, ITrans is capable of both global relationship modeling and local details encoding, which is essential for hallucinating perceptually realistic images. Extensive experiments demonstrate that the proposed ITrans network outperforms favorably against state-of-the-art inpainting methods both quantitatively and qualitatively.

引用

页数：12

共 50 条

[41] Generative Image Inpainting with Multi-Stage Decoding Network
Liu, Wei-Rong
Mi, Yan-Chun
Yang, Fan
Zhang, Yan
Guo, Hong-Lin
Liu, Zhong-Min
[J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (03): : 625 - 636
[42] Medical image captioning via generative pretrained transformers
Alexander Selivanov
Oleg Y. Rogov
Daniil Chesakov
Artem Shelmanov
Irina Fedulova
Dmitry V. Dylov
[J]. Scientific Reports, 13
[43] Medical image captioning via generative pretrained transformers
Selivanov, Alexander
Rogov, Oleg Y.
Chesakov, Daniil
Shelmanov, Artem
Fedulova, Irina
Dylov, Dmitry V.
[J]. SCIENTIFIC REPORTS, 2023, 13 (01)
[44] Generative image inpainting via edge structure and color aware fusion
Shao, Hang
Wang, Yongxiong
Fu, Yinghua
Yin, Zhong
[J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 87 (87)
[45] Collaborative Contrastive Learning-Based Generative Model for Image Inpainting
Du, Yongqiang
Liu, Haoran
Chen, Songnan
[J]. IEEE ACCESS, 2022, 10 : 106641 - 106654
[46] Image inpainting based on tensor ring decomposition with generative adversarial network
Yuan, Jianjun
Wu, Hong
Zhao, Luoming
Wu, Fujun
[J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024,
[47] Generative Memory-Guided Semantic Reasoning Model for Image Inpainting
Feng, Xin
Pei, Wenjie
Li, Fengjun
Chen, Fanglin
Zhang, David
Lu, Guangming
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7432 - 7447
[48] PSinGAN: Single image inpainting by generative model trained on partial observation
Miyata, Takamichi
[J]. IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2024, 15 (03): : 577 - 587
[49] Multiview Scene Image Inpainting Based on Conditional Generative Adversarial Networks
Yuan, Zefeng
Li, Hengyu
Liu, Jingyi
Luo, Jun
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2020, 5 (02): : 314 - 323
[50] Image Multi-Inpainting via Progressive Generative Adversarial Networks
Cai, Jiayin
Li, Changlin
Tao, Xin
Tai, Yu-Wing
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 977 - 986

← 1 2 3 4 5 →