ITrans: generative image inpainting with transformers

被引:2
|
作者
Miao, Wei [1 ,4 ]
Wang, Lijun [2 ]
Lu, Huchuan [1 ]
Huang, Kaining [3 ]
Shi, Xinchu [3 ]
Liu, Bocong [3 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, 2 Linggong Rd, Dalian 116023, Peoples R China
[2] Dalian Univ Technol, Sch Artificial Intelligence, 2 Linggong Rd, Dalian 116023, Liaoning, Peoples R China
[3] Meituan Grp, 4 Wangjing East Rd, Beijing 100102, Peoples R China
[4] Univ Jyvaskyla, Fac Informat Technol, Seminaarinkatu 15, Jyvaskyla 40014, Finland
关键词
Convolutional neural network; Image inpainting; Global transformer; Local transformer; OBJECT REMOVAL;
D O I
10.1007/s00530-023-01211-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite significant improvements, convolutional neural network (CNN) based methods are struggling with handling long-range global image dependencies due to their limited receptive fields, leading to an unsatisfactory inpainting performance under complicated scenarios. To address this issue, we propose the Inpainting Transformer (ITrans) network, which combines the power of both self-attention and convolution operations. The ITrans network augments convolutional encoder-decoder structure with two novel designs, i.e. , the global and local transformers. The global transformer aggregates high-level image context from the encoder in a global perspective, and propagates the encoded global representation to the decoder in a multi-scale manner. Meanwhile, the local transformer is intended to extract low-level image details inside the local neighborhood at a reduced computational overhead. By incorporating the above two transformers, ITrans is capable of both global relationship modeling and local details encoding, which is essential for hallucinating perceptually realistic images. Extensive experiments demonstrate that the proposed ITrans network outperforms favorably against state-of-the-art inpainting methods both quantitatively and qualitatively.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] ITrans: generative image inpainting with transformers
    Wei Miao
    Lijun Wang
    Huchuan Lu
    Kaining Huang
    Xinchu Shi
    Bocong Liu
    [J]. Multimedia Systems, 2024, 30
  • [2] Generative image inpainting with enhanced gated convolution and Transformers
    Wang, Min
    Lu, Wanglong
    Lyu, Jiankai
    Shi, Kaijie
    Zhao, Hanli
    [J]. DISPLAYS, 2022, 75
  • [3] Generative image inpainting for link prediction
    Fulan Qian
    Jianhong Li
    Xiuquan Du
    Xi Chen
    Shu Zhao
    Yanping Zhang
    [J]. Applied Intelligence, 2020, 50 : 4482 - 4494
  • [4] Generative Image Inpainting with Contextual Attention
    Yu, Jiahui
    Lin, Zhe
    Yang, Jimei
    Shen, Xiaohui
    Lu, Xin
    Huang, Thomas S.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5505 - 5514
  • [5] Generative Image Inpainting with Submanifold Alignment
    Li, Ang
    Qi, Jianzhong
    Zhang, Rui
    Ma, Xingjun
    Ramamohanarao, Kotagiri
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 811 - 817
  • [6] Generative image inpainting for link prediction
    Qian, Fulan
    Li, Jianhong
    Du, Xiuquan
    Chen, Xi
    Zhao, Shu
    Zhang, Yanping
    [J]. APPLIED INTELLIGENCE, 2020, 50 (12) : 4482 - 4494
  • [7] Diverse Image Inpainting with Bidirectional and Autoregressive Transformers
    Yu, Yingchen
    Zhan, Fangneng
    Wu, Rongliang
    Pan, Jianxiong
    Cui, Kaiwen
    Lu, Shijian
    Ma, Feiying
    Xie, Xuansong
    Miao, Chunyan
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 69 - 78
  • [8] Image Inpainting Based on Adaptive Generative Models
    Gapon, N.
    Puzerenko, A.
    Voronin, V.
    Zhdanova, M.
    Semenishchev, E.
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
  • [9] Semantic Image Inpainting with Deep Generative Models
    Yeh, Raymond A.
    Chen, Chen
    Lim, Teck Yian
    Schwing, Alexander G.
    Hasegawa-Johnson, Mark
    Do, Minh N.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6882 - 6890
  • [10] Semantic Image Inpainting with Progressive Generative Networks
    Zhang, Haoran
    Hu, Zhenzhen
    Luo, Changzhi
    Zuo, Wangmeng
    Wang, Meng
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1939 - 1947