Diverse Image Inpainting with Bidirectional and Autoregressive Transformers

被引:40
|
作者
Yu, Yingchen [1 ,2 ]
Zhan, Fangneng [1 ]
Wu, Rongliang [1 ]
Pan, Jianxiong [3 ]
Cui, Kaiwen [1 ]
Lu, Shijian [1 ]
Ma, Feiying [3 ]
Xie, Xuansong [3 ]
Miao, Chunyan [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Alibaba Grp, Singapore, Singapore
[3] Alibaba Grp, DAMO Acad, Hangzhou, Zhejiang, Peoples R China
关键词
computer vision; deep learning; transformer; image inpainting;
D O I
10.1145/3474085.3475436
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image inpainting is an underdetermined inverse problem, which naturally allows diverse contents to fill up the missing or corrupted regions realistically. Prevalent approaches using convolutional neural networks (CNNs) can synthesize visually pleasant contents, but CNNs suffer from limited perception fields for capturing global features. With image-level attention, transformers enable to model long-range dependencies and generate diverse contents with autoregressive modeling of pixel-sequence distributions. However, the unidirectional attention in autoregressive transformers is sub-optimal as corrupted image regions may have arbitrary shapes with contexts from any direction. We propose BAT-Fill, an innovative image inpainting framework that introduces a novel bidirectional autoregressive transformer (BAT) for image inpainting. BAT utilizes the transformers to learn autoregressive distributions, which naturally allows the diverse generation of missing contents. In addition, it incorporates the masked language model like BERT, which enables bidirectionally modeling of contextual information of missing regions for better image completion. Extensive experiments over multiple datasets show that BAT-Fill achieves superior diversity and fidelity in image inpainting qualitatively and quantitatively.
引用
收藏
页码:69 / 78
页数:10
相关论文
共 50 条
  • [1] ITrans: generative image inpainting with transformers
    Miao, Wei
    Wang, Lijun
    Lu, Huchuan
    Huang, Kaining
    Shi, Xinchu
    Liu, Bocong
    [J]. MULTIMEDIA SYSTEMS, 2024, 30 (01)
  • [2] ITrans: generative image inpainting with transformers
    Wei Miao
    Lijun Wang
    Huchuan Lu
    Kaining Huang
    Xinchu Shi
    Bocong Liu
    [J]. Multimedia Systems, 2024, 30
  • [3] Diverse image inpainting with disentangled uncertainty
    Wang, Wentao
    He, Lu
    Niu, Li
    Zhang, Jianfu
    Liu, Yue
    Ling, Haoyu
    Zhang, Liqing
    [J]. PATTERN RECOGNITION, 2023, 137
  • [4] Diverse Image Inpainting with Normalizing Flow
    Wang, Cairong
    Zhu, Yiming
    Yuan, Chun
    [J]. COMPUTER VISION, ECCV 2022, PT XXIII, 2022, 13683 : 53 - 69
  • [5] Bidirectional interaction of CNN and Transformer for image inpainting
    Liu, Jialu
    Gong, Maoguo
    Gao, Yuan
    Lu, Yiheng
    Li, Hao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [6] Image Inpainting with Learnable Bidirectional Attention Maps
    Xie, Chaohao
    Liu, Shaohui
    Li, Chao
    Cheng, Ming-Ming
    Zuo, Wangmeng
    Liu, Xiao
    Wen, Shilei
    Ding, Errui
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8857 - 8866
  • [7] Reduce Information Loss in Transformers for Pluralistic Image Inpainting
    Liu, Qiankun
    Tan, Zhentao
    Chen, Dongdong
    Chu, Qi
    Dai, Xiyang
    Chen, Yinpeng
    Liu, Mengchen
    Yuan, Lu
    Yu, Nenghai
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11337 - 11347
  • [8] Generative image inpainting with enhanced gated convolution and Transformers
    Wang, Min
    Lu, Wanglong
    Lyu, Jiankai
    Shi, Kaijie
    Zhao, Hanli
    [J]. DISPLAYS, 2022, 75
  • [9] FiNet: Compatible and Diverse Fashion Image Inpainting
    Han, Xintong
    Wu, Zuxuan
    Huang, Weilin
    Scott, Matthew R.
    Davis, Larry S.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4480 - 4490
  • [10] Image inpainting by bidirectional information flow on texture and structure
    Lian, Jing
    Zhang, Jibao
    Zhang, Huaikun
    Chen, Yuekai
    Zhang, Jiajun
    Liu, Jizhao
    [J]. SIGNAL PROCESSING, 2025, 226