Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting

被引:309
|
作者
Zeng, Yanhong [1 ,2 ]
Fu, Jianlong [3 ]
Chao, Hongyang [1 ,2 ]
Guo, Baining [3 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
[2] Sun Yat Sen Univ, Key Lab Machine Intelligence & Adv Comp, Minist Educ, Guangzhou, Peoples R China
[3] Microsoft Res, Beijing, Peoples R China
关键词
COMPLETION;
D O I
10.1109/CVPR.2019.00158
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-quality image inpainting requires filling missing regions in a damaged image with plausible content. Existing works eitherfill the regions by copying image patches or generating semantically-coherent patches from region context, while neglect the fact that both visual and semantic plausibility are highly-demanded. In this paper, we propose a Pyramid-context ENcoder Network (PEN-Net) for image inpainting by deep generative models. The PEN-Net is built upon a U-Net structure, which can restore an image by encoding contextual semantics from full resolution input, and decoding the learned semantic features back into images. Specifically, we propose a pyramid-context encoder, which progressively learns region affinity by attention from a high-level semantic feature map and transfers the learned attention to the previous low-level feature map. As the missing content can be filled by attention transfer from deep to shallow in a pyramid fashion, both visual and semantic coherence for image inpainting can be ensured. We further propose a multi-scale decoder with deeply-supervised pyramid losses and an adversarial loss. Such a design not only results in fast convergence in training, but more realistic results in testing. Extensive experiments on various datasets show the superior performance of the proposed network.
引用
收藏
页码:1486 / 1494
页数:9
相关论文
共 50 条
  • [41] Recovering real-world scene: high-quality image inpainting using multi-exposed references
    Zhu, Z. J.
    Li, Z. G.
    Rahardja, S.
    Fraenti, P.
    ELECTRONICS LETTERS, 2009, 45 (25) : 1310 - 1311
  • [43] Video Stabilization via Prediction with Time-Series Network and Image Inpainting with Pyramid Fusion
    Cheng Keyang
    Li Shichao
    Rong Lan
    Wang Wenshan
    Shi Wenxi
    Zhan Yongzhao
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (06) : 1103 - 1110
  • [44] Video Stabilization via Prediction with Time-Series Network and Image Inpainting with Pyramid Fusion
    CHENG Keyang
    LI Shichao
    RONG Lan
    WANG Wenshan
    SHI Wenxi
    ZHAN Yongzhao
    Chinese Journal of Electronics, 2021, 30 (06) : 1103 - 1110
  • [45] Structure-Guided Image Inpainting Based on Multi-Scale Attention Pyramid Network
    Gong, Jun
    Luo, Senlin
    Yu, Wenxin
    Nie, Liang
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [46] Image Inpainting Forensics Algorithm Based on Dual-Domain Encoder-Decoder Network
    Zhang, Dengyong
    Tan, En
    Li, Feng
    Liu, Shuai
    Wang, Jing
    Hu, Jinbin
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT V, 2024, 14491 : 92 - 111
  • [47] Semantic Image Inpainting using Self-Learning Encoder-Decoder and Adversarial Loss
    Salem, Nermin M.
    Mahdi, Hani M. K.
    Abbas, Hazem
    PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 103 - 108
  • [48] CPFTransformer: transformer fusion context pyramid medical image segmentation network
    Li, Jiao
    Ye, Jinyu
    Zhang, Ruixin
    Wu, Yue
    Berhane, Gebremedhin Samuel
    Deng, Hongxia
    Shi, Hong
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [49] Perceptual Intra Video Encoder for High-Quality High-Definition Content
    Martinez-Rach, Miguel
    Lopez-Granado, Otoniel
    Pinol, Pablo
    Malumbres, Manuel P.
    2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 509 - 509
  • [50] Learning Multi-Scale Deep Image Prior for High-Quality Unsupervised Image Denoising
    Jiang, Hao
    Zhang, Qing
    Nie, Yongwei
    Zhu, Lei
    Zheng, Wei-Shi
    COMPUTER GRAPHICS FORUM, 2022, 41 (07) : 323 - 334