Generative Memory-Guided Semantic Reasoning Model for Image Inpainting

被引:8
|
作者
Feng, Xin [1 ]
Pei, Wenjie [1 ]
Li, Fengjun [1 ]
Chen, Fanglin [1 ]
Zhang, David [2 ]
Lu, Guangming [1 ]
机构
[1] Harbin Inst Technol Shenzhen, Dept Comp Sci, Shenzhen 518057, Peoples R China
[2] Chinese Univ Hong Kong Shenzhen, Sch Sci & Engn, Shenzhen 518172, Peoples R China
关键词
Semantics; Cognition; Decoding; Training; Image restoration; Image edge detection; Visualization; Image inpainting; generative memory; image synthesis; semantic reasoning;
D O I
10.1109/TCSVT.2022.3188169
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The critical challenge of single image inpainting stems from accurate semantic inference via limited information while maintaining image quality. Typical methods for semantic image inpainting train an encoder-decoder network by learning a one-to-one mapping from the corrupted image to the inpainted version. While such methods perform well on images with small corrupted regions, it is challenging for these methods to deal with images with large corrupted area due to two potential limitations. 1) Such one-to-one mapping paradigm tends to overfit each single training pair of images; 2) The inter-image prior knowledge about the general distribution patterns of visual semantics, which can be transferred across images sharing similar semantics, is not explicitly exploited. In this paper, we propose the Generative Memory-guided Semantic Reasoning Model (GM-SRM), which infers the content of corrupted regions based on not only the known regions of the corrupted image, but also the learned inter-image reasoning priors characterizing the generalizable semantic distribution patterns between similar images. In particular, the proposed GM-SRM first pre-learns a generative memory from the whole training data to explicitly learn the distribution of different semantic patterns. Then the learned memory are leveraged to retrieve the matching semantics for the current corrupted image to perform semantic reasoning during image inpainting. While the encoder-decoder network is used for guaranteeing the pixel-level content consistency, our generative priors are favorable for performing high-level semantic reasoning, which is particularly effective for inferring semantic content for large corrupted area. Extensive experiments on Paris Street View, CelebA-HQ, and Places2 benchmarks demonstrate that our GM-SRM outperforms the state-of-the-art methods for image inpainting in terms of both visual quality and quantitative metrics.
引用
收藏
页码:7432 / 7447
页数:16
相关论文
共 50 条
  • [1] Progressive Semantic Reasoning for Image Inpainting
    Jin, Junjie
    Hu, Xinrong
    He, Kai
    Peng, Tao
    Liu, Junping
    Yang, Jie
    [J]. WEB CONFERENCE 2021: COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2021), 2021, : 68 - 76
  • [2] Semantic Image Inpainting with Multi-Stage Feature Reasoning Generative Adversarial Network
    Li, Guangyao
    Li, Liangfu
    Pu, Yingdan
    Wang, Nan
    Zhang, Xi
    [J]. SENSORS, 2022, 22 (08)
  • [3] Semantic Image Inpainting with Deep Generative Models
    Yeh, Raymond A.
    Chen, Chen
    Lim, Teck Yian
    Schwing, Alexander G.
    Hasegawa-Johnson, Mark
    Do, Minh N.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6882 - 6890
  • [4] Semantic Image Inpainting with Progressive Generative Networks
    Zhang, Haoran
    Hu, Zhenzhen
    Luo, Changzhi
    Zuo, Wangmeng
    Wang, Meng
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1939 - 1947
  • [5] Semantic image inpainting based on Generative Adversarial Networks
    Wu, Chugang
    Xian, Yanhua
    Bai, Junqi
    Jing, Yuancheng
    [J]. 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING (ICAICE 2020), 2020, : 276 - 280
  • [6] Memory-guided Unsupervised Image-to-image Translation
    Jeong, Somi
    Kim, Youngjung
    Lee, Eungbean
    Sohn, Kwanghoon
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6554 - 6563
  • [7] Semantic face image inpainting based on Generative Adversarial Network
    Zhang, Heshu
    Li, Tao
    [J]. 2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 530 - 535
  • [8] Edge-Guided Generative Adversarial Network for Image Inpainting
    Xu, Shunxin
    Liu, Dong
    Xiong, Zhiwei
    [J]. 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [9] Memory-Guided Semantic Learning Network for Temporal Sentence Grounding
    Liu, Daizong
    Qu, Xiaoye
    Di, Xing
    Cheng, Yu
    Xu, Zichuan
    Zhou, Pan
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1665 - 1673
  • [10] Semantic Image Inpainting through Improved Wasserstein Generative Adversarial Networks
    Vitoria, Patricia
    Sintes, Joan
    Ballester, Coloma
    [J]. VISAPP: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4, 2019, : 249 - 260