Spatially adaptive multi-scale contextual attention for image inpainting

被引:5
|
作者
Wang, Xueting [1 ,2 ]
Chen, Yiyan [1 ]
Yamasaki, Toshihiko [1 ]
机构
[1] Univ Tokyo, Dept Informat Commun & Engn, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan
[2] CyberAgent Inc, AI Lab, Shibuya Ku, Shibuya Scramble Sq 2-24-12, Tokyo, Japan
关键词
Image inpainting; Spatially adaptive; Contextual attention; Multi-scale attention;
D O I
10.1007/s11042-022-12489-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image inpainting is the task to fill missing regions of an image. Recently, researchers have achieved a great performance by using convolutional neural networks (CNNs) with the conventional patch-matching method. Existing methods compute the attention scores, which are based on the similarity of patches between the known and missing regions. Considering that patches at different spatial positions can convey different levels of detail, we propose a spatially adaptive multi-scale attention score that uses the patches of different scales to compute scores for each pixel at different positions. Through experiments on the Paris Street View and Places datasets, our proposal shows slight improvement compared with some related methods on the quantitative evaluation metrics commonly used in the existing methods. Moreover, we found that these quantitative metrics are not appropriate enough considering the subjective impressions of the generated images. Therefore, we conducted subjective evaluation through user study for comparison, which shows that our proposal has superiority of performance generating much more detailed and subjectively plausible images.
引用
收藏
页码:31831 / 31846
页数:16
相关论文
共 50 条
  • [41] MULTI-SCALE IMAGE INPAINTING WITH LABEL SELECTION BASED ON LOCAL STATISTICS
    Paredes, Daniel
    Rodriguez, Paul
    [J]. 2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [42] Multi-scale patch-GAN with edge detection for image inpainting
    Gang Chen
    Guipeng Zhang
    Zhenguo Yang
    Wenyin Liu
    [J]. Applied Intelligence, 2023, 53 : 3917 - 3932
  • [43] Multi-Scale Patch Partitioning for Image Inpainting Based on Visual Transformers
    Campana, Jose Luis Flores
    Decker, Luis Gustavo Lorgus
    Roberto e Souza, Marcos
    Maia, Helena de Almeida
    Pedrini, Helio
    [J]. 2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 180 - 185
  • [44] Multi-scale Gated Inpainting Network with Patch-Wise Spacial Attention
    Hu, Xinrong
    Jin, Junjie
    Xiong, Mingfu
    Liu, Junping
    Peng, Tao
    Zhang, Zili
    Chen, Jia
    He, Ruhan
    Qin, Xiao
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS: DASFAA 2021 INTERNATIONAL WORKSHOPS, 2021, 12680 : 169 - 184
  • [45] Multi-scale fire detection algorithm with adaptive attention
    Liang Y.
    Chen T.
    Zhang W.
    [J]. Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2024, 44 (01): : 91 - 101
  • [46] Multi-Scale Contextual Attention Based HDR Reconstruction of Dynamic Scenes
    Deng, Yipeng
    Liu, Qin
    Ikenaga, Takeshi
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2020), 2020, 11519
  • [47] STACKED MULTI-SCALE ATTENTION NETWORK FOR IMAGE COLORIZATION
    Jiang, Bin
    Xu, Fangqiang
    Xia, Jun
    Yang, Chao
    Huang, Wei
    Huang, Yun
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2225 - 2229
  • [48] Multi-Scale Context Attention Network for Image Retrieval
    Lou, Yihang
    Bai, Yan
    Wang, Shiqi
    Duan, Ling-Yu
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1128 - 1136
  • [49] MSANet: Multi-scale attention networks for image classification
    Ping Cao
    Fangxin Xie
    Shichao Zhang
    Zuping Zhang
    Jianfeng Zhang
    [J]. Multimedia Tools and Applications, 2022, 81 : 34325 - 34344
  • [50] MSANet: Multi-scale attention networks for image classification
    Cao, Ping
    Xie, Fangxin
    Zhang, Shichao
    Zhang, Zuping
    Zhang, Jianfeng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34325 - 34344