Semantic Layout Manipulation With High-Resolution Sparse Attention

被引:1
|
作者
Zheng, Haitian [1 ]
Lin, Zhe [2 ]
Lu, Jingwan [2 ]
Cohen, Scott [2 ]
Zhang, Jianming [2 ]
Xu, Ning [2 ]
Luo, Jiebo [1 ]
机构
[1] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA
[2] Adobe Res, San Jose, CA 95110 USA
关键词
Layout; Semantics; Visualization; Task analysis; Image synthesis; Computational modeling; Generators; Image manipulation and editing; image synthesis; correspondence learning; inpainting; TO-IMAGE TRANSLATION;
D O I
10.1109/TPAMI.2022.3181587
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We tackle the problem of semantic image layout manipulation, which aims to manipulate an input image by editing its semantic label map. A core problem of this task is how to transfer visual details from the input images to the new semantic layout while making the resulting image visually realistic. Recent work on learning cross-domain correspondence has shown promising results for global layout transfer with dense attention-based warping. However, this method tends to lose texture details due to the resolution limitation and the lack of smoothness constraint on correspondence. To adapt this paradigm for the layout manipulation task, we propose a high-resolution sparse attention module that effectively transfers visual details to new layouts at a resolution up to 512x512. To further improve visual quality, we introduce a novel generator architecture consisting of a semantic encoder and a two-stage decoder for coarse-to-fine synthesis. Experiments on the ADE20k and Places365 datasets demonstrate that our proposed approach achieves substantial improvements over the existing inpainting and layout manipulation methods.
引用
收藏
页码:3768 / 3782
页数:15
相关论文
共 50 条
  • [41] High-resolution reflectivity inversion based on joint sparse representation
    Shi, Zhanzhan
    Zhou, Huailai
    Wang, Yuanjun
    Niu, Cong
    Huang, Rao
    [J]. ACTA GEOPHYSICA, 2019, 67 (06) : 1535 - 1550
  • [42] Efficient and expressive high-resolution image synthesis via variational autoencoder-enriched transformers with sparse attention mechanisms
    Tang, Bingyin
    Feng, Fan
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03)
  • [43] SPARSE SPATIAL ATTENTION NETWORK FOR SEMANTIC SEGMENTATION
    Liu, Mengyu
    Yin, Hujun
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 644 - 648
  • [44] Tamed Warping Network for High-Resolution Semantic Video Segmentation
    Li, Songyuan
    Feng, Junyi
    Li, Xi
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [45] Efficient Semantic-Guidance High-Resolution Video Matting
    Yu, Yue
    Li, Ding
    Yang, Yulin
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT I, 2024, 14495 : 143 - 154
  • [46] SEMANTIC LABELING FOR HIGH-RESOLUTION AERIAL IMAGES BASED ON THE DMFFNET
    Cao, Zhiying
    Diao, Wenhui
    Zhang, Yi
    Yan, Menglong
    Yu, Hongfeng
    Sun, Xian
    Fu, Kun
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 1021 - 1024
  • [47] Deep Learning Semantic Segmentation for High-Resolution Medical Volumes
    Toubal, Imad Eddine
    Duan, Ye
    Yang, Deshan
    [J]. 2020 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR): TRUSTED COMPUTING, PRIVACY, AND SECURING MULTIMEDIA, 2020,
  • [48] Semantic Labeling of High-Resolution Images Using EfficientUNets and Transformers
    Almarzouqi, Hasan
    Saoud, Lyes Saad
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [49] Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation
    Zhang, Bowen
    Liu, Yifan
    Tian, Zhi
    Shen, Chunhua
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [50] High-Resolution Semantic Segmentation of Woodland Fires Using Residual Attention UNet and Time Series of Sentinel-2
    Shirvani, Zeinab
    Abdi, Omid
    Goodman, Rosa C.
    [J]. REMOTE SENSING, 2023, 15 (05)