Semantic Layout Manipulation With High-Resolution Sparse Attention

被引:1
|
作者
Zheng, Haitian [1 ]
Lin, Zhe [2 ]
Lu, Jingwan [2 ]
Cohen, Scott [2 ]
Zhang, Jianming [2 ]
Xu, Ning [2 ]
Luo, Jiebo [1 ]
机构
[1] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA
[2] Adobe Res, San Jose, CA 95110 USA
关键词
Layout; Semantics; Visualization; Task analysis; Image synthesis; Computational modeling; Generators; Image manipulation and editing; image synthesis; correspondence learning; inpainting; TO-IMAGE TRANSLATION;
D O I
10.1109/TPAMI.2022.3181587
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We tackle the problem of semantic image layout manipulation, which aims to manipulate an input image by editing its semantic label map. A core problem of this task is how to transfer visual details from the input images to the new semantic layout while making the resulting image visually realistic. Recent work on learning cross-domain correspondence has shown promising results for global layout transfer with dense attention-based warping. However, this method tends to lose texture details due to the resolution limitation and the lack of smoothness constraint on correspondence. To adapt this paradigm for the layout manipulation task, we propose a high-resolution sparse attention module that effectively transfers visual details to new layouts at a resolution up to 512x512. To further improve visual quality, we introduce a novel generator architecture consisting of a semantic encoder and a two-stage decoder for coarse-to-fine synthesis. Experiments on the ADE20k and Places365 datasets demonstrate that our proposed approach achieves substantial improvements over the existing inpainting and layout manipulation methods.
引用
收藏
页码:3768 / 3782
页数:15
相关论文
共 50 条
  • [1] SEMANTIC SEGMENTATION OF HIGH-RESOLUTION REMOTE SENSING IMAGES BASED ON SPARSE SELF-ATTENTION
    Sun, Li
    Zou, Huanxin
    Wei, Juan
    Li, Meilin
    Cao, Xu
    He, Shitian
    Liu, Shuo
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3492 - 3495
  • [2] High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
    Wang, Ting-Chun
    Liu, Ming-Yu
    Zhu, Jun-Yan
    Tao, Andrew
    Kautz, Jan
    Catanzaro, Bryan
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8798 - 8807
  • [3] Semantic Segmentation of High-Resolution Remote Sensing Images Based on Sparse Self-Attention and Feature Alignment
    Sun, Li
    Zou, Huanxin
    Wei, Juan
    Cao, Xu
    He, Shitian
    Li, Meilin
    Liu, Shuo
    [J]. REMOTE SENSING, 2023, 15 (06)
  • [4] Lightweight Attention Network for Very High-Resolution Image Semantic Segmentation
    Guan, Renchu
    Wang, Mingming
    Bruzzone, Lorenzo
    Zhao, Haishi
    Yang, Chen
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] Integrating Gate and Attention Modules for High-Resolution Image Semantic Segmentation
    Zheng, Zixian
    Zhang, Xueliang
    Xiao, Pengfeng
    Li, Zhenshi
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 4530 - 4546
  • [6] Board layout & high-resolution ADCs
    Data Conversion Division, National Semiconductor Europe
    [J]. Electron World, 2007, 1855 (34-36):
  • [7] Board layout & high-resolution ADCs
    McCormack, Paul
    [J]. ELECTRONICS WORLD, 2007, 113 (1855): : 34 - 36
  • [8] A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation
    Zuo, Renxiang
    Zhang, Guangyun
    Zhang, Rongting
    Jia, Xiuping
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [9] AANet: Adaptive Attention Networks for Semantic Segmentation of High-Resolution Remote Sensing Imagery
    Chen, Yan
    Zhang, Qianchuan
    Wang, Xiaofeng
    Dong, Quan
    Kang, Menglei
    Jiang, Wenxiang
    Wang, Mengyuan
    Xu, Lixiang
    Zhang, Chen
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 14640 - 14655
  • [10] Semantic segmentation of high-resolution images
    Wang, Juhong
    Liu, Bin
    Xu, Kun
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2017, 60 (12) : 123101:1 - 123101:6