Semantic-Guided Inpainting Network for Complex Urban Scenes Manipulation

被引:7
|
作者
Ardino, Pierfrancesco [1 ,2 ]
Liu, Yahui [1 ,2 ]
Ricci, Elisa [1 ,2 ]
Lepri, Bruno [1 ]
de Nadai, Marco [1 ]
机构
[1] FBK, Trento, Italy
[2] Univ Trento, Trento, Italy
关键词
D O I
10.1109/ICPR48806.2021.9412690
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Manipulating images of complex scenes to reconstruct, insert and/or remove specific object instances is a challenging task. Complex scenes contain multiple semantics and objects, which are frequently cluttered or ambiguous, thus hampering the performance of inpainting models. Conventional techniques often rely on structural information such as object contours in multi-stage approaches that generate unreliable results and boundaries. In this work, we propose a novel deep learning model to alter a complex urban scene by removing a user-specified portion of the image and coherently inserting a new object (e.g. car or pedestrian) in that scene. Inspired by recent works on image inpainting, our proposed method leverages the semantic segmentation to model the content and structure of the image, and learn the best shape and location of the object to insert. To generate reliable results, we design a new decoder block that combines the semantic segmentation and generation task to guide better the generation of new objects and scenes, which have to be semantically consistent with the image. Our experiments, conducted on two large-scale datasets of urban scenes (Cityscapes and Indian Driving), show that our proposed approach successfully address the problem of semantically-guided inpainting of complex urban scene.
引用
收藏
页码:9280 / 9287
页数:8
相关论文
共 50 条
  • [1] Semantic-Guided Completion Network for Video Inpainting in Complex Urban Scene
    Wang, Jianan
    Xuan, Hanyu
    Wu, Zhiliang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XI, 2024, 14435 : 224 - 236
  • [2] Semantic-guided face inpainting with subspace pyramid aggregation☆
    Li, Yaqian
    Zhang, Xiumin
    Xiao, Cunjun
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 108
  • [3] Contrastive Semantic-Guided Image Smoothing Network
    Wang, Jie
    Wang, Yongzhen
    Feng, Yidan
    Gong, Lina
    Yan, Xuefeng
    Xie, Haoran
    Wang, Fu Lee
    Wei, Mingqiang
    COMPUTER GRAPHICS FORUM, 2022, 41 (07) : 335 - 346
  • [4] Global semantic-guided network for saliency prediction
    Xie, Jiawei
    Liu, Zhi
    Li, Gongyang
    Lu, Xiaofeng
    Chen, Tao
    KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [5] A Sub-captions Semantic-Guided Network for Image Captioning
    Tian, Wei-Dong
    Zhu, Jun-jun
    Wu, Shuang
    Zhao, Zhong-Qiu
    Zhang, Yu-Zheng
    Zhang, Tian-yu
    INTELLIGENT COMPUTING METHODOLOGIES, PT III, 2022, 13395 : 367 - 379
  • [6] Semantic-Guided Transformer Network for Crop Classification in Hyperspectral Images
    Pi, Weiqiang
    Zhang, Tao
    Wang, Rongyang
    Ma, Guowei
    Wang, Yong
    Du, Jianmin
    JOURNAL OF IMAGING, 2025, 11 (02)
  • [7] SGFNet: Semantic-Guided Fusion Network for RGB-Thermal Semantic Segmentation
    WangLi, Yike
    Li, Gongyang
    Liu, Zhi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7737 - 7748
  • [8] Semantic-guided complementary fusion network for salient object detection
    Yang, Kunqian
    He, Caitou
    NEUROCOMPUTING, 2025, 622
  • [9] Semantic-guided graph neural network for heterogeneous graph embedding
    Han, Mingjing
    Zhang, Han
    Li, Wei
    Yin, Yanbin
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [10] Label embedding semantic-guided hashing
    Long, Jun
    Sun, Longzhi
    Guo, Lin
    Hua, Liujie
    Yang, Zhan
    NEUROCOMPUTING, 2022, 477 : 1 - 13