DeFLOCNet: Deep Image Editing via Flexible Low-level Controls

被引:12
|
作者
Liu, Hongyu [1 ]
Wan, Ziyu [2 ]
Huang, Wei [3 ]
Song, Yibing [4 ]
Han, Xintong [1 ]
Liao, Jing [2 ]
Jiang, Bin [3 ]
Liu, Wei [5 ]
机构
[1] Huya Inc, Guangzhou, Peoples R China
[2] City Univ Hong Kong, Hong Kong, Peoples R China
[3] Hunan Univ, Changsha, Peoples R China
[4] Tencent AI Lab, Shenzhen, Peoples R China
[5] Tencent Data Platform, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR46437.2021.01062
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
User-intended visual content fills the hole regions of an input image in the image editing scenario. The coarse low-level inputs, which typically consist of sparse sketch lines and color dots, convey user intentions for content creation (i.e., free-form editing). While existing methods combine an input image and these low-level controls for CNN inputs, the corresponding feature representations are not sufficient to convey user intentions, leading to unfaithfully generated content. In this paper, we propose DeFLOCNet which relies on a deep encoder-decoder CNN to retain the guidance of these controls in the deep feature representations. In each skip-connection layer, we design a structure generation block. Instead of attaching low-level controls to an input image, we inject these controls directly into each structure generation block for sketch line refinement and color propagation in the CNN feature space. We then concatenate the modulated features with the original decoder features for structure generation. Meanwhile, DeFLOCNet involves another decoder branch for texture generation and detail enhancement. Both structures and textures are rendered in the decoder, leading to user-intended editing results. Experiments on benchmarks demonstrate that DeFLOCNet effectively transforms different user intentions to create visually pleasing content.
引用
收藏
页码:10760 / 10769
页数:10
相关论文
共 50 条
  • [21] Controlling low-level image properties: The SHINE toolbox
    Verena Willenbockel
    Javid Sadr
    Daniel Fiset
    Greg O. Horne
    Frédéric Gosselin
    James W. Tanaka
    Behavior Research Methods, 2010, 42 : 671 - 684
  • [22] LOW-LEVEL PROCESSING TECHNIQUES IN GEOPHYSICAL IMAGE INTERPRETATION
    ROBERTO, V
    PERON, A
    FUMIS, PL
    PATTERN RECOGNITION LETTERS, 1989, 10 (02) : 111 - 122
  • [23] IMAGE SEGMENTATION SCHEMA FOR LOW-LEVEL COMPUTER VISION
    ASANO, T
    YOKOYA, N
    PATTERN RECOGNITION, 1981, 14 (1-6) : 267 - 273
  • [24] Medical Image Fusion Based on Low-Level Features
    Zhang, Yongxin
    Guo, Chenrui
    Zhao, Peng
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [25] Image Saliency Detection with Low-Level Features Enhancement
    Zhao, Ting
    Wu, Xiangqian
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 408 - 419
  • [26] Low-Level Image Features for Stamps Detection and Classification
    Forczmanski, Pawel
    Markiewicz, Andrzej
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 383 - 392
  • [27] A low-level image processing algorithms accelerator platform
    Saldana, Griselda
    Arias-Estrada, Miguel
    18TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (CONIELECOMP 2008), PROCEEDINGS, 2008, : 117 - +
  • [28] NEW LOW-LEVEL PROCEDURE FOR IMAGE SEGMENTATION.
    Wechsler, Harry
    1978, 7 (01): : 120 - 129
  • [29] Mapping low-level image features to semantic concepts
    Stan, D
    Sethi, IK
    STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2001, 2001, 4315 : 172 - 179
  • [30] Controlling low-level image properties: The SHINE toolbox
    Willenbockel, Verena
    Sadr, Javid
    Fiset, Daniel
    Horne, Greg O.
    Gosselin, Frederic
    Tanaka, James W.
    BEHAVIOR RESEARCH METHODS, 2010, 42 (03) : 671 - 684