DeFLOCNet: Deep Image Editing via Flexible Low-level Controls

被引:12
|
作者
Liu, Hongyu [1 ]
Wan, Ziyu [2 ]
Huang, Wei [3 ]
Song, Yibing [4 ]
Han, Xintong [1 ]
Liao, Jing [2 ]
Jiang, Bin [3 ]
Liu, Wei [5 ]
机构
[1] Huya Inc, Guangzhou, Peoples R China
[2] City Univ Hong Kong, Hong Kong, Peoples R China
[3] Hunan Univ, Changsha, Peoples R China
[4] Tencent AI Lab, Shenzhen, Peoples R China
[5] Tencent Data Platform, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR46437.2021.01062
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
User-intended visual content fills the hole regions of an input image in the image editing scenario. The coarse low-level inputs, which typically consist of sparse sketch lines and color dots, convey user intentions for content creation (i.e., free-form editing). While existing methods combine an input image and these low-level controls for CNN inputs, the corresponding feature representations are not sufficient to convey user intentions, leading to unfaithfully generated content. In this paper, we propose DeFLOCNet which relies on a deep encoder-decoder CNN to retain the guidance of these controls in the deep feature representations. In each skip-connection layer, we design a structure generation block. Instead of attaching low-level controls to an input image, we inject these controls directly into each structure generation block for sketch line refinement and color propagation in the CNN feature space. We then concatenate the modulated features with the original decoder features for structure generation. Meanwhile, DeFLOCNet involves another decoder branch for texture generation and detail enhancement. Both structures and textures are rendered in the decoder, leading to user-intended editing results. Experiments on benchmarks demonstrate that DeFLOCNet effectively transforms different user intentions to create visually pleasing content.
引用
收藏
页码:10760 / 10769
页数:10
相关论文
共 50 条
  • [1] Connecting Low-Level Image Processing and High-Level Vision via Deep Learning
    Liu, Ding
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5775 - 5776
  • [2] Survey on low-level controllable image synthesis with deep learning
    Zhang, Shixiong
    Li, Jiao
    Yang, Lu
    ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (12): : 7385 - 7426
  • [3] Boosting image denoising effect via low-level noise injection
    Jian Xiao
    Xiaohui Cheng
    Shaoping Xu
    Wuyong Tao
    Yanyang Xiao
    Signal, Image and Video Processing, 2024, 18 : 1053 - 1067
  • [4] Boosting image denoising effect via low-level noise injection
    Xiao, Jian
    Cheng, Xiaohui
    Xu, Shaoping
    Tao, Wuyong
    Xiao, Yanyang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (02) : 1053 - 1067
  • [5] Saliency detection via integrating deep learning architecture and low-level features
    Chi, Jianning
    Wu, Chengdong
    Yu, Xiaosheng
    Chu, Hao
    Ji, Peng
    NEUROCOMPUTING, 2019, 352 : 75 - 92
  • [6] Phase congruency: A low-level image invariant
    Peter Kovesi
    Psychological Research, 2000, 64 : 136 - 148
  • [7] NEW LOW-LEVEL PROCEDURE FOR IMAGE SEGMENTATION
    WECHSLER, H
    COMPUTER GRAPHICS AND IMAGE PROCESSING, 1978, 7 (01): : 120 - 129
  • [8] The role of symmetry in low-level image segmentation
    Carlin, P.
    Watt, R.
    PERCEPTION, 1995, 24 : 130 - 131
  • [9] Low-level image properties in facial expressions
    Menzel, Claudia
    Redies, Christoph
    Hayn-Leichsenring, Gregor U.
    ACTA PSYCHOLOGICA, 2018, 188 : 74 - 83
  • [10] Image detection under low-level illumination
    Sequeira, Raul E.
    Gubner, John A.
    Saleh, Bahaa E. A.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 1993, 2 (01) : 18 - 26