DeFLOCNet: Deep Image Editing via Flexible Low-level Controls

被引：12

作者：

Liu, Hongyu ^{[1
]}

Wan, Ziyu ^{[2
]}

Huang, Wei ^{[3
]}

Song, Yibing ^{[4
]}

Han, Xintong ^{[1
]}

Liao, Jing ^{[2
]}

Jiang, Bin ^{[3
]}

Liu, Wei ^{[5
]}

机构：

[1] Huya Inc, Guangzhou, Peoples R China

[2] City Univ Hong Kong, Hong Kong, Peoples R China

[3] Hunan Univ, Changsha, Peoples R China

[4] Tencent AI Lab, Shenzhen, Peoples R China

[5] Tencent Data Platform, Shenzhen, Peoples R China

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/CVPR46437.2021.01062

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

User-intended visual content fills the hole regions of an input image in the image editing scenario. The coarse low-level inputs, which typically consist of sparse sketch lines and color dots, convey user intentions for content creation (i.e., free-form editing). While existing methods combine an input image and these low-level controls for CNN inputs, the corresponding feature representations are not sufficient to convey user intentions, leading to unfaithfully generated content. In this paper, we propose DeFLOCNet which relies on a deep encoder-decoder CNN to retain the guidance of these controls in the deep feature representations. In each skip-connection layer, we design a structure generation block. Instead of attaching low-level controls to an input image, we inject these controls directly into each structure generation block for sketch line refinement and color propagation in the CNN feature space. We then concatenate the modulated features with the original decoder features for structure generation. Meanwhile, DeFLOCNet involves another decoder branch for texture generation and detail enhancement. Both structures and textures are rendered in the decoder, leading to user-intended editing results. Experiments on benchmarks demonstrate that DeFLOCNet effectively transforms different user intentions to create visually pleasing content.

引用

页码：10760 / 10769

页数：10

共 50 条

[21] Controlling low-level image properties: The SHINE toolbox
Verena Willenbockel
Javid Sadr
Daniel Fiset
Greg O. Horne
Frédéric Gosselin
James W. Tanaka
Behavior Research Methods, 2010, 42 : 671 - 684
[22] LOW-LEVEL PROCESSING TECHNIQUES IN GEOPHYSICAL IMAGE INTERPRETATION
ROBERTO, V
PERON, A
FUMIS, PL
PATTERN RECOGNITION LETTERS, 1989, 10 (02) : 111 - 122
[23] IMAGE SEGMENTATION SCHEMA FOR LOW-LEVEL COMPUTER VISION
ASANO, T
YOKOYA, N
PATTERN RECOGNITION, 1981, 14 (1-6) : 267 - 273
[24] Medical Image Fusion Based on Low-Level Features
Zhang, Yongxin
Guo, Chenrui
Zhao, Peng
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
[25] Image Saliency Detection with Low-Level Features Enhancement
Zhao, Ting
Wu, Xiangqian
PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 408 - 419
[26] Low-Level Image Features for Stamps Detection and Classification
Forczmanski, Pawel
Markiewicz, Andrzej
PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 383 - 392
[27] A low-level image processing algorithms accelerator platform
Saldana, Griselda
Arias-Estrada, Miguel
18TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (CONIELECOMP 2008), PROCEEDINGS, 2008, : 117 - +
[28] NEW LOW-LEVEL PROCEDURE FOR IMAGE SEGMENTATION.
Wechsler, Harry
1978, 7 (01): : 120 - 129
[29] Mapping low-level image features to semantic concepts
Stan, D
Sethi, IK
STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2001, 2001, 4315 : 172 - 179
[30] Controlling low-level image properties: The SHINE toolbox
Willenbockel, Verena
Sadr, Javid
Fiset, Daniel
Horne, Greg O.
Gosselin, Frederic
Tanaka, James W.
BEHAVIOR RESEARCH METHODS, 2010, 42 (03) : 671 - 684

← 1 2 3 4 5 →