Language-based Colorization of Scene Sketches

被引:46
|
作者
Zou, Changqing [1 ,2 ]
Mo, Haoran [1 ]
Gao, Chengying [1 ]
Du, Ruofei [3 ]
Fu, Hongbo [4 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Guangdong, Peoples R China
[2] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
[3] Google, Mountain View, CA 94043 USA
[4] City Univ Hong Kong, Hong Kong, Peoples R China
来源
ACM TRANSACTIONS ON GRAPHICS | 2019年 / 38卷 / 06期
关键词
Deep Neural Networks; Image Segmentation; Language-based Editing; Scene Sketch; Sketch Colorization;
D O I
10.1145/3355089.3356561
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Being natural, touchless, and fun-embracing, language-based inputs have been demonstrated effective for various tasks from image generation to literacy education for children. This paper for the first time presents a language-based system for interactive colorization of scene sketches, based on semantic comprehension. The proposed system is built upon deep neural networks trained on a large-scale repository of scene sketches and cartoonstyle color images with text descriptions. Given a scene sketch, our system allows users, via language-based instructions, to interactively localize and colorize specific foreground object instances to meet various colorization requirements in a progressive way. We demonstrate the effectiveness of our approach via comprehensive experimental results including alternative studies, comparison with the state-of-the-art methods, and generalization user studies. Given the unique characteristics of language-based inputs, we envision a combination of our interface with a traditional scribble-based interface for a practical multimodal colorization system, benefiting various applications. The dataset and source code can be found at https://github. com/SketchyScene/SketchySceneColorization.
引用
下载
收藏
页数:16
相关论文
共 50 条
  • [1] L-CoIns: Language-based Colorization with Instance Awareness
    Chang, Zheng
    Weng, Shuchen
    Zhang, Peixuan
    Li, Yu
    Li, Si
    Shi, Boxin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19221 - 19230
  • [2] A semantic and language-based representation of an environmental scene
    Jean-Marie Le Yaouanc
    Éric Saux
    Christophe Claramunt
    GeoInformatica, 2010, 14 : 333 - 352
  • [3] Automatic Colorization Algorithm with Anime Effect for Scene Sketches
    Zhu S.
    Chen Z.
    Ye D.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2020, 33 (08): : 671 - 680
  • [4] A semantic and language-based representation of an environmental scene
    Le Yaouanc, Jean-Marie
    Saux, Eric
    Claramunt, Christophe
    GEOINFORMATICA, 2010, 14 (03) : 333 - 352
  • [5] L-CoDer: Language-Based Colorization with Color-Object Decoupling Transformer
    Chang, Zheng
    Weng, Shuchen
    Li, Yu
    Li, Si
    Shi, Boxin
    COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 360 - 375
  • [6] L-CoDe: Language-based Colorization Using Color-object Decoupled Conditions
    Weng, Shuchen
    Wu, Hao
    Chang, Zheng
    Tang, Jiajun
    Li, Si
    Shi, Boxin
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2677 - 2684
  • [7] L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors
    Chang, Zheng
    Weng, Shuchen
    Zhang, Peixuan
    Li, Yu
    Li, Si
    Shi, Boxin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Language-Based Medicine
    Kolla, Avani M.
    ACADEMIC MEDICINE, 2022, 97 (02) : 207 - 207
  • [9] Language-Based Hypervisors
    Budianto, Enrico
    Chow, Richard
    Ding, Jonathan
    McCool, Michael
    CRYPTOLOGY AND NETWORK SECURITY, CANS 2016, 2016, 10052 : 731 - 736
  • [10] Language-based hypervisors
    Budianto, Enrico
    Chow, Richard
    Ding, Jonathan
    McCool, Michael
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, 10052 LNCS : 731 - 736