L-CoIns: Language-based Colorization with Instance Awareness

被引:3
|
作者
Chang, Zheng [1 ]
Weng, Shuchen [2 ,3 ]
Zhang, Peixuan [1 ]
Li, Yu [4 ]
Li, Si [1 ]
Shi, Boxin [2 ,3 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
[2] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[3] Peking Univ, Sch Comp Sci, Natl Engn Res Ctr Visual Technol, Beijing, Peoples R China
[4] Int Digital Econ Acad, Shenzhen, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Language-based colorization produces plausible colors consistent with the language description provided by the user. Recent studies introduce additional annotation to prevent color-object coupling and mismatch issues, but they still have difficulty in distinguishing instances corresponding to the same object words. In this paper, we propose a transformer-based framework to automatically aggregate similar image patches and achieve instance awareness without any additional knowledge. By applying our presented luminance augmentation and counter-color loss to break down the statistical correlation between luminance and color words, our model is driven to synthesize colors with better descriptive consistency. We further collect a dataset to provide distinctive visual characteristics and detailed language descriptions for multiple instances in the same image. Extensive experiments demonstrate our advantages of synthesizing visually pleasing and descriptionconsistent results of instance-aware colorization.
引用
收藏
页码:19221 / 19230
页数:10
相关论文
共 50 条
  • [1] Language-based Colorization of Scene Sketches
    Zou, Changqing
    Mo, Haoran
    Gao, Chengying
    Du, Ruofei
    Fu, Hongbo
    ACM TRANSACTIONS ON GRAPHICS, 2019, 38 (06):
  • [2] L-CoDer: Language-Based Colorization with Color-Object Decoupling Transformer
    Chang, Zheng
    Weng, Shuchen
    Li, Yu
    Li, Si
    Shi, Boxin
    COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 360 - 375
  • [3] L-CoDe: Language-based Colorization Using Color-object Decoupled Conditions
    Weng, Shuchen
    Wu, Hao
    Chang, Zheng
    Tang, Jiajun
    Li, Si
    Shi, Boxin
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2677 - 2684
  • [4] L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors
    Chang, Zheng
    Weng, Shuchen
    Zhang, Peixuan
    Li, Yu
    Li, Si
    Shi, Boxin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Language-Based Medicine
    Kolla, Avani M.
    ACADEMIC MEDICINE, 2022, 97 (02) : 207 - 207
  • [6] Language-Based Hypervisors
    Budianto, Enrico
    Chow, Richard
    Ding, Jonathan
    McCool, Michael
    CRYPTOLOGY AND NETWORK SECURITY, CANS 2016, 2016, 10052 : 731 - 736
  • [7] Language-based hypervisors
    Budianto, Enrico
    Chow, Richard
    Ding, Jonathan
    McCool, Michael
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, 10052 LNCS : 731 - 736
  • [8] Language-based security
    Abadi, M
    Morrisett, G
    Sabelfeld, A
    JOURNAL OF FUNCTIONAL PROGRAMMING, 2005, 15 : 129 - 129
  • [9] Language-based Decisions
    Bjorndahl, Adam
    Halpern, Joseph Y.
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2021, (335): : 55 - 67
  • [10] Captioning with Language-Based Attention
    Rajendra, Anshu
    Rajendra, Ritwik
    Mengshoel, Ole J.
    Zeng, Ming
    Haider, Momina
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 415 - 423