DoodleFormer: Creative Sketch Drawing with Transformers

被引:6
|
作者
Bhunia, Ankan Kumar [1 ]
Khan, Salman [1 ,2 ]
Cholakkal, Hisham [1 ]
Anwer, Rao Muhammad [1 ,3 ]
Khan, Fahad Shahbaz [1 ,4 ]
Laaksonen, Jorma [3 ]
Felsberg, Michael [4 ]
机构
[1] Mohamed bin Zayed Univ AI, Abu Dhabi, U Arab Emirates
[2] Australian Natl Univ, Canberra, ACT, Australia
[3] Aalto Univ, Espoo, Finland
[4] Linkoping Univ, Linkoping, Sweden
来源
关键词
D O I
10.1007/978-3-031-19790-1_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Creative sketching or doodling is an expressive activity, where imaginative and previously unseen depictions of everyday visual objects are drawn. Creative sketch image generation is a challenging vision problem, where the task is to generate diverse, yet realistic creative sketches possessing the unseen composition of the visual-world objects. Here, we propose a novel coarse-to-fine two-stage framework, DoodleFormer, that decomposes the creative sketch generation problem into the creation of coarse sketch composition followed by the incorporation of fine-details in the sketch. We introduce graph-aware transformer encoders that effectively capture global dynamic as well as local static structural relations among different body parts. To ensure diversity of the generated creative sketches, we introduce a probabilistic coarse sketch decoder that explicitly models the variations of each sketch body part to be drawn. Experiments are performed on two creative sketch datasets: Creative Birds and Creative Creatures. Our qualitative, quantitative and human-based evaluations show that DoodleFormer outperforms the state-of-the-art on both datasets, yielding realistic and diverse creative sketches. On Creative Creatures, DoodleFormer achieves an absolute gain of 25 in Frechet inception distance (FID) over state-of-the-art. We also demonstrate the effectiveness of DoodleFormer for related applications of text to creative sketch generation, sketch completion and house layout generation. Code is available at: https://github.com/ ankanbhunia/doodleformer.
引用
收藏
页码:338 / 355
页数:18
相关论文
共 50 条
  • [1] Robert Adam - The creative mind: From the sketch to the finished drawing
    Ledes, AE
    [J]. MAGAZINE ANTIQUES, 1998, 153 (01): : 30 - 30
  • [2] Drawing The sign and the sketch
    Salimei, Guendalina
    [J]. DISEGNARE IDEE IMMAGINI-IDEAS IMAGES, 2023, 34 (66): : 9 - 15
  • [3] Creative Society in Lithuania: a Sketch of the Conception of the Creative
    Stasiulis, Nerijus
    [J]. LOGOS-VILNIUS, 2015, (82): : 16 - 23
  • [4] Generation of pencil sketch drawing
    Lin, Hwei Jen
    Li, Yue-Sheng
    [J]. 2013 INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS), 2013,
  • [5] CREATIVE WORK IN DRAWING
    Sargent, Walter
    [J]. EDUCATION, 1932, 52 (07): : 410 - 413
  • [6] Creative drawing methods
    Cervellini, Franco
    [J]. DISEGNARE IDEE IMMAGINI-IDEAS IMAGES, 2012, 23 (45): : 56 - 65
  • [7] Sketch Drawing by NAO Humanoid Robot
    Singh, Avinash Kumar
    Chakraborty, Pavan
    Nandi, G. C.
    [J]. TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
  • [8] POSSIBLE SOURCE FOR A BLAKE SKETCH AND DRAWING
    GRANT, PB
    [J]. BLAKE NEWSLETTER, 1977, 10 (03): : 85 - 87
  • [9] Dare To Sketch:.A Guide to Drawing on the Go
    Halliday, Heather
    [J]. LIBRARY JOURNAL, 2017, 142 (19) : 80 - 80
  • [10] Discussion on the Experimental Creative Sketch Teaching
    Li, Hui
    [J]. PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON ECONOMY, MANAGEMENT, LAW AND EDUCATION (EMLE 2016), 2016, 20 : 466 - 468