End-to-End Photo-Sketch Generation via Fully Convolutional Representation Learning

被引:106
|
作者
Zhang, Liliang [1 ]
Lin, Liang [1 ]
Wu, Xian [1 ]
Ding, Shengyong [1 ]
Zhang, Lei [2 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Guangdong, Peoples R China
[2] Hong Kong Polytech Univ, Hong Kong, Hong Kong, Peoples R China
关键词
Sketch-photo generation; face verification; neural nets;
D O I
10.1145/2671188.2749321
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sketch-based face recognition is an interesting task in vision and multimedia research, yet it is quite challenging due to the great difference between face photos and sketches. In this paper, we propose a novel approach for photo-sketch generation, aiming to automatically transform face photos into detail-preserving personal sketches. Unlike the traditional models synthesizing sketches based on a dictionary of exemplars, we develop a fully convolutional network to learn the end-to-end photo-sketch mapping. Our approach takes whole face photos as inputs and directly generates the corresponding sketch images with efficient inference and learning, in which the architecture is stacked by only convolutional kernels of very small sizes. To well capture the person identity during the photo-sketch transformation, we define our optimization objective in the form of joint generative-discriminative minimization. In particular, a discriminative regularization term is incorporated into the photo-sketch generation, enhancing the discriminability of the generated person sketches against other individuals. Extensive experiments on several standard benchmarks suggest that our approach outperforms other state-of-the-arts in both photosketch generation and face sketch verification.
引用
收藏
页码:627 / 634
页数:8
相关论文
共 50 条
  • [1] End-to-End Deep Sketch-to-Photo Matching Enforcing Realistic Photo Generation
    Capozzi, Leonardo
    Pinto, Joao Ribeiro
    Cardoso, Jaime S.
    Rebelo, Ana
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2021, 2021, 12702 : 451 - 460
  • [2] End-to-End Blood Pressure Prediction via Fully Convolutional Networks
    Baek, Sanghyun
    Jang, Jiyong
    Yoon, Sungroh
    [J]. IEEE ACCESS, 2019, 7 : 185458 - 185468
  • [3] Face photo-sketch portraits transformation via generation pipeline
    Guo, Mengsi
    Xiong, Mingfu
    Huang, Jin
    Hu, Xinrong
    Peng, Tao
    [J]. VISUAL COMPUTER, 2024,
  • [4] End-to-End Object Detection with Fully Convolutional Network
    Wang, Jianfeng
    Song, Lin
    Li, Zeming
    Sun, Hongbin
    Sun, Jian
    Zheng, Nanning
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15844 - 15853
  • [5] END-TO-END BINARY REPRESENTATION LEARNING VIA DIRECT BINARY EMBEDDING
    Liu, Liu
    Rahimpour, Alireza
    Taalimi, Ali
    Qi, Hairong
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1257 - 1261
  • [6] End-to-End Efficient Representation Learning via Cascading Combinatorial Optimization
    Jeong, Yeonwoo
    Kim, Yoonsung
    Song, Hyun Oh
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11371 - 11379
  • [7] DetPS: A Fully Convolutional End-to-end Parking Slot Detector
    Wang, Yinan
    Guan, Yingzhou
    Cao, Rongchuan
    [J]. 2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1051 - 1056
  • [8] Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
    Junho Jo
    Hyung Il Koo
    Jae Woong Soh
    Nam Ik Cho
    [J]. Multimedia Tools and Applications, 2020, 79 : 32137 - 32150
  • [9] Leukocyte Segmentation via End-to-End Learning of Deep Convolutional Neural Networks
    Lu, Yan
    Fan, Haoyi
    Li, Zuoyong
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 191 - 200
  • [10] Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
    Jo, Junho
    Koo, Hyung Il
    Soh, Jae Woong
    Cho, Nam Ik
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32137 - 32150