Facial Image Augmentation from Sparse Line Features Using Small Training Data

被引:3
|
作者
Hung, Shih-Kai [1 ]
Gan, John Q. [1 ]
机构
[1] Univ Essex, Sch Comp Sci & Elect Engn, Colchester, Essex, England
关键词
Data augmentation; Convolutional neural networks; Generative adversarial networks (GANs);
D O I
10.1007/978-3-030-85030-2_45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data collection is expensive in many research fields. Data augmentation from a very small dataset, such as synthesising realistic images from limited or incomplete information available from a small number of sample images, is still an enormous challenge using deep convolutional neural networks that traditionally require a large number of training data to achieve reasonable performance. For the purpose of manipulating the synthetic results with diversity, line features, which can be easily obtained through computer vision, hand-drawn lines, or customerdesigned sketches, can be utilized to provide extra details to effectively augment a small training dataset for many applications. In this paper, a novel conditional generative adversarial network (GAN) framework for synthesising photorealistic facial images using small training data and limited line features is proposed, where sparse line features are expected to simulate abstract and incomplete handdrawn sketches for introducing diversity in the augmented facial images. The proposed GAN framework can automatically recover the lost information caused by incomplete input features, which has been proved to efficiently reduce unexpected distortions but enhance data diversity with controllable sparse line features. Experimental results have demonstrated that the proposed method with a very small dataset, 50 training images only, can generate images of higher quality than the traditional translationmethods and preserve essential details to synthesise diverse but realistic facial images. Compared to the state-of-the-art methods, the proposed GAN framework can generate more photorealistic facial images using controllable sparse line features in terms of higher FID and KID scores as well as preference evaluation by human perception.
引用
收藏
页码:547 / 558
页数:12
相关论文
共 50 条
  • [31] AdvMask: A sparse adversarial attack-based data augmentation method for image classification
    Yang, Suorong
    Li, Jinqiao
    Zhang, Tianyue
    Zhao, Jian
    Shen, Furao
    PATTERN RECOGNITION, 2023, 144
  • [32] Data augmentation using image translation for underwater sonar image segmentation
    Lee, Eon-ho
    Park, Byungjae
    Jeon, Myung-Hwan
    Jang, Hyesu
    Kim, Ayoung
    Lee, Sejin
    PLOS ONE, 2022, 17 (08):
  • [33] Satellite image classification using sparse codes of multiple features
    Sheng, Guofeng
    Yang, Wen
    Chen, Lijun
    Sun, Hong
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 952 - 955
  • [34] Facial Expression Recognition using Convolutional Neural Network with Data Augmentation
    Ahmed, Tawsin Uddin
    Hossain, Sazzad
    Hossain, Mohammad Shahadat
    Ul Islam, Raihan
    Andersson, Karl
    2019 JOINT 8TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2019 3RD INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR) WITH INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING (ABC), 2019, : 336 - 341
  • [35] Application of multidomain sensor image fusion and training data augmentation for enhanced CNN image classification
    Arnous, Ferris, I
    Narayanan, Ram M.
    Li, Bing C.
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
  • [36] Locating facial features in image sequences using neural networks
    Reinders, MJT
    Koch, RWC
    Gerbrands, JJ
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, : 230 - 235
  • [37] Small, sparse, but substantial: techniques for segmenting small agricultural fields using sparse ground data
    Marvaniya, Smit
    Devi, Umamaheswari
    Hazra, Jagabondhu
    Mujumdar, Shashank
    Gupta, Nitin
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (04) : 1512 - 1534
  • [38] Robust facial expression recognition of a speaker using thermal image processing and updating of fundamental training data
    Nakanishi, Yuu
    Yoshitomi, Yasunari
    Asada, Taro
    Tabuse, Masayoshi
    ARTIFICIAL LIFE AND ROBOTICS, 2013, 17 (3-4) : 342 - 349
  • [39] On Gradient Descent Training Under Data Augmentation with On-Line Noisy Copies
    Hagiwara, Katsuyuki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (09) : 1537 - 1545
  • [40] Multi-scale features fusion from sparse LiDAR data and single image for depth completion
    Wang, Benzhang
    Feng, Yiliu
    Liu, Hengzhu
    ELECTRONICS LETTERS, 2018, 54 (24) : 1375 - 1376