Facial Image Augmentation from Sparse Line Features Using Small Training Data

被引：3

作者：

Hung, Shih-Kai ^{[1
]}

Gan, John Q. ^{[1
]}

机构：

[1] Univ Essex, Sch Comp Sci & Elect Engn, Colchester, Essex, England

来源：

ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I | 2021年 / 12861卷

关键词：

Data augmentation; Convolutional neural networks; Generative adversarial networks (GANs);

D O I：

10.1007/978-3-030-85030-2_45

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Data collection is expensive in many research fields. Data augmentation from a very small dataset, such as synthesising realistic images from limited or incomplete information available from a small number of sample images, is still an enormous challenge using deep convolutional neural networks that traditionally require a large number of training data to achieve reasonable performance. For the purpose of manipulating the synthetic results with diversity, line features, which can be easily obtained through computer vision, hand-drawn lines, or customerdesigned sketches, can be utilized to provide extra details to effectively augment a small training dataset for many applications. In this paper, a novel conditional generative adversarial network (GAN) framework for synthesising photorealistic facial images using small training data and limited line features is proposed, where sparse line features are expected to simulate abstract and incomplete handdrawn sketches for introducing diversity in the augmented facial images. The proposed GAN framework can automatically recover the lost information caused by incomplete input features, which has been proved to efficiently reduce unexpected distortions but enhance data diversity with controllable sparse line features. Experimental results have demonstrated that the proposed method with a very small dataset, 50 training images only, can generate images of higher quality than the traditional translationmethods and preserve essential details to synthesise diverse but realistic facial images. Compared to the state-of-the-art methods, the proposed GAN framework can generate more photorealistic facial images using controllable sparse line features in terms of higher FID and KID scores as well as preference evaluation by human perception.

引用

页码：547 / 558

页数：12

共 50 条

[31] AdvMask: A sparse adversarial attack-based data augmentation method for image classification
Yang, Suorong
Li, Jinqiao
Zhang, Tianyue
Zhao, Jian
Shen, Furao
PATTERN RECOGNITION, 2023, 144
[32] Data augmentation using image translation for underwater sonar image segmentation
Lee, Eon-ho
Park, Byungjae
Jeon, Myung-Hwan
Jang, Hyesu
Kim, Ayoung
Lee, Sejin
PLOS ONE, 2022, 17 (08):
[33] Satellite image classification using sparse codes of multiple features
Sheng, Guofeng
Yang, Wen
Chen, Lijun
Sun, Hong
2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 952 - 955
[34] Facial Expression Recognition using Convolutional Neural Network with Data Augmentation
Ahmed, Tawsin Uddin
Hossain, Sazzad
Hossain, Mohammad Shahadat
Ul Islam, Raihan
Andersson, Karl
2019 JOINT 8TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2019 3RD INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR) WITH INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING (ABC), 2019, : 336 - 341
[35] Application of multidomain sensor image fusion and training data augmentation for enhanced CNN image classification
Arnous, Ferris, I
Narayanan, Ram M.
Li, Bing C.
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
[36] Locating facial features in image sequences using neural networks
Reinders, MJT
Koch, RWC
Gerbrands, JJ
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, : 230 - 235
[37] Small, sparse, but substantial: techniques for segmenting small agricultural fields using sparse ground data
Marvaniya, Smit
Devi, Umamaheswari
Hazra, Jagabondhu
Mujumdar, Shashank
Gupta, Nitin
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (04) : 1512 - 1534
[38] Robust facial expression recognition of a speaker using thermal image processing and updating of fundamental training data
Nakanishi, Yuu
Yoshitomi, Yasunari
Asada, Taro
Tabuse, Masayoshi
ARTIFICIAL LIFE AND ROBOTICS, 2013, 17 (3-4) : 342 - 349
[39] On Gradient Descent Training Under Data Augmentation with On-Line Noisy Copies
Hagiwara, Katsuyuki
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (09) : 1537 - 1545
[40] Multi-scale features fusion from sparse LiDAR data and single image for depth completion
Wang, Benzhang
Feng, Yiliu
Liu, Hengzhu
ELECTRONICS LETTERS, 2018, 54 (24) : 1375 - 1376

← 1 2 3 4 5 →