Facial Landmark Feature Fusion in Transfer Learning of Child Facial Expressions

被引:1
|
作者
Witherow, Megan A. [1 ]
Samad, Manar D. [2 ]
Diawara, Norou [3 ]
Iftekharuddin, Khan M. [1 ]
机构
[1] Old Dominion Univ, Dept Elect & Comp Engn, Vis Lab, Norfolk, VA 23529 USA
[2] Tennessee State Univ, Dept Comp Sci, Nashville, TN 37203 USA
[3] Old Dominion Univ, Dept Math & Stat, Norfolk, VA 23529 USA
来源
基金
美国国家科学基金会;
关键词
Facial expression recognition; transfer learning; feature fusion; facial landmarks; child facial expressions;
D O I
10.1117/12.2641898
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic classification of child facial expressions is challenging due to the scarcity of image samples with annotations. Transfer learning of deep convolutional neural networks (CNNs), pretrained on adult facial expressions, can be effectively finetuned for child facial expression classification using limited facial images of children. Recent work inspired by facial age estimation and age-invariant face recognition proposes a fusion of facial landmark features with deep representation learning to augment facial expression classification performance. We hypothesize that deep transfer learning of child facial expressions may also benefit from fusing facial landmark features. Our proposed model architecture integrates two input branches: a CNN branch for image feature extraction and a fully connected branch for processing landmark-based features. The model-derived features of these two branches are concatenated into a latent feature vector for downstream expression classification. The architecture is trained on an adult facial expression classification task. Then, the trained model is finetuned to perform child facial expression classification. The combined feature fusion and transfer learning approach is compared against multiple models: training on adult expressions only (adult baseline), child expression only (child baseline), and transfer learning from adult to child data. We also evaluate the classification performance of feature fusion without transfer learning on model performance. Training on child data, we find that feature fusion improves the 10-fold cross validation mean accuracy from 80.32% to 83.72% with similar variance. Proposed fine-tuning with landmark feature fusion of child expressions yields the best mean accuracy of 85.14%, a more than 30% improvement over the adult baseline and nearly 5% improvement over the child baseline.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Feature fusion for facial landmark detection
    Perakis, Panagiotis
    Theoharis, Theoharis
    Kakadiaris, Ioannis A.
    PATTERN RECOGNITION, 2014, 47 (09) : 2783 - 2793
  • [2] Feature Fusion for Facial Landmark Point Location
    Zhang, Gang
    Chen, Jiansheng
    PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 2, 2018, 16 : 33 - 41
  • [3] Transfer Learning Approach to Multiclass Classification of Child Facial Expressions
    Witherow, Megan A.
    Samad, Manar D.
    Iftekharuddin, Khan M.
    APPLICATIONS OF MACHINE LEARNING, 2019, 11139
  • [4] Facial Expressions Recognition Based on Delaunay Triangulation of Landmark and Machine Learning
    Ayeche, Farid
    Alti, Adel
    TRAITEMENT DU SIGNAL, 2021, 38 (06) : 1575 - 1586
  • [5] Deep Adaptation of Adult-Child Facial Expressions by Fusing Landmark Features
    Witherow, Megan A.
    Samad, Manar D.
    Diawara, Norou
    Bar, Haim Y.
    Iftekharuddin, Khan M.
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 847 - 858
  • [6] Landmark calibration for facial expressions and fish classification
    Chaturvedi, Iti
    Chen, Qian
    Cambria, Erik
    McConnell, Desmond
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (02) : 377 - 384
  • [7] Landmark calibration for facial expressions and fish classification
    Iti Chaturvedi
    Qian Chen
    Erik Cambria
    Desmond McConnell
    Signal, Image and Video Processing, 2022, 16 : 377 - 384
  • [8] Visible-to-Thermal Transfer Learning for Facial Landmark Detection
    Poster, Domenick D.
    Hu, Shuowen
    Short, Nathan J.
    Riggan, Benjamin S.
    Nasrabadi, Nasser M.
    IEEE ACCESS, 2021, 9 : 52759 - 52772
  • [9] Production of Facial Expressions Using Facial Feature Positioning and Deformation
    Sheu, Jia-Shing
    Wu, Ying-Ming
    Chuang, Yung-Chuan
    Hsiao, Ying-Tung
    Chen, Ching-Guo
    2012 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2012, : 1158 - 1163
  • [10] USING SPARSE CODING FOR LANDMARK LOCALIZATION IN FACIAL EXPRESSIONS
    Cuculo, Vittorio
    Lanzarotti, Raffaella
    Boccignone, Giuseppe
    2014 5TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP 2014), 2014,