Ordinal information based facial expression intensity estimation for emotional interaction: a novel semi-supervised deep learning approach

被引:2
|
作者
Xu, Ruyi [1 ]
Han, Jiaxu [2 ]
Chen, Jingying [1 ,2 ,3 ]
机构
[1] Cent China Normal Univ, Natl Engn Res Ctr Educ Big Data, 152 Luoyu Rd, Wuhan 430079, Hubei, Peoples R China
[2] Cent China Normal Univ, Natl Engn Res Ctr Elearning, 152 Luoyu Rd, Wuhan 430079, Hubei, Peoples R China
[3] Ningbo Yuxing Educ Technol Co Ltd, Ningbo 315200, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Expression intensity estimation; Siamese network; Semi-supervised learning; Ordinal regression; Social interaction analysis; RECOGNITION; SIAMESE;
D O I
10.1007/s00607-022-01140-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Emotional understanding and expression plays a critical role in social interaction. To analyze children's emotional interaction automatically, this study focuses on developing a novel network architecture and a reliable algorithm for expression intensity estimation to measure children's facial expression responses to emotional stimuli. The facial expression intensity variation provides temporal dynamic information of facial behavior, which is critical to interpreting the meaning of expression. In order to avoid laborious manual annotations for expression intensity, existing unsupervised methods attempt to identify relative intensity using ordinal information within a facial expression sequence; however, they fail to estimate absolute intensity accurately. Moreover, appropriate features are needed to represent the continuous appearance changes caused by expression intensity to improve the model's ability to distinguish subtle differences in expression. This study therefore presents a novel semi-supervised method to estimate expression intensity using salient deep learning features. First, the facial expression is represented by the difference response of the convolutional neural network backbone between the target expression and its responding neutral expression, with the goal of suppressing the effects of expression-unrelated features on expression intensity estimation. Then, the pairwise data constructed with ordinal information is input into a Siamese network with a combined hinge loss that guides learning the relative intensity on unlabeled pairwise frames, the absolute intensity of a few labeled key frames, and the intensity range of most unlabeled frames. The average pearson correlation coefficient, intraclass correlation coefficient, and mean absolute error are 0.7683, 0.7405, and 0.1698 on the extended Cohn-Kanade dataset (CK+), and 0.7804, 0.6684, and 0.1864 on the Binghamton University 4D Facial Expression Dataset using the proposed method, results that are superior to the state of the art. The cross-dataset experiment indicates that the proposed method is promising for the analysis of children's emotional interactions.
引用
收藏
页码:1121 / 1138
页数:18
相关论文
共 50 条
  • [1] Ordinal information based facial expression intensity estimation for emotional interaction: a novel semi-supervised deep learning approach
    Ruyi Xu
    Jiaxu Han
    Jingying Chen
    [J]. Computing, 2024, 106 : 1121 - 1138
  • [2] GONet: A Semi-Supervised Deep Learning Approach For Traversability Estimation
    Hirose, Noriaki
    Sadeghian, Amir
    Vazquez, Marynel
    Goebel, Patrick
    Savarese, Silvio
    [J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 3044 - 3051
  • [3] Semi-supervised Learning of Deep Difference Features for Facial Expression Recognition
    Xu, Can
    Xu, Ruyi
    Chen, Jingying
    Liu, Leyuan
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 245 - 254
  • [4] Facial landmark detection by semi-supervised deep learning
    Tang, Xin
    Guo, Fang
    Shen, Jianbing
    Du, Tianyuan
    [J]. NEUROCOMPUTING, 2018, 297 : 22 - 32
  • [5] Facial Expression Intensity Estimation Using Ordinal Information
    Zhao, Rui
    Gan, Quan
    Wang, Shangfei
    Ji, Qiang
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3466 - 3474
  • [6] Semi-Supervised Deep Neural Network for Joint Intensity Estimation of Multiple Facial Action Units
    Zhang, Yong
    Fan, Yanbo
    Dong, Weiming
    Hu, Bao-Gang
    Ji, Qiang
    [J]. IEEE ACCESS, 2019, 7 : 150743 - 150756
  • [7] Toward a Semi-Supervised Learning Approach to Phylogenetic Estimation
    Silvestro, Daniele
    Latrille, Thibault
    Salamin, Nicolas
    [J]. SYSTEMATIC BIOLOGY, 2024,
  • [8] Trend-Aware Supervision: On Learning Invariance for Semi-Supervised Facial Action Unit Intensity Estimation
    Chen, Yingjie
    Zhang, Jiarui
    Wang, Tao
    Liang, Yun
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 483 - 491
  • [9] Semi-Supervised Adaptive Label Distribution Learning for Facial Age Estimation
    Hou, Peng
    Geng, Xin
    Huo, Zeng-Wei
    Lv, Jia-Qi
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2015 - 2021
  • [10] A Novel Semi-Supervised Learning Approach to Pedestrian Reidentification
    Han, Hua
    Ma, Wenjin
    Zhou, MengChu
    Guo, Qiang
    Abusorrah, Abdullah
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) : 3042 - 3052