High-Level Geometry-based Features of Video Modality for Emotion Prediction

Cited by: 13
Authors
Weber, Raphael [1]
Barrielle, Vincent [2]
Soladie, Catherine [1]
Seguier, Renaud [1]
Affiliations
[1] IETR, FAST, CentraleSupelec, Ave Boulaie, F-35576 Cesson Sevigne, France
[2] Dynamixyz, 80 Ave Buttes de Coesmes, F-35700 Rennes, France
Keywords
HEAD POSE ESTIMATION; FACIAL EXPRESSIONS; REPRESENTATION;
DOI
10.1145/2988257.2988262
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The automatic analysis of emotion remains a challenging task in unconstrained experimental conditions. In this paper, we present our contribution to the 6th Audio/Visual Emotion Challenge (AVEC 2016), which aims at predicting the continuous emotional dimensions of arousal and valence. First, we propose to improve the performance of the multi-modal prediction with low-level features by adding high-level geometry-based features, namely head pose and expression signature. The head pose is estimated by fitting a reference 3D mesh to the 2D facial landmarks. The expression signature is the projection of the facial landmarks in an unsupervised person-specific model. Second, we propose to fuse the unimodal predictions trained on each training subject before performing the multimodal fusion. The results show that our high-level features improve the performance of the multi-modal prediction of arousal and that the subjects fusion works well in unimodal prediction but generalizes poorly in multimodal prediction, particularly on valence.
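The two-stage fusion described in the abstract (fuse the unimodal predictions obtained from each training subject, then fuse across modalities) can be illustrated with a minimal sketch. The abstract does not state which fusion rule the authors used, so the plain mean over subjects and the weighted mean over modalities below are assumptions, and the function names `fuse_subjects` and `fuse_modalities` are hypothetical.

```python
from statistics import fmean


def fuse_subjects(per_subject_preds):
    """Average frame-wise predictions from models trained on different subjects.

    per_subject_preds: list of equal-length sequences, one per training subject.
    The plain mean is an assumed fusion rule, not the paper's stated method.
    """
    n_frames = len(per_subject_preds[0])
    if any(len(p) != n_frames for p in per_subject_preds):
        raise ValueError("all prediction sequences must have the same length")
    return [fmean(frame) for frame in zip(*per_subject_preds)]


def fuse_modalities(unimodal_preds, weights):
    """Weighted average of one prediction sequence per modality.

    unimodal_preds: dict mapping modality name -> prediction sequence.
    weights: dict mapping the same modality names -> non-negative weight
    (assumed to be chosen beforehand, e.g. on a development set).
    """
    if set(unimodal_preds) != set(weights):
        raise ValueError("weights must cover exactly the given modalities")
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    lengths = {len(p) for p in unimodal_preds.values()}
    if len(lengths) != 1:
        raise ValueError("all modalities must have the same number of frames")
    n_frames = lengths.pop()
    return [
        sum(weights[m] * unimodal_preds[m][t] for m in unimodal_preds) / total
        for t in range(n_frames)
    ]


# Toy arousal predictions for two frames, from two per-subject models per modality.
video = fuse_subjects([[0.0, 0.5], [0.5, 1.0]])
audio = fuse_subjects([[0.25, 0.25], [0.75, 0.25]])
fused = fuse_modalities({"video": video, "audio": audio},
                        {"video": 1.0, "audio": 1.0})
```

In this sketch the subject-level fusion is performed independently for each modality, so the multimodal stage only ever sees one prediction sequence per modality.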
Pages: 51-58
Page count: 8
Related papers
50 in total
  • [21] The geometry of high-level visual representations
    Kriegeskorte, Nikolaus
    I-PERCEPTION, 2014, 5 (04): : 412 - 412
  • [22] The Geometry of High-Level Colour Space
    Muchhala, Mubaraka
    Scott-Samuel, Nick
    Baddeley, Roland
    PERCEPTION, 2019, 48 : 192 - 192
  • [23] Geometry-based algorithm for the prediction of nonpathologic mandibular movement
    Lemoine, Jeremy J.
    Xia, James J.
    Andersen, Clark R.
    Gateno, Bs Jaime
    Buford, William, Jr.
    Liebschner, Michael A. K.
    JOURNAL OF ORAL AND MAXILLOFACIAL SURGERY, 2007, 65 (12) : 2411 - 2417
  • [24] A geometry-based slip prediction model for planetary rovers
    Ma, Hao
    Yang, Huan
    Li, Qunzhi
    Liu, Shaochuang
    COMPUTERS & ELECTRICAL ENGINEERING, 2020, 86
  • [25] Geometry-based estimation of occlusions from video frame pairs
    Ince, S
    Konrad, J
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 933 - 936
  • [26] Geometry-based Motion Vector Scaling for Omnidirectional Video Coding
    Ghaznavi-Youvalari, Ramin
    Aminlou, Alireza
    2018 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2018), 2018, : 127 - 130
  • [27] Geometry-based Partitioning for Predictive Video Coding with Transform Adaptation
    Blaeser, Max
    Schneider, Jens
    Sauer, Johannes
    Wien, Mathias
    2018 PICTURE CODING SYMPOSIUM (PCS 2018), 2018, : 134 - 138
  • [28] A Novel Hand Gesture Recognition Based on High-Level Features
    Li, Jing
    Wang, Jianxin
    Ju, Zhaojie
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2018, 15 (02)
  • [29] Motion Enhanced Model Based on High-Level Spatial Features
    Wu, Yang
    Guo, Lei
    Dai, Xiaodong
    Zhang, Bin
    Park, Dong-Won
    Ma, Ming
    Computers, Materials and Continua, 2022, 73 (03): : 5911 - 5924
  • [30] Motion Enhanced Model Based on High-Level Spatial Features
    Wu, Yang
    Guo, Lei
    Dai, Xiaodong
    Zhang, Bin
    Park, Dong-Won
    Ma, Ming
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 5911 - 5924