Deep Learning for Spatio-Temporal Modeling of Dynamic Spontaneous Emotions

被引:25
|
作者
Al Chanti, Dawood [1 ,2 ]
Caplier, Alice [1 ,2 ]
机构
[1] Univ Grenoble Alpes, CNRS, Grenoble INP, F-38000 Grenoble, France
[2] Univ Grenoble Alpes, Inst Engn, GIPSA Lab, Image & Signal Proc Dept, F-38000 Grenoble, France
关键词
Spatiotemporal phenomena; Visualization; Face recognition; Face; Videos; Machine learning; Computational modeling; 3D-CNN; ConvLSTM; deep learning; dynamic emotion; facial expression; SPP-net; spatiotemporal features; FACIAL EXPRESSION RECOGNITION;
D O I
10.1109/TAFFC.2018.2873600
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial expressions involve dynamic morphological changes in a face, conveying information about the expresser's feelings. Each emotion has a specific spatial deformation over the face and temporal profile with distinct time segments. We aim at modeling the human dynamic emotional behavior by taking into consideration the visual content of the face and its evolution. But emotions can both speed-up or slow-down, therefore it is important to incorporate information from the local neighborhood frames (short-term dependencies) and the global setting (long-term dependencies) to summarize the segment context despite of its time variations. A 3D-Convolutional Neural Networks (3D-CNN) is used to learn early local spatiotemporal features. The 3D-CNN is designed to capture subtle spatiotemporal changes that may occur on the face. Then, a Convolutional-Long-Short-Term-Memory (ConvLSTM) network is designed to learn semantic information by taking into account longer spatiotemporal dependencies. The ConvLSTM network helps considering the global visual saliency of the expression. That is locating and learning features in space and time that stand out from their local neighbors in order to signify distinctive facial expression features along the entire sequence. Non-variant representations based on aggregating global spatiotemporal features at increasingly fine resolutions are then done using a weighted Spatial Pyramid Pooling layer.
引用
收藏
页码:363 / 376
页数:14
相关论文
共 50 条
  • [1] Deep learning for spatio-temporal modeling: Dynamic traffic flows and high frequency trading
    Dixon, Matthew F.
    Polson, Nicholas G.
    Sokolov, Vadim O.
    [J]. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2019, 35 (03) : 788 - 807
  • [2] Deep Learning Based Video Spatio-Temporal Modeling for Emotion Recognition
    Fonnegra, Ruben D.
    Diaz, Gloria M.
    [J]. HUMAN-COMPUTER INTERACTION: THEORIES, METHODS, AND HUMAN ISSUES, HCI INTERNATIONAL 2018, PT I, 2018, 10901 : 397 - 408
  • [3] Spatio-temporal deep learning fire smoke detection
    Wu Fan
    Wang Hui-qin
    Wang Ke
    [J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2021, 36 (08) : 1186 - 1195
  • [4] Deep learning of spatio-temporal information for visual tracking
    Gwangmin Choe
    Ilmyong Son
    Chunhwa Choe
    Hyoson So
    Hyokchol Kim
    Gyongnam Choe
    [J]. Multimedia Tools and Applications, 2022, 81 : 17283 - 17302
  • [5] Dynamic spatio-temporal flow modeling with raster DEMs
    Nilsson, Hampus
    Pilesjo, Petter
    Hasan, Abdulghani
    Persson, Andreas
    [J]. TRANSACTIONS IN GIS, 2022, 26 (03) : 1572 - 1588
  • [6] Spatio-Temporal Deep Learning for Robotic Visuomotor Control
    Pierre, John M.
    [J]. CONFERENCE PROCEEDINGS OF 2018 4TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2018, : 94 - 103
  • [7] Deep learning of spatio-temporal information for visual tracking
    Choe, Gwangmin
    Son, Ilmyong
    Choe, Chunhwa
    So, Hyoson
    Kim, Hyokchol
    Choe, Gyongnam
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (12) : 17283 - 17302
  • [8] Terahertz spatio-temporal deep learning computed tomography
    Hung, Yi-Chun
    Chao, Ta-Hsuan
    Yu, Pojen
    Yang, Shang-Hua
    [J]. OPTICS EXPRESS, 2022, 30 (13) : 22523 - 22537
  • [9] Online Spatio-Temporal Learning in Deep Neural Networks
    Bohnstingl, Thomas
    Wozniak, Stanislaw
    Pantazi, Angeliki
    Eleftheriou, Evangelos
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8894 - 8908
  • [10] Deep Learning for Spatio-Temporal Data Mining: A Survey
    Wang, Senzhang
    Cao, Jiannong
    Yu, Philip S.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3681 - 3700