Temporal based Emotion Recognition inspired by Activity Recognition models

Cited by: 2
Authors
Mohan, Balaganesh [1 ]
Popa, Mirela [1 ]
Affiliations
[1] Maastricht Univ, Fac Sci & Engn, Maastricht, Netherlands
Keywords
Temporal shift module (TSM); Vision transformers; Emotion recognition; Action recognition
DOI
10.1109/ACIIW52867.2021.9666356
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Affective computing is a subset of the larger field of human-computer interaction, with important connections to cognitive processes and an influence on learning, decision-making, and perception. Among the many means of communication, facial expressions are one of the most widely accepted channels for emotion modulation and have received increased attention in recent years. Modeling the temporal dimension is an important factor in recognizing them successfully. This paper therefore investigates whether current state-of-the-art action recognition techniques can be applied to the human emotion recognition task. Two architectures were investigated: a CNN-based model using the Temporal Shift Module (TSM), which can learn spatio-temporal features in 3D data with the computational complexity of a 2D CNN, and a video-based vision transformer employing spatio-temporal self-attention. The models were trained and tested on the CREMA-D dataset, demonstrating state-of-the-art performance with mean class accuracies of 82% and 77%, respectively, and outperforming the best previous approaches by at least 3.5%.
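To illustrate how a temporal shift can give a 2D CNN access to neighboring frames at no extra multiply cost, here is a minimal NumPy sketch. It is not the authors' implementation: the function name `temporal_shift`, the `(T, C, H, W)` layout, and the `fold_div` parameter are assumptions chosen for illustration. A fraction of the channels is moved one frame backward in time, an equal fraction one frame forward, and the rest are left in place, so a per-frame 2D convolution applied afterwards mixes information across adjacent frames.

```python
import numpy as np


def temporal_shift(x: np.ndarray, fold_div: int = 8) -> np.ndarray:
    """Shift a fraction of channels along the time axis (illustrative sketch).

    x has shape (T, C, H, W): T frames with C feature channels each.
    fold_div is a hypothetical parameter: C // fold_div channels are shifted
    in each temporal direction; the remaining channels are unchanged.
    """
    t, c, h, w = x.shape
    fold = c // fold_div
    out = np.zeros_like(x)
    # Group 1: frame t receives these channels from frame t + 1.
    out[:-1, :fold] = x[1:, :fold]
    # Group 2: frame t receives these channels from frame t - 1.
    out[1:, fold:2 * fold] = x[:-1, fold:2 * fold]
    # Remaining channels are copied without any shift.
    out[:, 2 * fold:] = x[:, 2 * fold:]
    return out


# Toy clip: 4 frames, 8 channels, 1x1 spatial size; value = frame * 8 + channel.
clip = np.arange(4 * 8, dtype=np.float32).reshape(4, 8, 1, 1)
shifted = temporal_shift(clip, fold_div=4)
```

Positions that have no source frame (the last frame of the first group, the first frame of the second group) are zero-padded in this sketch. The shift itself only moves data, which is why the subsequent per-frame convolution keeps the cost of a 2D CNN.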
Pages: 8