Spatial-Temporal Recurrent Neural Network for Emotion Recognition

被引:233
|
作者
Zhang, Tong [1 ,2 ]
Zheng, Wenming [3 ]
Cui, Zhen [4 ]
Zong, Yuan [3 ]
Li, Yang [1 ,2 ]
机构
[1] Southeast Univ, Key Lab Child Dev & Learning Sci, Minist Educ, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Dept Informat Sci & Engn, Nanjing 210096, Jiangsu, Peoples R China
[3] Southeast Univ, Res Ctr Learning Sci, Minist Educ, Key Lab Child Dev & Learning Sci, Nanjing 210096, Jiangsu, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Electroencephalogram (EEG) emotion recognition; emotion recognition; facial expression recognition; spatial- temporal recurrent neural network (STRNN);
D O I
10.1109/TCYB.2017.2788081
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a novel deep learning framework, called spatial-temporal recurrent neural network (STRNN), to integrate the feature learning from both spatial and temporal information of signal sources into a unified spatial-temporal dependency model. In STRNN, to capture those spatially co-occurrent variations of human emotions, a multidirectional recurrent neural network (RNN) layer is employed to capture long-range contextual cues by traversing the spatial regions of each temporal slice along different directions. Then a hi-directional temporal RNN layer is further used to learn the discriminative features characterizing the temporal dependencies of the sequences, where sequences are produced from the spatial RNN layer. To further select those salient regions with more discriminative ability for emotion recognition, we impose sparse projection onto those hidden states of spatial and temporal domains to improve the model discriminant ability. Consequently, the proposed two-layer RNN model provides an effective way to make use of both spatial and temporal dependencies of the input signals for emotion recognition. Experimental results on the public emotion datasets of electroencephalogram and facial expression demonstrate the proposed STRNN method is more competitive over those state-of-the-art methods.
引用
收藏
页码:839 / 847
页数:9
相关论文
共 50 条
  • [1] Spatial-Temporal Feature Fusion Neural Network for EEG-Based Emotion Recognition
    Wang, Zhe
    Wang, Yongxiong
    Zhang, Jiapeng
    Hu, Chuanfei
    Yin, Zhong
    Song, Yu
    IEEE Transactions on Instrumentation and Measurement, 2022, 71
  • [2] Spatial-Temporal Feature Fusion Neural Network for EEG-Based Emotion Recognition
    Wang, Zhe
    Wang, Yongxiong
    Zhang, Jiapeng
    Hu, Chuanfei
    Yin, Zhong
    Song, Yu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [3] Convolution spatial-temporal attention network for EEG emotion recognition
    Cao, Lei
    Yu, Binlong
    Dong, Yilin
    Liu, Tianyu
    Li, Jie
    Physiological Measurement, 2024, 45 (12)
  • [4] Sparse Spatial-Temporal Emotion Graph Convolutional Network for Video Emotion Recognition
    Liu, Xiaodong
    Xu, Huating
    Wang, Miao
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [5] Recurrent Spatial-Temporal Attention Network for Action Recognition in Videos
    Du, Wenbin
    Wang, Yali
    Qiao, Yu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1347 - 1360
  • [6] Spatial-Temporal Recurrent Neural Network for Anomalous Trajectories Detection
    Cheng, Yunyao
    Wu, Bin
    Song, Li
    Shi, Chuan
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2019, 2019, 11888 : 565 - 578
  • [7] A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction
    Zhang, Kao
    Chen, Zhenzhong
    Liu, Shan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 572 - 587
  • [8] Recurrent attention network using spatial-temporal relations for action recognition
    Zhang, Mingxing
    Yang, Yang
    Ji, Yanli
    Xie, Ning
    Shen, Fumin
    SIGNAL PROCESSING, 2018, 145 : 137 - 145
  • [9] SPRNN: A spatial-temporal recurrent neural network for crowd flow prediction
    Tang, Gaozhong
    Li, Bo
    Dai, Hong-Ning
    Zheng, Xi
    INFORMATION SCIENCES, 2022, 614 : 19 - 34
  • [10] Mask Adaptive Spatial-Temporal Recurrent Neural Network for Traffic Forecasting
    Hu, Xingbang
    Zhang, Shuo
    Zhang, Wenbo
    Huang, Hejiao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT V, PAKDD 2024, 2024, 14649 : 259 - 270