Fusing HOG and convolutional neural network spatial-temporal features for video-based facial expression recognition

Cited by: 28
Author
Pan, Xianzhang [1]
Affiliation
[1] Taizhou Univ, Inst Intelligent Informat Proc, Taizhou 318000, Peoples R China
Keywords
computer vision; face recognition; feature extraction; support vector machines; emotion recognition; convolutional neural nets; video signal processing; convolutional neural network spatial-temporal features; video-based facial expression recognition; VFER; fundamental feature; visual features; comprehensive feature; video frame; HOG features; facial expressions; CNN shallow features; INFORMATION;
DOI
10.1049/iet-ipr.2019.0293
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Video-based facial expression recognition (VFER) is a fundamental component of various computer vision applications. Visual features are the key factors in facial expression recognition. However, the gap between visual features and emotions is large. To bridge this gap, the proposed method utilises convolutional neural networks (CNNs) and the histogram of oriented gradients (HOG) to obtain a more comprehensive feature for VFER. First, it extracts shallow features from the video frame through a number of convolutional kernels in the CNN; these features are invariant to displacement, scale and deformation. Then, HOG is applied to the CNN's shallow features to extract HOG features, which are strongly correlated with facial expressions. Finally, a support vector machine (SVM) performs the facial expression recognition. Extensive experiments on the RML, CK+ and AFEW 5.0 databases show that this framework achieves promising performance and outperforms the state of the art.
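The first two stages described in the abstract (shallow convolutional feature maps, then HOG computed on those maps) can be illustrated with a minimal pure-Python sketch. It is not the paper's implementation: the function names `conv2d_valid` and `hog_descriptor`, the two hand-picked edge kernels (standing in for learned CNN kernels), the 4x4 cell size, the 9 orientation bins and the per-cell L2 normalisation are all assumptions made for illustration, and the SVM stage is omitted.

```python
import math

def conv2d_valid(image, kernel):
    """Valid-mode 2D cross-correlation followed by ReLU.
    Stands in for one shallow CNN feature map (kernel is NOT learned here)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = []
    for y in range(out_h):
        row = []
        for x in range(out_w):
            s = sum(image[y + i][x + j] * kernel[i][j]
                    for i in range(kh) for j in range(kw))
            row.append(max(0.0, s))  # ReLU
        out.append(row)
    return out

def hog_descriptor(fmap, cell=4, bins=9):
    """Unsigned-gradient HOG over non-overlapping cells.
    Assumed simplification: L2 normalisation per cell, no overlapping blocks."""
    h, w = len(fmap), len(fmap[0])
    desc = []
    for cy in range(0, h - cell + 1, cell):
        for cx in range(0, w - cell + 1, cell):
            hist = [0.0] * bins
            for y in range(cy, cy + cell):
                for x in range(cx, cx + cell):
                    # Central differences, clamped at the map borders
                    gx = fmap[y][min(x + 1, w - 1)] - fmap[y][max(x - 1, 0)]
                    gy = fmap[min(y + 1, h - 1)][x] - fmap[max(y - 1, 0)][x]
                    mag = math.hypot(gx, gy)
                    angle = math.degrees(math.atan2(gy, gx)) % 180.0
                    b = min(int(angle / (180.0 / bins)), bins - 1)
                    hist[b] += mag  # magnitude-weighted orientation vote
            norm = math.sqrt(sum(v * v for v in hist)) + 1e-6
            desc.extend(v / norm for v in hist)
    return desc

# Synthetic 10x10 "frame" with a vertical edge (illustrative data only)
frame = [[0.0] * 5 + [1.0] * 5 for _ in range(10)]
kernels = [
    [[-1.0, 0.0, 1.0]] * 3,                  # responds to vertical edges
    [[-1.0] * 3, [0.0] * 3, [1.0] * 3],      # responds to horizontal edges
]

# Concatenate the HOG descriptors of all feature maps into one frame-level vector
features = []
for k in kernels:
    features.extend(hog_descriptor(conv2d_valid(frame, k)))
```

In a full pipeline along the lines of the abstract, vectors like `features` would be computed for the frames of each video and passed to an SVM classifier; how the paper aggregates frames and configures the SVM is not stated in this record.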
Pages: 176-182
Number of pages: 7
Related papers
50 in total
  • [1] Spatial-temporal pyramid based Convolutional Neural Network for action recognition
    Zheng, Zhenxing
    An, Gaoyun
    Wu, Dapeng
    Ruan, Qiuqi
    [J]. NEUROCOMPUTING, 2019, 358 : 446 - 455
  • [2] A Deep Spatial and Temporal Aggregation Framework for Video-Based Facial Expression Recognition
    Pan, Xianzhang
    Ying, Guoliang
    Chen, Guodong
    Li, Hongming
    Li, Wenshu
    [J]. IEEE ACCESS, 2019, 7 : 48807 - 48815
  • [3] Deep Temporal-Spatial Aggregation for Video-Based Facial Expression Recognition
    Pan, Xianzhang
    Guo, Wenping
    Guo, Xiaoying
    Li, Wenshu
    Xu, Junjie
    Wu, Jinzhao
    [J]. SYMMETRY-BASEL, 2019, 11 (01):
  • [4] Spatial-Temporal Graph Convolutional Network for Video-based Person Re-identification
    Yang, Jinrui
    Zheng, Wei-Shi
    Yang, Qize
    Chen, Ying-Cong
    Tian, Qi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3286 - 3296
  • [5] Video-based facial expression recognition using multimodal deep convolutional neural networks
    Pan, Xian-Zhang
    Zhang, Shi-Qing
    Guo, Wen-Ping
    [J]. Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2019, 27 (04): : 963 - 970
  • [6] Golf video tracking based on recognition with HOG and spatial-temporal vector
    Li Weixian
    Lou Xiaoping
    Dong Mingli
    Zhu Lianqing
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2017, 14 (03):
  • [7] GEOSPATIAL-TEMPORAL CONVOLUTIONAL NEURAL NETWORK FOR VIDEO-BASED PRECIPITATION INTENSITY RECOGNITION
    Lin, Chih-Wei
    Yang, Suhui
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1119 - 1123
  • [8] Video-Based Facial Expression Recognition using Deep Temporal-Spatial Networks
    Pan, Xianzhang
    Zhang, Shiqing
    Guo, WenPing
    Zhao, Xiaoming
    Chuang, Yuelong
    Chen, Ying
    Zhang, Haibo
    [J]. IETE TECHNICAL REVIEW, 2020, 37 (04) : 402 - 409
  • [9] Full Convolutional Network Based on Spatial-Temporal Features for the Video Eye Fixation Prediction
    Shi, Jiuchen
    Sun, Meijun
    Wang, Zheng
    Zhang, Dong
    [J]. Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2019, 52 (10): : 1062 - 1068
  • [10] A Mix Fusion Spatial-Temporal Network for Facial Expression Recognition
    Shu, Chang
    Xue, Feng
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT V, 2024, 14429 : 315 - 326