DUAL FOCUS ATTENTION NETWORK FOR VIDEO EMOTION RECOGNITION

被引:0
|
作者
Qiu, Haonan [1 ]
He, Liang [1 ]
Wang, Feng [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Video emotion recognition; attention for video; deep learning;
D O I
10.1109/icme46284.2020.9102808
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Video emotion recognition is a challenging task due to complex scenes and various forms of emotion expression. Most existing works focus on fusing multiple features over the whole video clips. According to our observations, given a long video clip, the emotion is usually presented by only several actions/objects in a few short snippets, and the meaningful cues are buried in the noisy background. When human judging the emotion in videos, we first find the informative clips and then closely look for emotional cues in the frames. In this paper, we propose Dual Focus Attention Network to mimic this process. First, three kinds of features including action, object, and scene are extracted from videos. Second, Two attention modules are used to focus on the visual features of the videos from temporal and spatial dimensions respectively. With our dual focus attention network, we can effectively discover the most emotional frames along the time dimension and the most emotional visual cues in each frame. Our experiments conducted on two widely used datasets Ekman and VideoEmotion show that our proposed approach outperforms the existing approaches.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Emotion recognition with attention mechanism-guided dual-feature multi-path interaction network
    Li, Yaxuan
    Guo, Wenhui
    Wang, Yanjiang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 617 - 626
  • [32] Sparse Spatial-Temporal Emotion Graph Convolutional Network for Video Emotion Recognition
    Liu, Xiaodong
    Xu, Huating
    Wang, Miao
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [33] DualSANet: Dual Spatial Attention Network for Iris Recognition
    Yang, Kai
    Xu, Zihao
    Fei, Jingjing
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 888 - 896
  • [34] Video multimodal emotion recognition based on Bi-GRU and attention fusion
    Huan, Ruo-Hong
    Shu, Jia
    Bao, Sheng-Lin
    Liang, Rong-Hua
    Chen, Peng
    Chi, Kai-Kai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 8213 - 8240
  • [35] Video multimodal emotion recognition based on Bi-GRU and attention fusion
    Ruo-Hong Huan
    Jia Shu
    Sheng-Lin Bao
    Rong-Hua Liang
    Peng Chen
    Kai-Kai Chi
    Multimedia Tools and Applications, 2021, 80 : 8213 - 8240
  • [36] Multipath Attention and Adaptive Gating Network for Video Action Recognition
    Haiping Zhang
    Zepeng Hu
    Dongjin Yu
    Liming Guan
    Xu Liu
    Conghao Ma
    Neural Processing Letters, 56
  • [37] Multipath Attention and Adaptive Gating Network for Video Action Recognition
    Zhang, Haiping
    Hu, Zepeng
    Yu, Dongjin
    Guan, Liming
    Liu, Xu
    Ma, Conghao
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [38] SDAN: Stacked Diverse Attention Network for Video Action Recognition
    Zhu, Xiaoguang
    Huang, Siran
    Fan, Wenjing
    Cheng, Yuhao
    Shao, Huaqing
    Liu, Peilin
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [39] Emotion Recognition via Multiscale Feature Fusion Network and Attention Mechanism
    Jiang, Yiye
    Xie, Songyun
    Xie, Xinzhou
    Cui, Yujie
    Tang, Hao
    IEEE SENSORS JOURNAL, 2023, 23 (10) : 10790 - 10800
  • [40] Speech Emotion Recognition via Multi-Level Attention Network
    Liu, Ke
    Wang, Dekui
    Wu, Dongya
    Liu, Yutao
    Feng, Jun
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2278 - 2282