Context-Aware Memory Attention Network for Video-Based Action Recognition

被引:1
|
作者
Koh, Thean Chun [1 ]
Yeo, Chai Kiat [1 ]
Vaitesswar, U. S. [1 ]
Jing, Xuan [2 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] NCS Pte Ltd, NEXT Prod & Platform, Singapore, Singapore
关键词
Action Recognition; Deep Learning; Convolutional Neural Network; Attention;
D O I
10.1109/IVMSP54334.2022.9816216
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Human action recognition is very popularly researched in the computer vision community. The current challenge is to render it efficient enough for wide deployment. In this paper, we propose a human action recognition model which does not require optical flow extraction and 3D convolution, called Context-Aware Memory Attention Network (CAMA-Net). It consists of an attention module called Context-Aware Memory Attention Module which is used to calculate the relevance score between the key and value pairs from the backbone output. The proposed method is evaluated and tested on popular public action recognition datasets, UCF101 and HMDB51. The results demonstrate the strength of our proposed model as it outperforms existing baseline models.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Towards efficient video-based action recognition: context-aware memory attention network
    Koh, Thean Chun
    Yeo, Chai Kiat
    Jing, Xuan
    Sivadas, Sunil
    [J]. SN APPLIED SCIENCES, 2023, 5 (12)
  • [2] Towards efficient video-based action recognition: context-aware memory attention network
    Thean Chun Koh
    Chai Kiat Yeo
    Xuan Jing
    Sunil Sivadas
    [J]. SN Applied Sciences, 2023, 5
  • [3] Context-Aware Attention Network for Human Emotion Recognition in Video
    Liu, Xiaodong
    Wang, Miao
    [J]. ADVANCES IN MULTIMEDIA, 2020, 2020
  • [4] Context-aware attention network for image recognition
    Leng, Jiaxu
    Liu, Ying
    Chen, Shang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2019, 31 (12): : 9295 - 9305
  • [5] Context-aware attention network for image recognition
    Jiaxu Leng
    Ying Liu
    Shang Chen
    [J]. Neural Computing and Applications, 2019, 31 : 9295 - 9305
  • [6] CANet: Comprehensive Attention Network for video-based action recognition
    Gao, Xiong
    Chang, Zhaobin
    Ran, Xingcheng
    Lu, Yonggang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 296
  • [7] Context-aware Cascade Attention-based RNN for Video Emotion Recognition
    Sun, Man-Chin
    Hsu, Shih-Huan
    Yang, Min-Chun
    Chien, Jen-Hsien
    [J]. 2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018,
  • [8] Stacked Multimodal Attention Network for Context-Aware Video Captioning
    Zheng, Yi
    Zhang, Yuejie
    Feng, Rui
    Zhang, Tao
    Fan, Weiguo
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 31 - 42
  • [9] Video-Based Eye Tracking to Detect the Attention Shift: A Computer Classroom Context-Aware System
    Kuo, Yung-Lung
    Lee, Jiann-Shu
    Hsieh, Min-Chai
    [J]. INTERNATIONAL JOURNAL OF DISTANCE EDUCATION TECHNOLOGIES, 2014, 12 (04) : 66 - 81
  • [10] Context-Aware Cross-Attention for Skeleton-Based Human Action Recognition
    Fan, Yanbo
    Weng, Shuchen
    Zhang, Yong
    Shi, Boxin
    Zhang, Yi
    [J]. IEEE ACCESS, 2020, 8 (08): : 15280 - 15290