Metric-Based Attention Feature Learning for Video Action Recognition

Cited by: 9
Authors
Kim, Dae Ha [1 ]
Anvarov, Fazliddin [1 ]
Lee, Jun Min [1 ]
Song, Byung Cheol [1 ]
Affiliations
[1] Inha Univ, Dept Elect & Comp Engn, Incheon 22212, South Korea
Keywords
Feature extraction; Measurement; Three-dimensional displays; Task analysis; Two dimensional displays; Licenses; Kernel; Body action recognition; 3D CNN; attention map learning; distance metric learning;
DOI
10.1109/ACCESS.2021.3064934
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Conventional approaches to video action recognition learn feature maps with 3D convolutional neural networks (CNNs). To improve recognition, they exploit the representation power of 3D CNNs by training on large-scale video datasets. However, action recognition remains a challenging task. Because previous methods rarely distinguish the human body from the environment, they often overfit to background scenes. Separating the human body from the background allows a model to learn distinct representations of human action. This paper proposes a novel attention module that focuses only on the action part(s) while neglecting non-action part(s) such as the background. First, the attention module employs a triplet loss to differentiate active features from non-active or less active features. Second, two attention modules, based on the spatial and channel domains, are proposed to enhance the feature representation ability for action recognition. The spatial attention module learns the spatial correlation of features, and the channel attention module learns the channel correlation. Experimental results show that the proposed method achieves state-of-the-art performance of 41.41% and 55.21% on the Diving48 and Something-V1 datasets, respectively. In addition, the proposed method provides competitive performance on the UCF-101 and HMDB-51 datasets: 95.83% on UCF-101 and 74.33% on HMDB-51.
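The abstract names three ingredients: a triplet loss, a spatial attention module, and a channel attention module. The sketch below illustrates the general form of each in plain Python. It is not the authors' implementation: the abstract does not specify the distance function, the margin, or how the attention gates are computed, so the Euclidean distance, the `margin=1.0` default, the sigmoid gating, and all function names here are assumptions chosen for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss: max(0, d(a, p) - d(a, n) + margin).

    Assumption: Euclidean distance and margin=1.0; the abstract gives neither.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(fmap):
    """Reweight each channel by a sigmoid gate on its mean activation.

    fmap is a list of channels, each a flat list of spatial activations.
    Hypothetical stand-in for a learned channel attention module.
    """
    gates = [sigmoid(sum(ch) / len(ch)) for ch in fmap]
    return [[g * v for v in ch] for g, ch in zip(gates, fmap)]


def spatial_attention(fmap):
    """Reweight each spatial position by a sigmoid gate on its cross-channel mean.

    Hypothetical stand-in for a learned spatial attention module.
    """
    n_ch, n_pos = len(fmap), len(fmap[0])
    gates = [sigmoid(sum(ch[i] for ch in fmap) / n_ch) for i in range(n_pos)]
    return [[gates[i] * ch[i] for i in range(n_pos)] for ch in fmap]


if __name__ == "__main__":
    # Toy feature vectors: the "active" positive lies near the anchor,
    # the "non-active" negative lies far from it.
    print(triplet_loss([0.0, 0.0], [0.1, 0.0], [3.0, 4.0]))
    # Toy feature map with 2 channels and 2 spatial positions.
    fmap = [[0.0, 4.0], [0.0, 0.0]]
    print(channel_attention(fmap))
    print(spatial_attention(fmap))
```

In this toy form, the loss is zero once the negative is at least `margin` farther from the anchor than the positive, which is the sense in which a triplet loss pushes active and non-active features apart.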
Pages: 39218-39228
Number of pages: 11