Deep Metric Learning for Human Action Recognition with SlowFast Networks

被引:1
|
作者
Shi, Shanmeng [1 ]
Jung, Cheolkon [1 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Human action recognition; deep metric learning; multi-similarity loss; SlowFast networks;
D O I
10.1109/VCIP53242.2021.9675393
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose deep metric learning for human action recognition with SlowFast networks. We adopt SlowFast Networks to extract slow-changing spatial semantic information of a single target entity in the spatial domain with fast-changing motion information in the temporal domain. Since deep metric learning is able to learn the class difference between human actions, we utilize deep metric learning to learn a mapping from the original video to the compact features in the embedding space. The proposed network consists of three main parts: 1) two branches independently operating at low and high frame rates to extract spatial and temporal features; 2) feature fusion of the two branches; 3) joint training network of deep metric learning and classification loss. Experimental results on the KTH human action dataset demonstrate that the proposed method achieves faster runtime with less model size than C3D and R3D, while ensuring high accuracy.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Deep metric learning for open-set human action recognition in videos
    Matheus Gutoski
    André Eugênio Lazzaretti
    Heitor Silvério Lopes
    [J]. Neural Computing and Applications, 2021, 33 : 1207 - 1220
  • [2] Deep metric learning for open-set human action recognition in videos
    Gutoski, Matheus
    Lazzaretti, Andre Eugenio
    Lopes, Heitor Silverio
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04): : 1207 - 1220
  • [3] Efficient dual attention SlowFast networks for video action recognition
    Wei, Dafeng
    Tian, Ye
    Wei, Liqing
    Zhong, Hong
    Chen, Siqian
    Pu, Shiliang
    Lu, Hongtao
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 222
  • [4] Deep Learning for Human Action Recognition
    Shekokar, R. U.
    Kale, S. N.
    [J]. 2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [5] SlowFast Networks for Video Recognition
    Feichtenhofer, Christoph
    Fan, Haoqi
    Malik, Jitendra
    He, Kaiming
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6201 - 6210
  • [6] Slowfast Diversity-aware Prototype Learning for Egocentric Action Recognition
    Dai, Guangzhao
    Shu, Xiangbo
    Yan, Rui
    Huang, Peng
    Tang, Jinhui
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7549 - 7558
  • [7] Robust Human Activity Recognition based on Deep Metric Learning
    Abdu-Aguye, Mubarak G.
    Gomaa, Walid
    [J]. ICINCO: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL 1, 2019, : 656 - 663
  • [8] Human Action Recognition by Learning Spatio-Temporal Features With Deep Neural Networks
    Wang, Lei
    Xu, Yangyang
    Cheng, Jun
    Xia, Haiying
    Yin, Jianqin
    Wu, Jiaji
    [J]. IEEE ACCESS, 2018, 6 : 17913 - 17922
  • [9] Human Action Recognition Using Deep Neural Networks
    Koli, Rashmi R.
    Bagban, Tanveer, I
    [J]. PROCEEDINGS OF THE 2020 FOURTH WORLD CONFERENCE ON SMART TRENDS IN SYSTEMS, SECURITY AND SUSTAINABILITY (WORLDS4 2020), 2020, : 376 - 380
  • [10] Survey on deep learning methods in human action recognition
    Koohzadi, Maryam
    Charkari, Nasrollah Moghadam
    [J]. IET COMPUTER VISION, 2017, 11 (08) : 623 - 632