Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition

被引:14
|
作者
Li, Tianjiao [1 ]
Foo, Lin Geng [1 ]
Ke, Qiuhong [2 ]
Rahmani, Hossein [3 ]
Wang, Anran [4 ]
Wang, Jinghua [5 ]
Liu, Jun [1 ]
机构
[1] Singapore Univ Technol & Design, ISTD Pillar, Singapore, Singapore
[2] Monash Univ, Dept Data Sci & AI, Melbourne, Vic, Australia
[3] Univ Lancaster, Sch Comp & Commun, Lancaster, England
[4] ByteDance, Beijing, Peoples R China
[5] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
来源
基金
新加坡国家研究基金会;
关键词
Action recognition; Fine-grained; Dynamic neural networks; HUMAN NEURAL SYSTEM; FACE; REPRESENTATIONS; IDENTITY;
D O I
10.1007/978-3-031-19772-7_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of fine-grained action recognition is to successfully discriminate between action categories with subtle differences. To tackle this, we derive inspiration from the human visual system which contains specialized regions in the brain that are dedicated towards handling specific tasks. We design a novel Dynamic Spatio-Temporal Specialization (DSTS) module, which consists of specialized neurons that are only activated for a subset of samples that are highly similar. During training, the loss forces the specialized neurons to learn discriminative fine-grained differences to distinguish between these similar samples, improving fine-grained recognition. Moreover, a spatio-temporal specialization method further optimizes the architectures of the specialized neurons to capture either more spatial or temporal fine-grained information, to better tackle the large range of spatio-temporal variations in the videos. Lastly, we design an Upstream-Downstream Learning algorithm to optimize our model's dynamic decisions during training, improving the performance of our DSTS module. We obtain state-of-the-art performance on two widely-used fine-grained action recognition datasets.
引用
收藏
页码:386 / 403
页数:18
相关论文
共 50 条
  • [31] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
  • [32] JOINT LEARNING ON THE HIERARCHY REPRESENTATION FOR FINE-GRAINED HUMAN ACTION RECOGNITION
    Leong, Mei Chee
    Tan, Hui Li
    Zhang, Haosong
    Li, Liyuan
    Lin, Feng
    Lim, Joo Hwee
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1059 - 1063
  • [33] Fine-grained skeleton action recognition with pairwise motion salience learning
    Li H.
    Tu Z.
    Xie W.
    Zhang J.
    Scientia Sinica Informationis, 2023, 53 (12) : 2440 - 2457
  • [34] TaiChi: A Fine-Grained Action Recognition Dataset
    Sun, Shan
    Wang, Feng
    Liang, Qi
    He, Liang
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 434 - 438
  • [35] CTM: Cross-time temporal module for fine-grained action recognition
    Qian, Huifang
    Zhang, Jialun
    Yi, Jianping
    Shi, Zhenyu
    Zhang, Yimin
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 244
  • [36] Learning Dynamic Spatio-Temporal Relations for Human Activity Recognition
    Liu, Zhenyu
    Yao, Yaqiang
    Liu, Yan
    Zhu, Yuening
    Tao, Zhenchao
    Wang, Lei
    Feng, Yuhong
    IEEE ACCESS, 2020, 8 : 130340 - 130352
  • [37] A Spatio-temporal Feature Learning Approach for Dynamic Scene Recognition
    Ullah, Ihsan
    Petrosino, Alfredo
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 591 - 598
  • [38] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
    Gao, Junyu
    Chen, Mengyuan
    Xu, Changsheng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19967 - 19977
  • [39] A Spatio-temporal Graph Transformer driven model for recognizing fine-grained data human activity
    Mao, Yan
    Zhang, Guoyin
    Ye, Cuicui
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 104 : 31 - 45
  • [40] Fine-grained predicting urban crowd flows with adaptive spatio-temporal graph convolutional network
    Yang, Xu
    Zhu, Qiang
    Li, Peihao
    Chen, Pengpeng
    Niu, Qiang
    NEUROCOMPUTING, 2021, 446 : 95 - 105